Um estudo sobre o agrupamento de séries temporais e sua aplicação em curvas de carga residenciais

Detalhes bibliográficos
Ano de defesa: 2016
Autor(a) principal: Pedro Leal Pazzini da Silva
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Minas Gerais
UFMG
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: http://hdl.handle.net/1843/BUOS-APWMJD
Resumo: The union of computational, measurement and communication systems applied to electrical power systems will form the Smart Grids. One of its main components are the Smart Meters, which are measurement systems that periodically measure the consumed power in electrical loads, and that send the measurements to the electricity companies. The research, and mainly the installation of smart meters is more developed in the north hemisphere countries, although it alreadystarted in Brazil. When this measures become ubiquitous in the Brazilian electrical power system, a considerable amount of data will be produced, and the extraction of useful information from it will be necessary. One of the possible data mining tasks that could be applied to this data, andwhich is the object of this dissertation, is the clustering of load curves, from which is expected to get information and knowlodge that can enhance the distribution service, the load forecast , the planning of expansion, and others. As the data of the smart meters installed in the Brazilian electrical system is not publicly available, the load curves of Australian residences were clustered. Since that, in this country, the climatic variations that could influence in the consuming profile arevery similar to the climatical brazilian variations. The problem of clustering load profiles is a time series clustering problem, since that for each measurement there is the temporal component associated.Thus, in this master dissertation was presented, initially, a study of clustering of time series, and this study is focused in the preprocessing, dissimilarity measure and algorithm applied, as well as the method of evaluating the results. Initially, after an empirical study in labeled datasets, the general clustering of time series strategy was proposed, and it is composed by: Z normalization in the preprocessing step, CID-DTW as the dissimilarity the hierarchical clustering algorithmusing the average linkage and the silhouette And Calinski-Harabasz index for the evaluation of the obtained partitions. In tha case study, with the australian dataset, such strategy generated highly unbalanced partitions, which motivated the proposition of a multiobjective approach, in which partitions with a good compromise between the validation index values, wich is a measure of the desired low variation intragroup and high variance intergroup, and balanced partitions, in which the variance of the number of instances in each cluster is low. The multi-objective approach indicates the tha max normalization, the k-means algorithm and the euclidean distance is the most appropiate configuration for the clustering of load profiles generated by the smart meters.