Detecção de fraude em unidades consumidoras não telemedidas com uso de técnicas de aprendizado de máquina

Detalhes bibliográficos
Ano de defesa: 2020
Autor(a) principal: Paulo, Fernanda Rodrigues
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal da Paraíba
Brasil
Engenharia Elétrica
Programa de Pós-Graduação em Engenharia Elétrica
UFPB
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: https://repositorio.ufpb.br/jspui/handle/123456789/18759
Resumo: In 2018, estimates that about 310 TWh were destined to supply irregular connections and measurements in Brazil, approximately R$ 9 billion losses for distributors. The concessionaire of this study faces challenges to detect fraud, mainly due to the volume of data and the limitation on finding patterns without a structured tool. Considering this scenario, the development of an automated methodology is proposed to detect fraud in low voltage customers, without telemetry, using artificial intelligence tools. Information was extracted from the company's database, attributes were implemented, the main variables were selected and then the models were evaluated. The main variable proposed compares the average consumption of the unit with the closest geographic neighbors with similar size characteristics. Variables are also proposed aiming to detect the moment of a reduction in the energy consumption, as well as its value. The most common Machine Learning techniques were tested and four models were proposed: Support Vector Machine was used for consumers with an indication of possible fraud; for residential units without this indication, Gradient Boosting was used; for rural units, Random Forest was used; for the other classes, a Multilayer Perceptron Neural Network was used. The models were qualified based on a new metric, proposed as an alternative to the usual evaluation metrics, which computes the percentage of the energy benefit theoretically recovered by the model in relation to all the energy that could have been recovered. In theoretical tests, it was possible to obtain an accuracy of 39.4%, surpassing 19.5% the current methodology of the company, with 69.8% greater recall. The energy benefit metric also shows that the proposed methodology was able to recover 59.5% of the total amount of energy available, 153.2% higher than the company's current model. New research involves the application of the proposed methodology to the company's base for the classification of the consumers and inspections will be sent to verify the results.