Modelos de classificação em fraudes financeiras: comparação de desempenho em casos de crime de smurfing

Detalhes bibliográficos
Ano de defesa: 2022
Autor(a) principal: Alcino, Matheus Saraiva
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Lavras
Programa de Pós-Graduação em Estatística e Experimentação Agropecuária
UFLA
brasil
Departamento de Estatística
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: http://repositorio.ufla.br/jspui/handle/1/49447
Resumo: The difficulty in identifying financial fraud is directly related to technological advances, as the new possibilities of forms of financial transactions, in turn, generate new forms of fraudulent agents to act. In this context, the aim of this study is to explore the theoretical construction of six machine learning (ML) models, in addition to comparing them through specific performance evaluation metrics. Furthermore, this work develops an algorithm to detect a type of financial crime known as smurfing. This algorithm does not use ML techniques, but aims to classify financial transactions as possible fraud through the analysis of pooled data. Given the impossibility of using real financial data, due to its confidentiality, this work is using simulated data. Two different scenarios were generated, both highly unbalanced, in which the behavior of financial fraud varies according to specific parameters. The chosen classification models were logistic model, Fuzzy Rule Based Systems, Artificial Neural Networks, Random Forest, Extreme Gradient Reinforcement and Support Vector Machine. The comparison of the models in the different scenarios was done through a combination of the metrics Area Under de Curve, Recall and Fb , once data are imbalanced. The results showed that the Random Forest and Extreme Gradient Boosting models had the best performances, therefore, it is believed that the use of such models in real data, even with different parameters, can help in tracking illegal financial transactions and identifying fraudsters