The impact of feature selection methods on online handwritten signature by using clustering-based analysis

Detalhes bibliográficos
Ano de defesa: 2021
Autor(a) principal: Marques, Julliana Caroline Gonçalves de Araújo Silva
Orientador(a): Abreu, Marjory Cristiany da Costa
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal do Rio Grande do Norte
Programa de Pós-Graduação: PROGRAMA DE PÓS-GRADUAÇÃO EM SISTEMAS E COMPUTAÇÃO
Departamento: Não Informado pela instituição
País: Brasil
Palavras-chave em Português:
Link de acesso: https://repositorio.ufrn.br/handle/123456789/32052
Resumo: Handwritten signature is one of the oldest and most accepted biometric authentication methods for human identity establishment in society. With the popularisation of computers and, consequently, computational biometric authentication systems, the signature was chosen for being one of the biometric traits of an individual that is likely to be relatively unique for every person. However, when dealing with biometric data, including signature data, problems related to high dimensional space, can be generated. Among other issues, irrelevant, redundant data and noise are the most significant, as they result in a decreased of identification accuracy. Thus, it is necessary to reduce the space by selecting the smallest set of features that contain the most discriminative features, increasing the accuracy of the system. In this way, our proposal in this work is to analyse the impact of feature selection on individuals identification accuracy based on the handwritten online signature. For this, we will use two well-known online signature databases: SVC2004 and xLongSignDB. For the feature selection process, we have applied two filter and one wrapper methods. Then, the resulted datasets are evaluated by classification algorithms and validated with a clustering technique. Besides, we have used a statistical test to corroborate our conclusions. Experiments presented satisfactory results when using a smaller number of features which are more representative, showing that we reached an average accuracy of over 98\% for both datasets which were validated with the clustering methods, which achieved an average accuracy over 80\% (SVC2004) and 70\% (xLongSignDB).