Convolutional neural network for distortion Classification in face images.

Detalhes bibliográficos
Ano de defesa: 2021
Autor(a) principal: Pacheco Reina, Patricia Alejandra
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: eng
Instituição de defesa: Biblioteca Digitais de Teses e Dissertações da USP
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
CNN
Link de acesso: https://www.teses.usp.br/teses/disponiveis/3/3141/tde-25102021-151818/
Resumo: Face processing algorithms are becoming more popular in recent days due to the great domain of application in which they can be used. As a consequence, research about the quality of face images is also increasing. The current approach to Face Image Quality Assessment (FIQA) is focused on improving the performance of face recognition systems, as a result, current FIQA algorithms don\'t provide an indication of quality, but a performance estimation for face recognition algorithms. This approach makes the FIQA algorithms potentially unsuited for other scenarios regarding face images, and susceptible to inherit the limitations of face recognition. The present work tackles the main limitations of the current FIQA algorithms by proposing a new approach based on the distortions affecting the images. We developed two models based on Convolutional Neural Networks (CNN), to classify facial images according to the type and the degree of the distortion present in them. The models\' output provides qualitative information about the quality of facial images, useful for face recognition systems, as well as other face processing algorithms. Additionally, the proposed method can be a starting point to image enhancement processes like denoising, and deblurring. Two other contributions can be outlined from this work: a comprehensive study about the impact of blur, noise, brightness, contrast, and JPEG compression in face processing algorithms; and a new dataset for image quality assessment and distortion classification in face images.