On deeply learning features for automatic person image re-identification

Franco, Alexandre da Costa e Silva

On deeply learning features for automatic person image re-identification

Detalhes bibliográficos
Ano de defesa:	2016
Autor(a) principal:	Franco, Alexandre da Costa e Silva
Orientador(a):	Oliveira, Luciano Rebouças de
Banca de defesa:	Schnitman, Leizer, Lemes, Rubisley de Paula, Loula, Angelo Conrado, Papa, João Paulo
Tipo de documento:	Tese
Tipo de acesso:	Acesso aberto
Idioma:	por
Instituição de defesa:	Escola Politécnica / Instituto de Matemática
Programa de Pós-Graduação:	Programa de Pós-Graduação em Mecatrônica
Departamento:	Não Informado pela instituição
País:	Brasil
Palavras-chave em Português:	Autonomous robots Polynomial interpolation Standalone navigation Inteligência artificial visão computacional Aprendizagem de máquina
Link de acesso:	http://repositorio.ufba.br/ri/handle/ri/21639
Resumo:	The automatic person re-identification (re-id) problem resides in matching an unknown person image to a database of previously labeled images of people. Among several issues to cope with this research field, person re-id has to deal with person appearance and environment variations. As such, discriminative features to represent a person identity must be robust regardless those variations. Comparison among two image features is commonly accomplished by distance metrics. Although features and distance metrics can be handcrafted or trainable, the latter type has demonstrated more potential to breakthroughs in achieving state-of-the-art performance over public data sets. A recent paradigm that allows to work with trainable features is deep learning, which aims at learning features directly from raw image data. Although deep learning has recently achieved significant improvements in person re-identification, found on some few recent works, there is still room for learning strategies, which can be exploited to increase the current state-of-the-art performance. In this work a novel deep learning strategy is proposed, called here as coarse-to-fine learning (CFL), as well as a novel type of feature, called convolutional covariance features (CCF), for person re-identification. CFL is based on the human learning process. The core of CFL is a framework conceived to perform a cascade network training, learning person image features from generic-to-specific concepts about a person. Each network is comprised of a convolutional neural network (CNN) and a deep belief network denoising autoenconder (DBN-DAE). The CNN is responsible to learn local features, while the DBN-DAE learns global features, robust to illumination changing, certain image deformations, horizontal mirroring and image blurring. After extracting the convolutional features via CFL, those ones are then wrapped in covariance matrices, composing the CCF. CCF and flat features were combined to improve the performance of person re-identification in comparison with component features. The performance of the proposed framework was assessed comparatively against 18 state-of-the-art methods by using public data sets (VIPeR, i-LIDS, CUHK01 and CUHK03), cumulative matching characteristic curves and top ranking references. After a thorough analysis, our proposed framework demonstrated a superior performance.

On deeply learning features for automatic person image re-identification

Registros relacionados