Avalia??o da rela??o entre qualidade perceptual da fala e taxa de acerto de sistemas de reconhecimento de fala em ambientes ruidosos

Detalhes bibliográficos
Ano de defesa: 2005
Autor(a) principal: Chiovato, Andr? Godoi lattes
Orientador(a): Ynoguti, Carlos Alberto lattes
Banca de defesa: Ynoguti, Carlos Alberto lattes, Silva, Francisco Jos? Fraga da lattes, Ramirez, Miguel Arjona lattes
Tipo de documento: Dissertação
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Instituto Nacional de Telecomunica??es
Programa de Pós-Graduação: Mestrado em Engenharia de Telecomunica??es
Departamento: Instituto Nacional de Telecomunica??es
País: Brasil
Palavras-chave em Português:
Área do conhecimento CNPq:
Link de acesso: http://tede.inatel.br:8080/tede/handle/tede/59
Resumo: Abstract: The goal of this work is to evaluate the distortion of the noisy speech signal being after enhanced by noise-reduction algorithms. This is performed by comparison of word accuracy (%) of a standardized automatic speech recognition (ASR) system and objective measures of perceptual speech quality (PESQ-MOS score), obtained after applying noise-reduction methods. The test scenario, composed of ETSI STQ-aurora DRS working group data base and a standardized ASR system, evaluated the following algoritms: WI008 (ETSI STQ-aurora standard), EMSR (ephraim and malah noise suppressor rule algorithm), NMT-PSS (noise masking threshold - power spectral subtraction) and EMSR + NMT-PSS (EMSR algorithm with the concept of noise masking threshold). Moreover a curve that models the relationship between PESQ-MOS score and recognition rate (%) is proposed. The purpose is to predict, under certain conditions, The system perfomance by means of the PESQ evalution. This approximations is based inthe logistic curve, which configuration parameters have physical meanings, validated by experimental results. Finally, some analysis are presented to indicate the advantages and disadvantages of several noise types present at aurora1 database over recognition system performance.