Detalhes bibliográficos
Ano de defesa: |
2005 |
Autor(a) principal: |
Chiovato, Andr? Godoi
 |
Orientador(a): |
Ynoguti, Carlos Alberto
 |
Banca de defesa: |
Ynoguti, Carlos Alberto
,
Silva, Francisco Jos? Fraga da
,
Ramirez, Miguel Arjona
 |
Tipo de documento: |
Dissertação
|
Tipo de acesso: |
Acesso aberto |
Idioma: |
por |
Instituição de defesa: |
Instituto Nacional de Telecomunica??es
|
Programa de Pós-Graduação: |
Mestrado em Engenharia de Telecomunica??es
|
Departamento: |
Instituto Nacional de Telecomunica??es
|
País: |
Brasil
|
Palavras-chave em Português: |
|
Área do conhecimento CNPq: |
|
Link de acesso: |
http://tede.inatel.br:8080/tede/handle/tede/59
|
Resumo: |
Abstract: The goal of this work is to evaluate the distortion of the noisy speech signal being after enhanced by noise-reduction algorithms. This is performed by comparison of word accuracy (%) of a standardized automatic speech recognition (ASR) system and objective measures of perceptual speech quality (PESQ-MOS score), obtained after applying noise-reduction methods. The test scenario, composed of ETSI STQ-aurora DRS working group data base and a standardized ASR system, evaluated the following algoritms: WI008 (ETSI STQ-aurora standard), EMSR (ephraim and malah noise suppressor rule algorithm), NMT-PSS (noise masking threshold - power spectral subtraction) and EMSR + NMT-PSS (EMSR algorithm with the concept of noise masking threshold). Moreover a curve that models the relationship between PESQ-MOS score and recognition rate (%) is proposed. The purpose is to predict, under certain conditions, The system perfomance by means of the PESQ evalution. This approximations is based inthe logistic curve, which configuration parameters have physical meanings, validated by experimental results. Finally, some analysis are presented to indicate the advantages and disadvantages of several noise types present at aurora1 database over recognition system performance. |