Predição de intensidade sonora percebida (loudness ) para áudio espacial

Detalhes bibliográficos
Ano de defesa: 2019
Autor(a) principal: Leandro da Silva Pires
Orientador(a): Não Informado pela instituição
Banca de defesa: Não Informado pela instituição
Tipo de documento: Tese
Tipo de acesso: Acesso aberto
Idioma: por
Instituição de defesa: Universidade Federal de Minas Gerais
Brasil
ENG - DEPARTAMENTO DE ENGENHARIA ELÉTRICA
Programa de Pós-Graduação em Engenharia Elétrica
UFMG
Programa de Pós-Graduação: Não Informado pela instituição
Departamento: Não Informado pela instituição
País: Não Informado pela instituição
Palavras-chave em Português:
Link de acesso: http://hdl.handle.net/1843/30272
Resumo: Loudness control for brodcasting is a common and legally required practice since the International Telecommunication Union (ITU) Recommendation ITUR BS.1770 for objective measurements in multichannel audio. Recommendations and regulations based on the ITU-R algorithm have been published worldwide, including Brazil. There is scope for improving national regulations in light of recent contributions to the field, and also for adapting the ITU-R model to measurements in advanced audio systems. This work pursues these two goals by testing the parameters of the Brazilian standard with a real-time loudness controller using short-form descriptors and by developing a new objective measurement model adapted to the new spatial audio formats. The proposed method performed well compared to other loudness models, although it was purely signal processing based and its readings were not very close to subject responses. The potential benefits of a more perceptually motivated model led to a PhD placement in the Institute of Sound Recording at the University of Surrey (UK), where listening tests were conducted to assess positional parameters of distance, azimuth and elevation, whose results served as a basis for deriving gain correction curves and a new directional weighting for the ITU-R model. General results point to advancements in the regulatory and standardization fronts, either by the elaboration of a strategy to improve the Brazilian standard of loudness, or by comparing this new prediction method with the critical fortune of loudness models through measurements on audio content for multichannel reproduction systems. The developed model resulted in the best trade-off between prediction errors (RMSE*), correlation between predictions and subject responses, and mean run time.