Sound Classification and Processing of Urban Environments: A Systematic Literature Review

Bibliographic Details
Main Author: Ana Filipa Rodrigues Nogueira
Publication Date: 2022
Other Authors: Hugo S. Oliveira, José J. M. Machado, João Manuel R. S. Tavares
Format: Other
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://hdl.handle.net/10216/145992
Summary: Audio recognition can be used in smart cities for security, surveillance, manufacturing, autonomous vehicles, and noise mitigation, just to name a few. However, urban sounds are everyday audio events that occur daily, presenting unstructured characteristics containing different genres of noise and sounds unrelated to the sound event under study, making it a challenging problem. Therefore, the main objective of this literature review is to summarize the most recent works on this subject to understand the current approaches and identify their limitations. Based on the reviewed articles, it can be realized that Deep Learning (DL) architectures, attention mechanisms, data augmentation techniques, and pretraining are the most crucial factors to consider while creating an efficient sound classification model. The best-found results were obtained by Mushtaq and Su, in 2020, using a DenseNet-161 with pretrained weights from ImageNet, and NA-1 and NA-2 as augmentation techniques, which were of 97.98%, 98.52%, and 99.22% for UrbanSound8K, ESC-50, and ESC-10 datasets, respectively. Nonetheless, the use of these models in real-world scenarios has not been properly addressed, so their effectiveness is still questionable in such situations.
id RCAP_6590ec136d6ae806460e903aa940858c
oai_identifier_str oai:repositorio-aberto.up.pt:10216/145992
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Sound Classification and Processing of Urban Environments: A Systematic Literature ReviewCiências Tecnológicas, Ciências da engenharia e tecnologiasTechnological sciences, Engineering and technologyAudio recognition can be used in smart cities for security, surveillance, manufacturing, autonomous vehicles, and noise mitigation, just to name a few. However, urban sounds are everyday audio events that occur daily, presenting unstructured characteristics containing different genres of noise and sounds unrelated to the sound event under study, making it a challenging problem. Therefore, the main objective of this literature review is to summarize the most recent works on this subject to understand the current approaches and identify their limitations. Based on the reviewed articles, it can be realized that Deep Learning (DL) architectures, attention mechanisms, data augmentation techniques, and pretraining are the most crucial factors to consider while creating an efficient sound classification model. The best-found results were obtained by Mushtaq and Su, in 2020, using a DenseNet-161 with pretrained weights from ImageNet, and NA-1 and NA-2 as augmentation techniques, which were of 97.98%, 98.52%, and 99.22% for UrbanSound8K, ESC-50, and ESC-10 datasets, respectively. Nonetheless, the use of these models in real-world scenarios has not been properly addressed, so their effectiveness is still questionable in such situations.2022-112022-11-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/otherapplication/pdfimage/pnghttps://hdl.handle.net/10216/145992eng1424-321010.3390/s22228608Ana Filipa Rodrigues NogueiraHugo S. OliveiraJosé J. M. MachadoJoão Manuel R. S. Tavaresinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-27T19:50:32Zoai:repositorio-aberto.up.pt:10216/145992Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T23:34:20.128477Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Sound Classification and Processing of Urban Environments: A Systematic Literature Review
title Sound Classification and Processing of Urban Environments: A Systematic Literature Review
spellingShingle Sound Classification and Processing of Urban Environments: A Systematic Literature Review
Ana Filipa Rodrigues Nogueira
Ciências Tecnológicas, Ciências da engenharia e tecnologias
Technological sciences, Engineering and technology
title_short Sound Classification and Processing of Urban Environments: A Systematic Literature Review
title_full Sound Classification and Processing of Urban Environments: A Systematic Literature Review
title_fullStr Sound Classification and Processing of Urban Environments: A Systematic Literature Review
title_full_unstemmed Sound Classification and Processing of Urban Environments: A Systematic Literature Review
title_sort Sound Classification and Processing of Urban Environments: A Systematic Literature Review
author Ana Filipa Rodrigues Nogueira
author_facet Ana Filipa Rodrigues Nogueira
Hugo S. Oliveira
José J. M. Machado
João Manuel R. S. Tavares
author_role author
author2 Hugo S. Oliveira
José J. M. Machado
João Manuel R. S. Tavares
author2_role author
author
author
dc.contributor.author.fl_str_mv Ana Filipa Rodrigues Nogueira
Hugo S. Oliveira
José J. M. Machado
João Manuel R. S. Tavares
dc.subject.por.fl_str_mv Ciências Tecnológicas, Ciências da engenharia e tecnologias
Technological sciences, Engineering and technology
topic Ciências Tecnológicas, Ciências da engenharia e tecnologias
Technological sciences, Engineering and technology
description Audio recognition can be used in smart cities for security, surveillance, manufacturing, autonomous vehicles, and noise mitigation, just to name a few. However, urban sounds are everyday audio events that occur daily, presenting unstructured characteristics containing different genres of noise and sounds unrelated to the sound event under study, making it a challenging problem. Therefore, the main objective of this literature review is to summarize the most recent works on this subject to understand the current approaches and identify their limitations. Based on the reviewed articles, it can be realized that Deep Learning (DL) architectures, attention mechanisms, data augmentation techniques, and pretraining are the most crucial factors to consider while creating an efficient sound classification model. The best-found results were obtained by Mushtaq and Su, in 2020, using a DenseNet-161 with pretrained weights from ImageNet, and NA-1 and NA-2 as augmentation techniques, which were of 97.98%, 98.52%, and 99.22% for UrbanSound8K, ESC-50, and ESC-10 datasets, respectively. Nonetheless, the use of these models in real-world scenarios has not been properly addressed, so their effectiveness is still questionable in such situations.
publishDate 2022
dc.date.none.fl_str_mv 2022-11
2022-11-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/other
format other
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/145992
url https://hdl.handle.net/10216/145992
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1424-3210
10.3390/s22228608
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
image/png
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833600209075044352