Sound Classification and Processing of Urban Environments: A Systematic Literature Review
| Main Author: | |
|---|---|
| Publication Date: | 2022 |
| Other Authors: | , , |
| Format: | Other |
| Language: | eng |
| Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| Download full: | https://hdl.handle.net/10216/145992 |
Summary: | Audio recognition can be used in smart cities for security, surveillance, manufacturing, autonomous vehicles, and noise mitigation, just to name a few. However, urban sounds are everyday audio events that occur daily, presenting unstructured characteristics containing different genres of noise and sounds unrelated to the sound event under study, making it a challenging problem. Therefore, the main objective of this literature review is to summarize the most recent works on this subject to understand the current approaches and identify their limitations. Based on the reviewed articles, it can be realized that Deep Learning (DL) architectures, attention mechanisms, data augmentation techniques, and pretraining are the most crucial factors to consider while creating an efficient sound classification model. The best-found results were obtained by Mushtaq and Su, in 2020, using a DenseNet-161 with pretrained weights from ImageNet, and NA-1 and NA-2 as augmentation techniques, which were of 97.98%, 98.52%, and 99.22% for UrbanSound8K, ESC-50, and ESC-10 datasets, respectively. Nonetheless, the use of these models in real-world scenarios has not been properly addressed, so their effectiveness is still questionable in such situations. |
| id |
RCAP_6590ec136d6ae806460e903aa940858c |
|---|---|
| oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/145992 |
| network_acronym_str |
RCAP |
| network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository_id_str |
https://opendoar.ac.uk/repository/7160 |
| spelling |
Sound Classification and Processing of Urban Environments: A Systematic Literature ReviewCiências Tecnológicas, Ciências da engenharia e tecnologiasTechnological sciences, Engineering and technologyAudio recognition can be used in smart cities for security, surveillance, manufacturing, autonomous vehicles, and noise mitigation, just to name a few. However, urban sounds are everyday audio events that occur daily, presenting unstructured characteristics containing different genres of noise and sounds unrelated to the sound event under study, making it a challenging problem. Therefore, the main objective of this literature review is to summarize the most recent works on this subject to understand the current approaches and identify their limitations. Based on the reviewed articles, it can be realized that Deep Learning (DL) architectures, attention mechanisms, data augmentation techniques, and pretraining are the most crucial factors to consider while creating an efficient sound classification model. The best-found results were obtained by Mushtaq and Su, in 2020, using a DenseNet-161 with pretrained weights from ImageNet, and NA-1 and NA-2 as augmentation techniques, which were of 97.98%, 98.52%, and 99.22% for UrbanSound8K, ESC-50, and ESC-10 datasets, respectively. Nonetheless, the use of these models in real-world scenarios has not been properly addressed, so their effectiveness is still questionable in such situations.2022-112022-11-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/otherapplication/pdfimage/pnghttps://hdl.handle.net/10216/145992eng1424-321010.3390/s22228608Ana Filipa Rodrigues NogueiraHugo S. OliveiraJosé J. M. MachadoJoão Manuel R. S. Tavaresinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-27T19:50:32Zoai:repositorio-aberto.up.pt:10216/145992Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T23:34:20.128477Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
| dc.title.none.fl_str_mv |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| title |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| spellingShingle |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review Ana Filipa Rodrigues Nogueira Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
| title_short |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| title_full |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| title_fullStr |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| title_full_unstemmed |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| title_sort |
Sound Classification and Processing of Urban Environments: A Systematic Literature Review |
| author |
Ana Filipa Rodrigues Nogueira |
| author_facet |
Ana Filipa Rodrigues Nogueira Hugo S. Oliveira José J. M. Machado João Manuel R. S. Tavares |
| author_role |
author |
| author2 |
Hugo S. Oliveira José J. M. Machado João Manuel R. S. Tavares |
| author2_role |
author author author |
| dc.contributor.author.fl_str_mv |
Ana Filipa Rodrigues Nogueira Hugo S. Oliveira José J. M. Machado João Manuel R. S. Tavares |
| dc.subject.por.fl_str_mv |
Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
| topic |
Ciências Tecnológicas, Ciências da engenharia e tecnologias Technological sciences, Engineering and technology |
| description |
Audio recognition can be used in smart cities for security, surveillance, manufacturing, autonomous vehicles, and noise mitigation, just to name a few. However, urban sounds are everyday audio events that occur daily, presenting unstructured characteristics containing different genres of noise and sounds unrelated to the sound event under study, making it a challenging problem. Therefore, the main objective of this literature review is to summarize the most recent works on this subject to understand the current approaches and identify their limitations. Based on the reviewed articles, it can be realized that Deep Learning (DL) architectures, attention mechanisms, data augmentation techniques, and pretraining are the most crucial factors to consider while creating an efficient sound classification model. The best-found results were obtained by Mushtaq and Su, in 2020, using a DenseNet-161 with pretrained weights from ImageNet, and NA-1 and NA-2 as augmentation techniques, which were of 97.98%, 98.52%, and 99.22% for UrbanSound8K, ESC-50, and ESC-10 datasets, respectively. Nonetheless, the use of these models in real-world scenarios has not been properly addressed, so their effectiveness is still questionable in such situations. |
| publishDate |
2022 |
| dc.date.none.fl_str_mv |
2022-11 2022-11-01T00:00:00Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/other |
| format |
other |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/145992 |
| url |
https://hdl.handle.net/10216/145992 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
1424-3210 10.3390/s22228608 |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.format.none.fl_str_mv |
application/pdf image/png |
| dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
| instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| instacron_str |
RCAAP |
| institution |
RCAAP |
| reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
| repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
| repository.mail.fl_str_mv |
info@rcaap.pt |
| _version_ |
1833600209075044352 |