In-car violence detection based on the audio signal
Main Author: | |
---|---|
Publication Date: | 2021 |
Other Authors: | , , , , , |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | https://hdl.handle.net/1822/89901 |
Summary: | When it is intended to detect violence in the car, audio, speech processing, music, and ambient sound are some of the main points of this problem since it is necessary to find the similarities and differences between these domains. The recent increase in interest in deep learning has allowed practical applications in many areas of signal processing, often surpassing traditional signal processing on a large scale. This paper presents a comparative study of state-of-the-art deep learning architectures applied for inside car violence detection based only on the audio signal. The methodology proposed for audio signal representation was Mel-spectrogram, after an in-depth review of the literature. We build an In-Car video dataset in the experiments and apply four different deep learning architectures to solve the classification problem. The results have shown that the ResNet-18 model presents the best accuracy results on the test set. |
id |
RCAP_cf9727a90e8c67be14be3acb4818eb00 |
---|---|
oai_identifier_str |
oai:repositorium.sdum.uminho.pt:1822/89901 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
In-car violence detection based on the audio signalAudio action recognitionAudio violence detectionDeep learningInterior vehicleEngenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e InformáticaWhen it is intended to detect violence in the car, audio, speech processing, music, and ambient sound are some of the main points of this problem since it is necessary to find the similarities and differences between these domains. The recent increase in interest in deep learning has allowed practical applications in many areas of signal processing, often surpassing traditional signal processing on a large scale. This paper presents a comparative study of state-of-the-art deep learning architectures applied for inside car violence detection based only on the audio signal. The methodology proposed for audio signal representation was Mel-spectrogram, after an in-depth review of the literature. We build an In-Car video dataset in the experiments and apply four different deep learning architectures to solve the classification problem. The results have shown that the ResNet-18 model presents the best accuracy results on the test set.This work is supported by: European Structural and Investment Funds in the FEDER component, through the Operational Competitiveness and Internationalization Programme (COMPETE 2020) [Project nº 039334; Funding Reference: POCI-01-0247-FEDER- 039334].Springer, ChamUniversidade do MinhoSantos, FlávioDurães, DalilaMarcondes, Francisco S.Hammerschmidt, NiklasLange, SaschaMachado, José ManuelNovais, Paulo20212021-01-01T00:00:00Zconference paperinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://hdl.handle.net/1822/89901engSantos, F. et al. (2021). In-Car Violence Detection Based on the Audio Signal. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2021. IDEAL 2021. Lecture Notes in Computer Science(), vol 13113. Springer, Cham. https://doi.org/10.1007/978-3-030-91608-4_43978-3-030-91607-70302-974310.1007/978-3-030-91608-4_43978-3-030-91608-4https://link.springer.com/chapter/10.1007/978-3-030-91608-4_43info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-05-11T06:58:17Zoai:repositorium.sdum.uminho.pt:1822/89901Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T16:10:49.167252Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
In-car violence detection based on the audio signal |
title |
In-car violence detection based on the audio signal |
spellingShingle |
In-car violence detection based on the audio signal Santos, Flávio Audio action recognition Audio violence detection Deep learning Interior vehicle Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática |
title_short |
In-car violence detection based on the audio signal |
title_full |
In-car violence detection based on the audio signal |
title_fullStr |
In-car violence detection based on the audio signal |
title_full_unstemmed |
In-car violence detection based on the audio signal |
title_sort |
In-car violence detection based on the audio signal |
author |
Santos, Flávio |
author_facet |
Santos, Flávio Durães, Dalila Marcondes, Francisco S. Hammerschmidt, Niklas Lange, Sascha Machado, José Manuel Novais, Paulo |
author_role |
author |
author2 |
Durães, Dalila Marcondes, Francisco S. Hammerschmidt, Niklas Lange, Sascha Machado, José Manuel Novais, Paulo |
author2_role |
author author author author author author |
dc.contributor.none.fl_str_mv |
Universidade do Minho |
dc.contributor.author.fl_str_mv |
Santos, Flávio Durães, Dalila Marcondes, Francisco S. Hammerschmidt, Niklas Lange, Sascha Machado, José Manuel Novais, Paulo |
dc.subject.por.fl_str_mv |
Audio action recognition Audio violence detection Deep learning Interior vehicle Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática |
topic |
Audio action recognition Audio violence detection Deep learning Interior vehicle Engenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática |
description |
When it is intended to detect violence in the car, audio, speech processing, music, and ambient sound are some of the main points of this problem since it is necessary to find the similarities and differences between these domains. The recent increase in interest in deep learning has allowed practical applications in many areas of signal processing, often surpassing traditional signal processing on a large scale. This paper presents a comparative study of state-of-the-art deep learning architectures applied for inside car violence detection based only on the audio signal. The methodology proposed for audio signal representation was Mel-spectrogram, after an in-depth review of the literature. We build an In-Car video dataset in the experiments and apply four different deep learning architectures to solve the classification problem. The results have shown that the ResNet-18 model presents the best accuracy results on the test set. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021 2021-01-01T00:00:00Z |
dc.type.driver.fl_str_mv |
conference paper |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/1822/89901 |
url |
https://hdl.handle.net/1822/89901 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Santos, F. et al. (2021). In-Car Violence Detection Based on the Audio Signal. In: Yin, H., et al. Intelligent Data Engineering and Automated Learning – IDEAL 2021. IDEAL 2021. Lecture Notes in Computer Science(), vol 13113. Springer, Cham. https://doi.org/10.1007/978-3-030-91608-4_43 978-3-030-91607-7 0302-9743 10.1007/978-3-030-91608-4_43 978-3-030-91608-4 https://link.springer.com/chapter/10.1007/978-3-030-91608-4_43 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Springer, Cham |
publisher.none.fl_str_mv |
Springer, Cham |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833595785345761280 |