A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
Main Author: | |
---|---|
Publication Date: | 2024 |
Other Authors: | , , , |
Format: | Article |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | https://hdl.handle.net/10316/114560 https://doi.org/10.3390/s24072201 |
Summary: | Classical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning. |
id |
RCAP_bec050fb1d8f4cf18733d9423e60b9a5 |
---|---|
oai_identifier_str |
oai:estudogeral.uc.pt:10316/114560 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognitionmusic information retrievalmusic emotion recognitiondeep learningClassical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning.This work is funded by FCT—Foundation for Science and Technology, I.P., within the scope of the projects: MERGE—PTDC/CCI-COM/3171/2021 financed with national funds (PIDDAC) via the Portuguese State Budget; and project CISUC—UID/CEC/00326/2020 with funds from the European Social Fund, through the Regional Operational Program Centro 2020. Renato Panda was supported by Ci2—FCT UIDP/05567/2020.MDPI2024info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttps://hdl.handle.net/10316/114560https://hdl.handle.net/10316/114560https://doi.org/10.3390/s24072201eng1424-8220https://www.mdpi.com/1424-8220/24/7/2201Louro, Pedro LimaRedinho, HugoMalheiro, Ricardo Manuel da SilvaPaiva, Rui PedroPanda, Renatoinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-04-01T10:32:24Zoai:estudogeral.uc.pt:10316/114560Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T06:07:43.303834Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
title |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
spellingShingle |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition Louro, Pedro Lima music information retrieval music emotion recognition deep learning |
title_short |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
title_full |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
title_fullStr |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
title_full_unstemmed |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
title_sort |
A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition |
author |
Louro, Pedro Lima |
author_facet |
Louro, Pedro Lima Redinho, Hugo Malheiro, Ricardo Manuel da Silva Paiva, Rui Pedro Panda, Renato |
author_role |
author |
author2 |
Redinho, Hugo Malheiro, Ricardo Manuel da Silva Paiva, Rui Pedro Panda, Renato |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Louro, Pedro Lima Redinho, Hugo Malheiro, Ricardo Manuel da Silva Paiva, Rui Pedro Panda, Renato |
dc.subject.por.fl_str_mv |
music information retrieval music emotion recognition deep learning |
topic |
music information retrieval music emotion recognition deep learning |
description |
Classical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning. |
publishDate |
2024 |
dc.date.none.fl_str_mv |
2024 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10316/114560 https://hdl.handle.net/10316/114560 https://doi.org/10.3390/s24072201 |
url |
https://hdl.handle.net/10316/114560 https://doi.org/10.3390/s24072201 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
1424-8220 https://www.mdpi.com/1424-8220/24/7/2201 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
MDPI |
publisher.none.fl_str_mv |
MDPI |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833602585631653888 |