A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition

Bibliographic Details
Main Author: Louro, Pedro Lima
Publication Date: 2024
Other Authors: Redinho, Hugo, Malheiro, Ricardo Manuel da Silva, Paiva, Rui Pedro, Panda, Renato
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://hdl.handle.net/10316/114560
https://doi.org/10.3390/s24072201
Summary: Classical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning.
id RCAP_bec050fb1d8f4cf18733d9423e60b9a5
oai_identifier_str oai:estudogeral.uc.pt:10316/114560
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling A Comparison Study of Deep Learning Methodologies for Music Emotion Recognitionmusic information retrievalmusic emotion recognitiondeep learningClassical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning.This work is funded by FCT—Foundation for Science and Technology, I.P., within the scope of the projects: MERGE—PTDC/CCI-COM/3171/2021 financed with national funds (PIDDAC) via the Portuguese State Budget; and project CISUC—UID/CEC/00326/2020 with funds from the European Social Fund, through the Regional Operational Program Centro 2020. Renato Panda was supported by Ci2—FCT UIDP/05567/2020.MDPI2024info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttps://hdl.handle.net/10316/114560https://hdl.handle.net/10316/114560https://doi.org/10.3390/s24072201eng1424-8220https://www.mdpi.com/1424-8220/24/7/2201Louro, Pedro LimaRedinho, HugoMalheiro, Ricardo Manuel da SilvaPaiva, Rui PedroPanda, Renatoinfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2024-04-01T10:32:24Zoai:estudogeral.uc.pt:10316/114560Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T06:07:43.303834Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
title A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
spellingShingle A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
Louro, Pedro Lima
music information retrieval
music emotion recognition
deep learning
title_short A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
title_full A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
title_fullStr A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
title_full_unstemmed A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
title_sort A Comparison Study of Deep Learning Methodologies for Music Emotion Recognition
author Louro, Pedro Lima
author_facet Louro, Pedro Lima
Redinho, Hugo
Malheiro, Ricardo Manuel da Silva
Paiva, Rui Pedro
Panda, Renato
author_role author
author2 Redinho, Hugo
Malheiro, Ricardo Manuel da Silva
Paiva, Rui Pedro
Panda, Renato
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Louro, Pedro Lima
Redinho, Hugo
Malheiro, Ricardo Manuel da Silva
Paiva, Rui Pedro
Panda, Renato
dc.subject.por.fl_str_mv music information retrieval
music emotion recognition
deep learning
topic music information retrieval
music emotion recognition
deep learning
description Classical machine learning techniques have dominated Music Emotion Recognition. However, improvements have slowed down due to the complex and time-consuming task of handcrafting new emotionally relevant audio features. Deep learning methods have recently gained popularity in the field because of their ability to automatically learn relevant features from spectral representations of songs, eliminating such necessity. Nonetheless, there are limitations, such as the need for large amounts of quality labeled data, a common problem in MER research. To understand the effectiveness of these techniques, a comparison study using various classical machine learning and deep learning methods was conducted. The results showed that using an ensemble of a Dense Neural Network and a Convolutional Neural Network architecture resulted in a state-of-the-art 80.20% F1 score, an improvement of around 5% considering the best baseline results, concluding that future research should take advantage of both paradigms, that is, combining handcrafted features with feature learning.
publishDate 2024
dc.date.none.fl_str_mv 2024
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10316/114560
https://hdl.handle.net/10316/114560
https://doi.org/10.3390/s24072201
url https://hdl.handle.net/10316/114560
https://doi.org/10.3390/s24072201
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1424-8220
https://www.mdpi.com/1424-8220/24/7/2201
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv MDPI
publisher.none.fl_str_mv MDPI
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833602585631653888