Classificação automática de música utilizando aprendizagem de padrões de votação

Shinohara, Vítor Yudi

Classificação automática de música utilizando aprendizagem de padrões de votação

Bibliographic Details
Main Author:	Shinohara, Vítor Yudi
Publication Date:	2018
Format:	Bachelor thesis
Language:	por
Source:	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
Download full:	http://repositorio.utfpr.edu.br/jspui/handle/1/6008
Summary:	Research in MIR have proposed many automatic genre classification systems using machine learning. In this context, two main approaches have been used to describe music tracks: Single Vector Representation (SVR), which uses a single vector, and Multiple Vector Representation (MVR), which uses multiple vectors. For training MVR models, the track is divided into auditory textures, which are all labeled with the ground truth and presented independently during training. When testing, each texture is classified independently, and then some voting scheme must be used to assign a final label to the entire track. When only the votes are available for each texture, the only option available to assign a final label to the track is majority voting. Other techniques which rely on class probabilities for each texture requires those probabilities to be computed by the classifier during prediction. These probabilities are costly to compute, thus are not always available. We present two novel voting schemes as alternatives to majority voting. Both methods use voting pattern learning where only the predicted class of each texture is known, not class probability distributions. The votes were combined in two different ways: a voting histogram and a vote sequence vector. The histograms were used as feature vectors for both K-Vizinhos Mais Próximos (K-NN) and Máquina de Vetores de Suporte (SVM) classifiers. The vote sequence vectors were used as inputs to sequence modelling with Hidden Markov Model (HMM) and two recurrent neural network architectures. The accuracy and accuracy standard deviation of the classification were computed for both proposed methods. The performance of both were compared to the results of the majority voting technique. Student’s T-Test was used to evaluate which gains were statistically significant. The results show that voting pattern learning is relevant and can provide statistically significant performance gains in some data sets.

Item metadata

id	UTFPR-12_ccfce5f5dccf6a9d3dcb40fa723cc0c5
oai_identifier_str	oai:repositorio.utfpr.edu.br:1/6008
network_acronym_str	UTFPR-12
network_name_str	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
repository_id_str
spelling	Classificação automática de música utilizando aprendizagem de padrões de votaçãoAutomatic classification of music using voting pattern learningAprendizado do computadorMúsicaRedes neurais (Computação)Machine learningMusicNeural networks (Computer science)CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAOResearch in MIR have proposed many automatic genre classification systems using machine learning. In this context, two main approaches have been used to describe music tracks: Single Vector Representation (SVR), which uses a single vector, and Multiple Vector Representation (MVR), which uses multiple vectors. For training MVR models, the track is divided into auditory textures, which are all labeled with the ground truth and presented independently during training. When testing, each texture is classified independently, and then some voting scheme must be used to assign a final label to the entire track. When only the votes are available for each texture, the only option available to assign a final label to the track is majority voting. Other techniques which rely on class probabilities for each texture requires those probabilities to be computed by the classifier during prediction. These probabilities are costly to compute, thus are not always available. We present two novel voting schemes as alternatives to majority voting. Both methods use voting pattern learning where only the predicted class of each texture is known, not class probability distributions. The votes were combined in two different ways: a voting histogram and a vote sequence vector. The histograms were used as feature vectors for both K-Vizinhos Mais Próximos (K-NN) and Máquina de Vetores de Suporte (SVM) classifiers. The vote sequence vectors were used as inputs to sequence modelling with Hidden Markov Model (HMM) and two recurrent neural network architectures. The accuracy and accuracy standard deviation of the classification were computed for both proposed methods. The performance of both were compared to the results of the majority voting technique. Student’s T-Test was used to evaluate which gains were statistically significant. The results show that voting pattern learning is relevant and can provide statistically significant performance gains in some data sets.Pesquisas na área de Music Information Retrieval (MIR) tem proposto métodos de classificação automática de gêneros musicais usando aprendizagem de máquina. Neste contexto surgiram duas abordagens para representação de faixas de áudio: Single Vector Representation (SVR), compostas por apenas um vetor de características e Multiple Vector Representation (MVR), que usa múltiplos vetores na descrição. Para a classificação usando MVR, a faixa é dividida em trechos, denominados texturas, e todos as texturas são rotuladas com o gênero da faixa e apresentadas ao modelo na fase de treino. Para a classificação, cada textura é classificado independentemente, logo, deve se aplicar uma técnica de votação para atribuir um rótulo à faixa completa. As técnicas utilizadas para inferir um rótulo à faixa quando se tem apenas os votos de cada textura são limitadas à votação majoritária simples. Para o uso da votação majoritária ponderada é necessário uma distribuição de probabilidades de cada rótulo para cada textura, cujo custo computacional para estimar é usualmente elevado, e portanto nem sempre as probabilidades estão disponíveis. São propostos dois métodos de combinação de votos alternativos ao voto majoritário simples. Ambos métodos utilizam aprendizagem de padrões de votação onde apenas os votos de texturas são conhecidos, ao invés de distribuições de probabilidade. Os votos foram combinados de duas maneiras: Composição de um histograma de votos e composição de um vetor de sequência de votos. Os histogramas foram submetidos aos classificadores K-Vizinhos Mais Próximos (K-NN) e Máquina de Vetores de Suporte (SVM). Os vetores de sequência de votos foram submetidos aos classificadores Hidden Markov Model (HMM) e duas arquiteturas de redes neurais recorrentes. Foram computados a acurácia e o desvio padrão da acurácia da classificação em gêneros musicais obtidos pelos dois métodos propostos. O desempenho dos métodos propostos foram comparados com os resultados obtidos pelo voto majoritário simples. O teste estatístico T de Student foi usado para avaliar quais foram os ganhos estatisticamente significativos. Os resultados mostram que a aprendizagem de padrões de votação é relevante e pode trazer ganhos estatisticamente significativos em alguns conjuntos de dados.Universidade Tecnológica Federal do ParanáCampo MouraoBrasilDepartamento Acadêmico de ComputaçãoCiência da ComputaçãoUTFPRHübner, RodrigoFoleiss, Juliano HenriqueHübner, RodrigoAlencar, Aretha BarbosaFiorin Junior, LucianoShinohara, Vítor Yudi2020-11-09T19:10:14Z2020-11-09T19:10:14Z2018-11-21info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/bachelorThesisapplication/pdfSHINOHARA, Vítor Yudi. Classificação automática de música utilizando aprendizagem de padrões de votação. 2018. 51 f. Trabalho de Conclusão de Curso (Graduação) – Universidade Tecnológica Federal do Paraná, Campo Mourão, 2018.http://repositorio.utfpr.edu.br/jspui/handle/1/6008porinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))instname:Universidade Tecnológica Federal do Paraná (UTFPR)instacron:UTFPR2020-11-09T19:10:14Zoai:repositorio.utfpr.edu.br:1/6008Repositório InstitucionalPUBhttp://repositorio.utfpr.edu.br:8080/oai/requestriut@utfpr.edu.br \|\| sibi@utfpr.edu.bropendoar:2020-11-09T19:10:14Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)false
dc.title.none.fl_str_mv	Classificação automática de música utilizando aprendizagem de padrões de votação Automatic classification of music using voting pattern learning
title	Classificação automática de música utilizando aprendizagem de padrões de votação
spellingShingle	Classificação automática de música utilizando aprendizagem de padrões de votação Shinohara, Vítor Yudi Aprendizado do computador Música Redes neurais (Computação) Machine learning Music Neural networks (Computer science) CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
title_short	Classificação automática de música utilizando aprendizagem de padrões de votação
title_full	Classificação automática de música utilizando aprendizagem de padrões de votação
title_fullStr	Classificação automática de música utilizando aprendizagem de padrões de votação
title_full_unstemmed	Classificação automática de música utilizando aprendizagem de padrões de votação
title_sort	Classificação automática de música utilizando aprendizagem de padrões de votação
author	Shinohara, Vítor Yudi
author_facet	Shinohara, Vítor Yudi
author_role	author
dc.contributor.none.fl_str_mv	Hübner, Rodrigo Foleiss, Juliano Henrique Hübner, Rodrigo Alencar, Aretha Barbosa Fiorin Junior, Luciano
dc.contributor.author.fl_str_mv	Shinohara, Vítor Yudi
dc.subject.por.fl_str_mv	Aprendizado do computador Música Redes neurais (Computação) Machine learning Music Neural networks (Computer science) CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
topic	Aprendizado do computador Música Redes neurais (Computação) Machine learning Music Neural networks (Computer science) CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO
description	Research in MIR have proposed many automatic genre classification systems using machine learning. In this context, two main approaches have been used to describe music tracks: Single Vector Representation (SVR), which uses a single vector, and Multiple Vector Representation (MVR), which uses multiple vectors. For training MVR models, the track is divided into auditory textures, which are all labeled with the ground truth and presented independently during training. When testing, each texture is classified independently, and then some voting scheme must be used to assign a final label to the entire track. When only the votes are available for each texture, the only option available to assign a final label to the track is majority voting. Other techniques which rely on class probabilities for each texture requires those probabilities to be computed by the classifier during prediction. These probabilities are costly to compute, thus are not always available. We present two novel voting schemes as alternatives to majority voting. Both methods use voting pattern learning where only the predicted class of each texture is known, not class probability distributions. The votes were combined in two different ways: a voting histogram and a vote sequence vector. The histograms were used as feature vectors for both K-Vizinhos Mais Próximos (K-NN) and Máquina de Vetores de Suporte (SVM) classifiers. The vote sequence vectors were used as inputs to sequence modelling with Hidden Markov Model (HMM) and two recurrent neural network architectures. The accuracy and accuracy standard deviation of the classification were computed for both proposed methods. The performance of both were compared to the results of the majority voting technique. Student’s T-Test was used to evaluate which gains were statistically significant. The results show that voting pattern learning is relevant and can provide statistically significant performance gains in some data sets.
publishDate	2018
dc.date.none.fl_str_mv	2018-11-21 2020-11-09T19:10:14Z 2020-11-09T19:10:14Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/bachelorThesis
format	bachelorThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	SHINOHARA, Vítor Yudi. Classificação automática de música utilizando aprendizagem de padrões de votação. 2018. 51 f. Trabalho de Conclusão de Curso (Graduação) – Universidade Tecnológica Federal do Paraná, Campo Mourão, 2018. http://repositorio.utfpr.edu.br/jspui/handle/1/6008
identifier_str_mv	SHINOHARA, Vítor Yudi. Classificação automática de música utilizando aprendizagem de padrões de votação. 2018. 51 f. Trabalho de Conclusão de Curso (Graduação) – Universidade Tecnológica Federal do Paraná, Campo Mourão, 2018.
url	http://repositorio.utfpr.edu.br/jspui/handle/1/6008
dc.language.iso.fl_str_mv	por
language	por
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade Tecnológica Federal do Paraná Campo Mourao Brasil Departamento Acadêmico de Computação Ciência da Computação UTFPR
publisher.none.fl_str_mv	Universidade Tecnológica Federal do Paraná Campo Mourao Brasil Departamento Acadêmico de Computação Ciência da Computação UTFPR
dc.source.none.fl_str_mv	reponame:Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) instname:Universidade Tecnológica Federal do Paraná (UTFPR) instacron:UTFPR
instname_str	Universidade Tecnológica Federal do Paraná (UTFPR)
instacron_str	UTFPR
institution	UTFPR
reponame_str	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
collection	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT))
repository.name.fl_str_mv	Repositório Institucional da UTFPR (da Universidade Tecnológica Federal do Paraná (RIUT)) - Universidade Tecnológica Federal do Paraná (UTFPR)
repository.mail.fl_str_mv	riut@utfpr.edu.br \|\| sibi@utfpr.edu.br
_version_	1850498069968191488

Classificação automática de música utilizando aprendizagem de padrões de votação

Similar Items