Combined unsupervised and semi-supervised learning for data classification

Bibliographic Details
Main Author: Breve, Fabricio Aparecido [UNESP]
Publication Date: 2016
Other Authors: Pedronette, Daniel Carlos Guimaraes [UNESP]
Format: Conference object
Language: eng
Source: Repositório Institucional da UNESP
Download full: http://dx.doi.org/10.1109/MLSP.2016.7738877
http://hdl.handle.net/11449/173880
Summary: Semi-supervised learning methods exploit both labeled and unlabeled data items in their training process, requiring only a small subset of labeled items. Although capable of drastically reducing the costs of labeling process, such methods are directly dependent on the effectiveness of distance measures used for building the kNN graph. On the other hand, unsupervised distance learning approaches aims at capturing and exploiting the dataset structure in order to compute a more effective distance measure, without the need of any labeled data. In this paper, we propose a combined approach which employs both unsupervised and semi-supervised learning paradigms. An unsupervised distance learning procedure is performed as a pre-processing step for improving the kNN graph effectiveness. Based on the more effective graph, a semi-supervised learning method is used for classification. The proposed Combined Unsupervised and Semi-Supervised Learning (CUSSL) approach is based on very recent methods. The Reciprocal kNN Distance is used for unsupervised distance learning tasks and the semi-supervised learning classification is performed by Particle Competition and Cooperation (PCC). Experimental results conducted in six public datasets demonstrated that the combined approach can achieve effective results, boosting the accuracy of classification tasks.
id UNSP_d3675005b75af96d0c15c9e6473d808a
oai_identifier_str oai:repositorio.unesp.br:11449/173880
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling Combined unsupervised and semi-supervised learning for data classificationData ClassificationSemi-Supervised LearningUnsupervised LearningSemi-supervised learning methods exploit both labeled and unlabeled data items in their training process, requiring only a small subset of labeled items. Although capable of drastically reducing the costs of labeling process, such methods are directly dependent on the effectiveness of distance measures used for building the kNN graph. On the other hand, unsupervised distance learning approaches aims at capturing and exploiting the dataset structure in order to compute a more effective distance measure, without the need of any labeled data. In this paper, we propose a combined approach which employs both unsupervised and semi-supervised learning paradigms. An unsupervised distance learning procedure is performed as a pre-processing step for improving the kNN graph effectiveness. Based on the more effective graph, a semi-supervised learning method is used for classification. The proposed Combined Unsupervised and Semi-Supervised Learning (CUSSL) approach is based on very recent methods. The Reciprocal kNN Distance is used for unsupervised distance learning tasks and the semi-supervised learning classification is performed by Particle Competition and Cooperation (PCC). Experimental results conducted in six public datasets demonstrated that the combined approach can achieve effective results, boosting the accuracy of classification tasks.Department of Statistics Applied Mathematics and Computing (DEMAC) State University of São Paulo (UNESP)Department of Statistics Applied Mathematics and Computing (DEMAC) State University of São Paulo (UNESP)Universidade Estadual Paulista (Unesp)Breve, Fabricio Aparecido [UNESP]Pedronette, Daniel Carlos Guimaraes [UNESP]2018-12-11T17:08:10Z2018-12-11T17:08:10Z2016-11-08info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjecthttp://dx.doi.org/10.1109/MLSP.2016.7738877IEEE International Workshop on Machine Learning for Signal Processing, MLSP, v. 2016-November.2161-03712161-0363http://hdl.handle.net/11449/17388010.1109/MLSP.2016.77388772-s2.0-8500215699456938600255383270000-0002-1123-9784Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengIEEE International Workshop on Machine Learning for Signal Processing, MLSP0,217info:eu-repo/semantics/openAccess2024-11-27T14:10:33Zoai:repositorio.unesp.br:11449/173880Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462024-11-27T14:10:33Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv Combined unsupervised and semi-supervised learning for data classification
title Combined unsupervised and semi-supervised learning for data classification
spellingShingle Combined unsupervised and semi-supervised learning for data classification
Breve, Fabricio Aparecido [UNESP]
Data Classification
Semi-Supervised Learning
Unsupervised Learning
title_short Combined unsupervised and semi-supervised learning for data classification
title_full Combined unsupervised and semi-supervised learning for data classification
title_fullStr Combined unsupervised and semi-supervised learning for data classification
title_full_unstemmed Combined unsupervised and semi-supervised learning for data classification
title_sort Combined unsupervised and semi-supervised learning for data classification
author Breve, Fabricio Aparecido [UNESP]
author_facet Breve, Fabricio Aparecido [UNESP]
Pedronette, Daniel Carlos Guimaraes [UNESP]
author_role author
author2 Pedronette, Daniel Carlos Guimaraes [UNESP]
author2_role author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
dc.contributor.author.fl_str_mv Breve, Fabricio Aparecido [UNESP]
Pedronette, Daniel Carlos Guimaraes [UNESP]
dc.subject.por.fl_str_mv Data Classification
Semi-Supervised Learning
Unsupervised Learning
topic Data Classification
Semi-Supervised Learning
Unsupervised Learning
description Semi-supervised learning methods exploit both labeled and unlabeled data items in their training process, requiring only a small subset of labeled items. Although capable of drastically reducing the costs of labeling process, such methods are directly dependent on the effectiveness of distance measures used for building the kNN graph. On the other hand, unsupervised distance learning approaches aims at capturing and exploiting the dataset structure in order to compute a more effective distance measure, without the need of any labeled data. In this paper, we propose a combined approach which employs both unsupervised and semi-supervised learning paradigms. An unsupervised distance learning procedure is performed as a pre-processing step for improving the kNN graph effectiveness. Based on the more effective graph, a semi-supervised learning method is used for classification. The proposed Combined Unsupervised and Semi-Supervised Learning (CUSSL) approach is based on very recent methods. The Reciprocal kNN Distance is used for unsupervised distance learning tasks and the semi-supervised learning classification is performed by Particle Competition and Cooperation (PCC). Experimental results conducted in six public datasets demonstrated that the combined approach can achieve effective results, boosting the accuracy of classification tasks.
publishDate 2016
dc.date.none.fl_str_mv 2016-11-08
2018-12-11T17:08:10Z
2018-12-11T17:08:10Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1109/MLSP.2016.7738877
IEEE International Workshop on Machine Learning for Signal Processing, MLSP, v. 2016-November.
2161-0371
2161-0363
http://hdl.handle.net/11449/173880
10.1109/MLSP.2016.7738877
2-s2.0-85002156994
5693860025538327
0000-0002-1123-9784
url http://dx.doi.org/10.1109/MLSP.2016.7738877
http://hdl.handle.net/11449/173880
identifier_str_mv IEEE International Workshop on Machine Learning for Signal Processing, MLSP, v. 2016-November.
2161-0371
2161-0363
10.1109/MLSP.2016.7738877
2-s2.0-85002156994
5693860025538327
0000-0002-1123-9784
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv IEEE International Workshop on Machine Learning for Signal Processing, MLSP
0,217
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv repositoriounesp@unesp.br
_version_ 1834483825101504512