Combined unsupervised and semi-supervised learning for data classification
| Main Author: | |
|---|---|
| Publication Date: | 2016 |
| Other Authors: | |
| Format: | Conference object |
| Language: | eng |
| Source: | Repositório Institucional da UNESP |
| Download full: | http://dx.doi.org/10.1109/MLSP.2016.7738877 http://hdl.handle.net/11449/173880 |
Summary: | Semi-supervised learning methods exploit both labeled and unlabeled data items in their training process, requiring only a small subset of labeled items. Although capable of drastically reducing the costs of labeling process, such methods are directly dependent on the effectiveness of distance measures used for building the kNN graph. On the other hand, unsupervised distance learning approaches aims at capturing and exploiting the dataset structure in order to compute a more effective distance measure, without the need of any labeled data. In this paper, we propose a combined approach which employs both unsupervised and semi-supervised learning paradigms. An unsupervised distance learning procedure is performed as a pre-processing step for improving the kNN graph effectiveness. Based on the more effective graph, a semi-supervised learning method is used for classification. The proposed Combined Unsupervised and Semi-Supervised Learning (CUSSL) approach is based on very recent methods. The Reciprocal kNN Distance is used for unsupervised distance learning tasks and the semi-supervised learning classification is performed by Particle Competition and Cooperation (PCC). Experimental results conducted in six public datasets demonstrated that the combined approach can achieve effective results, boosting the accuracy of classification tasks. |
| id |
UNSP_d3675005b75af96d0c15c9e6473d808a |
|---|---|
| oai_identifier_str |
oai:repositorio.unesp.br:11449/173880 |
| network_acronym_str |
UNSP |
| network_name_str |
Repositório Institucional da UNESP |
| repository_id_str |
2946 |
| spelling |
Combined unsupervised and semi-supervised learning for data classificationData ClassificationSemi-Supervised LearningUnsupervised LearningSemi-supervised learning methods exploit both labeled and unlabeled data items in their training process, requiring only a small subset of labeled items. Although capable of drastically reducing the costs of labeling process, such methods are directly dependent on the effectiveness of distance measures used for building the kNN graph. On the other hand, unsupervised distance learning approaches aims at capturing and exploiting the dataset structure in order to compute a more effective distance measure, without the need of any labeled data. In this paper, we propose a combined approach which employs both unsupervised and semi-supervised learning paradigms. An unsupervised distance learning procedure is performed as a pre-processing step for improving the kNN graph effectiveness. Based on the more effective graph, a semi-supervised learning method is used for classification. The proposed Combined Unsupervised and Semi-Supervised Learning (CUSSL) approach is based on very recent methods. The Reciprocal kNN Distance is used for unsupervised distance learning tasks and the semi-supervised learning classification is performed by Particle Competition and Cooperation (PCC). Experimental results conducted in six public datasets demonstrated that the combined approach can achieve effective results, boosting the accuracy of classification tasks.Department of Statistics Applied Mathematics and Computing (DEMAC) State University of São Paulo (UNESP)Department of Statistics Applied Mathematics and Computing (DEMAC) State University of São Paulo (UNESP)Universidade Estadual Paulista (Unesp)Breve, Fabricio Aparecido [UNESP]Pedronette, Daniel Carlos Guimaraes [UNESP]2018-12-11T17:08:10Z2018-12-11T17:08:10Z2016-11-08info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjecthttp://dx.doi.org/10.1109/MLSP.2016.7738877IEEE International Workshop on Machine Learning for Signal Processing, MLSP, v. 2016-November.2161-03712161-0363http://hdl.handle.net/11449/17388010.1109/MLSP.2016.77388772-s2.0-8500215699456938600255383270000-0002-1123-9784Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengIEEE International Workshop on Machine Learning for Signal Processing, MLSP0,217info:eu-repo/semantics/openAccess2024-11-27T14:10:33Zoai:repositorio.unesp.br:11449/173880Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestrepositoriounesp@unesp.bropendoar:29462024-11-27T14:10:33Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false |
| dc.title.none.fl_str_mv |
Combined unsupervised and semi-supervised learning for data classification |
| title |
Combined unsupervised and semi-supervised learning for data classification |
| spellingShingle |
Combined unsupervised and semi-supervised learning for data classification Breve, Fabricio Aparecido [UNESP] Data Classification Semi-Supervised Learning Unsupervised Learning |
| title_short |
Combined unsupervised and semi-supervised learning for data classification |
| title_full |
Combined unsupervised and semi-supervised learning for data classification |
| title_fullStr |
Combined unsupervised and semi-supervised learning for data classification |
| title_full_unstemmed |
Combined unsupervised and semi-supervised learning for data classification |
| title_sort |
Combined unsupervised and semi-supervised learning for data classification |
| author |
Breve, Fabricio Aparecido [UNESP] |
| author_facet |
Breve, Fabricio Aparecido [UNESP] Pedronette, Daniel Carlos Guimaraes [UNESP] |
| author_role |
author |
| author2 |
Pedronette, Daniel Carlos Guimaraes [UNESP] |
| author2_role |
author |
| dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp) |
| dc.contributor.author.fl_str_mv |
Breve, Fabricio Aparecido [UNESP] Pedronette, Daniel Carlos Guimaraes [UNESP] |
| dc.subject.por.fl_str_mv |
Data Classification Semi-Supervised Learning Unsupervised Learning |
| topic |
Data Classification Semi-Supervised Learning Unsupervised Learning |
| description |
Semi-supervised learning methods exploit both labeled and unlabeled data items in their training process, requiring only a small subset of labeled items. Although capable of drastically reducing the costs of labeling process, such methods are directly dependent on the effectiveness of distance measures used for building the kNN graph. On the other hand, unsupervised distance learning approaches aims at capturing and exploiting the dataset structure in order to compute a more effective distance measure, without the need of any labeled data. In this paper, we propose a combined approach which employs both unsupervised and semi-supervised learning paradigms. An unsupervised distance learning procedure is performed as a pre-processing step for improving the kNN graph effectiveness. Based on the more effective graph, a semi-supervised learning method is used for classification. The proposed Combined Unsupervised and Semi-Supervised Learning (CUSSL) approach is based on very recent methods. The Reciprocal kNN Distance is used for unsupervised distance learning tasks and the semi-supervised learning classification is performed by Particle Competition and Cooperation (PCC). Experimental results conducted in six public datasets demonstrated that the combined approach can achieve effective results, boosting the accuracy of classification tasks. |
| publishDate |
2016 |
| dc.date.none.fl_str_mv |
2016-11-08 2018-12-11T17:08:10Z 2018-12-11T17:08:10Z |
| dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
| dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
| format |
conferenceObject |
| status_str |
publishedVersion |
| dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1109/MLSP.2016.7738877 IEEE International Workshop on Machine Learning for Signal Processing, MLSP, v. 2016-November. 2161-0371 2161-0363 http://hdl.handle.net/11449/173880 10.1109/MLSP.2016.7738877 2-s2.0-85002156994 5693860025538327 0000-0002-1123-9784 |
| url |
http://dx.doi.org/10.1109/MLSP.2016.7738877 http://hdl.handle.net/11449/173880 |
| identifier_str_mv |
IEEE International Workshop on Machine Learning for Signal Processing, MLSP, v. 2016-November. 2161-0371 2161-0363 10.1109/MLSP.2016.7738877 2-s2.0-85002156994 5693860025538327 0000-0002-1123-9784 |
| dc.language.iso.fl_str_mv |
eng |
| language |
eng |
| dc.relation.none.fl_str_mv |
IEEE International Workshop on Machine Learning for Signal Processing, MLSP 0,217 |
| dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
| eu_rights_str_mv |
openAccess |
| dc.source.none.fl_str_mv |
Scopus reponame:Repositório Institucional da UNESP instname:Universidade Estadual Paulista (UNESP) instacron:UNESP |
| instname_str |
Universidade Estadual Paulista (UNESP) |
| instacron_str |
UNESP |
| institution |
UNESP |
| reponame_str |
Repositório Institucional da UNESP |
| collection |
Repositório Institucional da UNESP |
| repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
| repository.mail.fl_str_mv |
repositoriounesp@unesp.br |
| _version_ |
1834483825101504512 |