A global Approach to the Comparison of Clustering Results
Main Author: | |
---|---|
Publication Date: | 2012 |
Other Authors: | , |
Format: | Article |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/10400.3/2706 |
Summary: | The discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data. |
id |
RCAP_80567b31655ff20ed0882eb32ee7ce5f |
---|---|
oai_identifier_str |
oai:repositorio.uac.pt:10400.3/2706 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
A global Approach to the Comparison of Clustering ResultsCluster AnalysisVL MethodologyAffinity CoefficientComparing PartitionsCluster StabilityCluster ValidationThe discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data.Walter de GruyterRepositório da Universidade dos AçoresSilva, OsvaldoBacelar-Nicolau, HelenaNicolau, Fernando C.2014-02-05T15:53:47Z20122014-01-29T12:42:09Z2012-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.3/2706eng1896-3811 (Print)info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-07T10:05:15Zoai:repositorio.uac.pt:10400.3/2706Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T00:35:48.463328Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
A global Approach to the Comparison of Clustering Results |
title |
A global Approach to the Comparison of Clustering Results |
spellingShingle |
A global Approach to the Comparison of Clustering Results Silva, Osvaldo Cluster Analysis VL Methodology Affinity Coefficient Comparing Partitions Cluster Stability Cluster Validation |
title_short |
A global Approach to the Comparison of Clustering Results |
title_full |
A global Approach to the Comparison of Clustering Results |
title_fullStr |
A global Approach to the Comparison of Clustering Results |
title_full_unstemmed |
A global Approach to the Comparison of Clustering Results |
title_sort |
A global Approach to the Comparison of Clustering Results |
author |
Silva, Osvaldo |
author_facet |
Silva, Osvaldo Bacelar-Nicolau, Helena Nicolau, Fernando C. |
author_role |
author |
author2 |
Bacelar-Nicolau, Helena Nicolau, Fernando C. |
author2_role |
author author |
dc.contributor.none.fl_str_mv |
Repositório da Universidade dos Açores |
dc.contributor.author.fl_str_mv |
Silva, Osvaldo Bacelar-Nicolau, Helena Nicolau, Fernando C. |
dc.subject.por.fl_str_mv |
Cluster Analysis VL Methodology Affinity Coefficient Comparing Partitions Cluster Stability Cluster Validation |
topic |
Cluster Analysis VL Methodology Affinity Coefficient Comparing Partitions Cluster Stability Cluster Validation |
description |
The discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data. |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012 2012-01-01T00:00:00Z 2014-02-05T15:53:47Z 2014-01-29T12:42:09Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.3/2706 |
url |
http://hdl.handle.net/10400.3/2706 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
1896-3811 (Print) |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Walter de Gruyter |
publisher.none.fl_str_mv |
Walter de Gruyter |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833600609221083136 |