A global Approach to the Comparison of Clustering Results

Bibliographic Details
Main Author: Silva, Osvaldo
Publication Date: 2012
Other Authors: Bacelar-Nicolau, Helena, Nicolau, Fernando C.
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10400.3/2706
Summary: The discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data.
id RCAP_80567b31655ff20ed0882eb32ee7ce5f
oai_identifier_str oai:repositorio.uac.pt:10400.3/2706
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling A global Approach to the Comparison of Clustering ResultsCluster AnalysisVL MethodologyAffinity CoefficientComparing PartitionsCluster StabilityCluster ValidationThe discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data.Walter de GruyterRepositório da Universidade dos AçoresSilva, OsvaldoBacelar-Nicolau, HelenaNicolau, Fernando C.2014-02-05T15:53:47Z20122014-01-29T12:42:09Z2012-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10400.3/2706eng1896-3811 (Print)info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-07T10:05:15Zoai:repositorio.uac.pt:10400.3/2706Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T00:35:48.463328Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv A global Approach to the Comparison of Clustering Results
title A global Approach to the Comparison of Clustering Results
spellingShingle A global Approach to the Comparison of Clustering Results
Silva, Osvaldo
Cluster Analysis
VL Methodology
Affinity Coefficient
Comparing Partitions
Cluster Stability
Cluster Validation
title_short A global Approach to the Comparison of Clustering Results
title_full A global Approach to the Comparison of Clustering Results
title_fullStr A global Approach to the Comparison of Clustering Results
title_full_unstemmed A global Approach to the Comparison of Clustering Results
title_sort A global Approach to the Comparison of Clustering Results
author Silva, Osvaldo
author_facet Silva, Osvaldo
Bacelar-Nicolau, Helena
Nicolau, Fernando C.
author_role author
author2 Bacelar-Nicolau, Helena
Nicolau, Fernando C.
author2_role author
author
dc.contributor.none.fl_str_mv Repositório da Universidade dos Açores
dc.contributor.author.fl_str_mv Silva, Osvaldo
Bacelar-Nicolau, Helena
Nicolau, Fernando C.
dc.subject.por.fl_str_mv Cluster Analysis
VL Methodology
Affinity Coefficient
Comparing Partitions
Cluster Stability
Cluster Validation
topic Cluster Analysis
VL Methodology
Affinity Coefficient
Comparing Partitions
Cluster Stability
Cluster Validation
description The discovery of knowledge in the case of Hierarchical Cluster Analysis (HCA) depends on many factors, such as the clustering algorithms applied and the strategies developed in the initialstage of Cluster Analysis. We present a global approach for evaluating the quality of clustering results and making a comparison among different clustering algorithms using the relevant information available (e.g. the stability, isolation and homogeneity of the clusters). In addition, we present a visual method to facilitate evaluation of the quality of the partitions, allowing identification of the similarities and differences between partitions, as well as the behaviour of the elements in the partitions. We illustrate our approach using a complex and heterogeneous dataset (real horse data) taken from the literature. We apply HCA based on the generalized affinity coefficient (similarity coefficient) to the case of complex data (symbolic data), combined with 26 (classic and probabilistic) clustering algorithms. Finally, we discuss the obtained results and the contribution of this approach to gaining better knowledge of the structure of data.
publishDate 2012
dc.date.none.fl_str_mv 2012
2012-01-01T00:00:00Z
2014-02-05T15:53:47Z
2014-01-29T12:42:09Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.3/2706
url http://hdl.handle.net/10400.3/2706
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1896-3811 (Print)
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Walter de Gruyter
publisher.none.fl_str_mv Walter de Gruyter
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833600609221083136