Probabilistic approach for comparing partitions

Bibliographic Details
Main Author: Silva, Osvaldo
Publication Date: 2014
Other Authors: Bacelar-Nicolau, Helena, Nicolau, Fernando C., Sousa, Áurea
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: http://hdl.handle.net/10400.3/3429
Summary: The comparison of two partitions in Cluster Analysis can be performed using various classical coefficients (or indexes) in the context of three approaches (based, respectively, on the count of pairs, on the pairing of the classes and on the variation of information). However, different indexes usually highlight different peculiarities of the partitions to compare. Moreover, these coefficients may have different variation ranges or they do not vary in the predicted interval, but rather only in one of their subintervals. Furthermore, there is a great diversity of validation techniques capable of assisting in the choice of the best partitioning of the elements to be classified, but in general each one tends to favour a certain kind of algorithm. Thus, it is useful to find ways to compare the results obtained using different approaches. In order to assist this assessment, a probabilistic approach to comparing partitions is presented and exemplified. This approach, based on the VL (Validity Linkage) Similarity, has the advantage, among others, of standardizing the measurement scales in a unique probabilistic scale. In this work, the partitions obtained from the agglomerative hierarchical cluster analysis of a dataset in the field of teaching are evaluated using classical and probabilistic (of VL type) indexes, and the obtained results are compared.
id RCAP_9a9d8cd236089636e47170aaf5baa24c
oai_identifier_str oai:repositorio.uac.pt:10400.3/3429
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Probabilistic approach for comparing partitionsHierarchical Cluster AnalysisComparing PartitionsAffinity CoefficientVL MethodologyThe comparison of two partitions in Cluster Analysis can be performed using various classical coefficients (or indexes) in the context of three approaches (based, respectively, on the count of pairs, on the pairing of the classes and on the variation of information). However, different indexes usually highlight different peculiarities of the partitions to compare. Moreover, these coefficients may have different variation ranges or they do not vary in the predicted interval, but rather only in one of their subintervals. Furthermore, there is a great diversity of validation techniques capable of assisting in the choice of the best partitioning of the elements to be classified, but in general each one tends to favour a certain kind of algorithm. Thus, it is useful to find ways to compare the results obtained using different approaches. In order to assist this assessment, a probabilistic approach to comparing partitions is presented and exemplified. This approach, based on the VL (Validity Linkage) Similarity, has the advantage, among others, of standardizing the measurement scales in a unique probabilistic scale. In this work, the partitions obtained from the agglomerative hierarchical cluster analysis of a dataset in the field of teaching are evaluated using classical and probabilistic (of VL type) indexes, and the obtained results are compared.ISAST - International Society for the Advancement of Science and TechnologyRepositório da Universidade dos AçoresSilva, OsvaldoBacelar-Nicolau, HelenaNicolau, Fernando C.Sousa, Áurea2015-04-28T16:00:18Z20142014-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/pdfhttp://hdl.handle.net/10400.3/3429eng978-618-81257-5-9 (Book)978-618-81257-6-6 (e-Book)info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-07T10:08:57Zoai:repositorio.uac.pt:10400.3/3429Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T00:41:18.701455Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Probabilistic approach for comparing partitions
title Probabilistic approach for comparing partitions
spellingShingle Probabilistic approach for comparing partitions
Silva, Osvaldo
Hierarchical Cluster Analysis
Comparing Partitions
Affinity Coefficient
VL Methodology
title_short Probabilistic approach for comparing partitions
title_full Probabilistic approach for comparing partitions
title_fullStr Probabilistic approach for comparing partitions
title_full_unstemmed Probabilistic approach for comparing partitions
title_sort Probabilistic approach for comparing partitions
author Silva, Osvaldo
author_facet Silva, Osvaldo
Bacelar-Nicolau, Helena
Nicolau, Fernando C.
Sousa, Áurea
author_role author
author2 Bacelar-Nicolau, Helena
Nicolau, Fernando C.
Sousa, Áurea
author2_role author
author
author
dc.contributor.none.fl_str_mv Repositório da Universidade dos Açores
dc.contributor.author.fl_str_mv Silva, Osvaldo
Bacelar-Nicolau, Helena
Nicolau, Fernando C.
Sousa, Áurea
dc.subject.por.fl_str_mv Hierarchical Cluster Analysis
Comparing Partitions
Affinity Coefficient
VL Methodology
topic Hierarchical Cluster Analysis
Comparing Partitions
Affinity Coefficient
VL Methodology
description The comparison of two partitions in Cluster Analysis can be performed using various classical coefficients (or indexes) in the context of three approaches (based, respectively, on the count of pairs, on the pairing of the classes and on the variation of information). However, different indexes usually highlight different peculiarities of the partitions to compare. Moreover, these coefficients may have different variation ranges or they do not vary in the predicted interval, but rather only in one of their subintervals. Furthermore, there is a great diversity of validation techniques capable of assisting in the choice of the best partitioning of the elements to be classified, but in general each one tends to favour a certain kind of algorithm. Thus, it is useful to find ways to compare the results obtained using different approaches. In order to assist this assessment, a probabilistic approach to comparing partitions is presented and exemplified. This approach, based on the VL (Validity Linkage) Similarity, has the advantage, among others, of standardizing the measurement scales in a unique probabilistic scale. In this work, the partitions obtained from the agglomerative hierarchical cluster analysis of a dataset in the field of teaching are evaluated using classical and probabilistic (of VL type) indexes, and the obtained results are compared.
publishDate 2014
dc.date.none.fl_str_mv 2014
2014-01-01T00:00:00Z
2015-04-28T16:00:18Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.3/3429
url http://hdl.handle.net/10400.3/3429
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 978-618-81257-5-9 (Book)
978-618-81257-6-6 (e-Book)
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
application/pdf
dc.publisher.none.fl_str_mv ISAST - International Society for the Advancement of Science and Technology
publisher.none.fl_str_mv ISAST - International Society for the Advancement of Science and Technology
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833600649701359616