Probabilistic approach for comparing partitions
Main Author: | |
---|---|
Publication Date: | 2014 |
Other Authors: | , , |
Format: | Article |
Language: | eng |
Source: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Download full: | http://hdl.handle.net/10400.3/3429 |
Summary: | The comparison of two partitions in Cluster Analysis can be performed using various classical coefficients (or indexes) in the context of three approaches (based, respectively, on the count of pairs, on the pairing of the classes and on the variation of information). However, different indexes usually highlight different peculiarities of the partitions to compare. Moreover, these coefficients may have different variation ranges or they do not vary in the predicted interval, but rather only in one of their subintervals. Furthermore, there is a great diversity of validation techniques capable of assisting in the choice of the best partitioning of the elements to be classified, but in general each one tends to favour a certain kind of algorithm. Thus, it is useful to find ways to compare the results obtained using different approaches. In order to assist this assessment, a probabilistic approach to comparing partitions is presented and exemplified. This approach, based on the VL (Validity Linkage) Similarity, has the advantage, among others, of standardizing the measurement scales in a unique probabilistic scale. In this work, the partitions obtained from the agglomerative hierarchical cluster analysis of a dataset in the field of teaching are evaluated using classical and probabilistic (of VL type) indexes, and the obtained results are compared. |
id |
RCAP_9a9d8cd236089636e47170aaf5baa24c |
---|---|
oai_identifier_str |
oai:repositorio.uac.pt:10400.3/3429 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Probabilistic approach for comparing partitionsHierarchical Cluster AnalysisComparing PartitionsAffinity CoefficientVL MethodologyThe comparison of two partitions in Cluster Analysis can be performed using various classical coefficients (or indexes) in the context of three approaches (based, respectively, on the count of pairs, on the pairing of the classes and on the variation of information). However, different indexes usually highlight different peculiarities of the partitions to compare. Moreover, these coefficients may have different variation ranges or they do not vary in the predicted interval, but rather only in one of their subintervals. Furthermore, there is a great diversity of validation techniques capable of assisting in the choice of the best partitioning of the elements to be classified, but in general each one tends to favour a certain kind of algorithm. Thus, it is useful to find ways to compare the results obtained using different approaches. In order to assist this assessment, a probabilistic approach to comparing partitions is presented and exemplified. This approach, based on the VL (Validity Linkage) Similarity, has the advantage, among others, of standardizing the measurement scales in a unique probabilistic scale. In this work, the partitions obtained from the agglomerative hierarchical cluster analysis of a dataset in the field of teaching are evaluated using classical and probabilistic (of VL type) indexes, and the obtained results are compared.ISAST - International Society for the Advancement of Science and TechnologyRepositório da Universidade dos AçoresSilva, OsvaldoBacelar-Nicolau, HelenaNicolau, Fernando C.Sousa, Áurea2015-04-28T16:00:18Z20142014-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfapplication/pdfhttp://hdl.handle.net/10400.3/3429eng978-618-81257-5-9 (Book)978-618-81257-6-6 (e-Book)info:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-03-07T10:08:57Zoai:repositorio.uac.pt:10400.3/3429Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-29T00:41:18.701455Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Probabilistic approach for comparing partitions |
title |
Probabilistic approach for comparing partitions |
spellingShingle |
Probabilistic approach for comparing partitions Silva, Osvaldo Hierarchical Cluster Analysis Comparing Partitions Affinity Coefficient VL Methodology |
title_short |
Probabilistic approach for comparing partitions |
title_full |
Probabilistic approach for comparing partitions |
title_fullStr |
Probabilistic approach for comparing partitions |
title_full_unstemmed |
Probabilistic approach for comparing partitions |
title_sort |
Probabilistic approach for comparing partitions |
author |
Silva, Osvaldo |
author_facet |
Silva, Osvaldo Bacelar-Nicolau, Helena Nicolau, Fernando C. Sousa, Áurea |
author_role |
author |
author2 |
Bacelar-Nicolau, Helena Nicolau, Fernando C. Sousa, Áurea |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Repositório da Universidade dos Açores |
dc.contributor.author.fl_str_mv |
Silva, Osvaldo Bacelar-Nicolau, Helena Nicolau, Fernando C. Sousa, Áurea |
dc.subject.por.fl_str_mv |
Hierarchical Cluster Analysis Comparing Partitions Affinity Coefficient VL Methodology |
topic |
Hierarchical Cluster Analysis Comparing Partitions Affinity Coefficient VL Methodology |
description |
The comparison of two partitions in Cluster Analysis can be performed using various classical coefficients (or indexes) in the context of three approaches (based, respectively, on the count of pairs, on the pairing of the classes and on the variation of information). However, different indexes usually highlight different peculiarities of the partitions to compare. Moreover, these coefficients may have different variation ranges or they do not vary in the predicted interval, but rather only in one of their subintervals. Furthermore, there is a great diversity of validation techniques capable of assisting in the choice of the best partitioning of the elements to be classified, but in general each one tends to favour a certain kind of algorithm. Thus, it is useful to find ways to compare the results obtained using different approaches. In order to assist this assessment, a probabilistic approach to comparing partitions is presented and exemplified. This approach, based on the VL (Validity Linkage) Similarity, has the advantage, among others, of standardizing the measurement scales in a unique probabilistic scale. In this work, the partitions obtained from the agglomerative hierarchical cluster analysis of a dataset in the field of teaching are evaluated using classical and probabilistic (of VL type) indexes, and the obtained results are compared. |
publishDate |
2014 |
dc.date.none.fl_str_mv |
2014 2014-01-01T00:00:00Z 2015-04-28T16:00:18Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.3/3429 |
url |
http://hdl.handle.net/10400.3/3429 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
978-618-81257-5-9 (Book) 978-618-81257-6-6 (e-Book) |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf application/pdf |
dc.publisher.none.fl_str_mv |
ISAST - International Society for the Advancement of Science and Technology |
publisher.none.fl_str_mv |
ISAST - International Society for the Advancement of Science and Technology |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833600649701359616 |