Identification and explanation of disinformation in wiki data streams

Bibliographic Details
Main Author: Arriba-Pérez, Francisco de
Publication Date: 2025
Other Authors: García-Méndez, Silvia, Leal, Fátima, Malheiro, Benedita, Burguillo, Juan C.
Format: Article
Language: eng
Source: Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full: https://hdl.handle.net/11328/6149
Summary: Social media platforms, increasingly used as news sources for varied data analytics, have transformed how information is generated and disseminated. However, the unverified nature of this content raises concerns about trustworthiness and accuracy, potentially negatively impacting readers’ critical judgment due to disinformation. This work aims to contribute to the automatic data quality validation field, addressing the rapid growth of online content on wiki pages. Our scalable solution includes stream-based data processing with feature engineering, feature analysis and selection, stream-based classification, and real-time explanation of prediction outcomes. The explainability dashboard is designed for the general public, who may need more specialized knowledge to interpret the model’s prediction. Experimental results on two datasets attain approximately 90% values across all evaluation metrics, demonstrating robust and competitive performance compared to works in the literature. In summary, the system assists editors by reducing their effort and time in detecting disinformation.
id RCAP_bbf92e21853818ee535c38cb9e305d73
oai_identifier_str oai:repositorio.upt.pt:11328/6149
network_acronym_str RCAP
network_name_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str https://opendoar.ac.uk/repository/7160
spelling Identification and explanation of disinformation in wiki data streamsWiki data streamsCiências Naturais - Ciências da Computação e da InformaçãoSocial media platforms, increasingly used as news sources for varied data analytics, have transformed how information is generated and disseminated. However, the unverified nature of this content raises concerns about trustworthiness and accuracy, potentially negatively impacting readers’ critical judgment due to disinformation. This work aims to contribute to the automatic data quality validation field, addressing the rapid growth of online content on wiki pages. Our scalable solution includes stream-based data processing with feature engineering, feature analysis and selection, stream-based classification, and real-time explanation of prediction outcomes. The explainability dashboard is designed for the general public, who may need more specialized knowledge to interpret the model’s prediction. Experimental results on two datasets attain approximately 90% values across all evaluation metrics, demonstrating robust and competitive performance compared to works in the literature. In summary, the system assists editors by reducing their effort and time in detecting disinformation.Sage2025-02-19T11:41:48Z2025-02-192025-02-02T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfArriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149https://hdl.handle.net/11328/6149Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149https://hdl.handle.net/11328/6149eng1069-25091875-8835https://doi.org/10.1177/10692509241306http://creativecommons.org/licenses/by/4.0/info:eu-repo/semantics/openAccessArriba-Pérez, Francisco deGarcía-Méndez, SilviaLeal, FátimaMalheiro, BeneditaBurguillo, Juan C.reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-04-24T02:06:32Zoai:repositorio.upt.pt:11328/6149Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T20:38:31.840945Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse
dc.title.none.fl_str_mv Identification and explanation of disinformation in wiki data streams
title Identification and explanation of disinformation in wiki data streams
spellingShingle Identification and explanation of disinformation in wiki data streams
Arriba-Pérez, Francisco de
Wiki data streams
Ciências Naturais - Ciências da Computação e da Informação
title_short Identification and explanation of disinformation in wiki data streams
title_full Identification and explanation of disinformation in wiki data streams
title_fullStr Identification and explanation of disinformation in wiki data streams
title_full_unstemmed Identification and explanation of disinformation in wiki data streams
title_sort Identification and explanation of disinformation in wiki data streams
author Arriba-Pérez, Francisco de
author_facet Arriba-Pérez, Francisco de
García-Méndez, Silvia
Leal, Fátima
Malheiro, Benedita
Burguillo, Juan C.
author_role author
author2 García-Méndez, Silvia
Leal, Fátima
Malheiro, Benedita
Burguillo, Juan C.
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Arriba-Pérez, Francisco de
García-Méndez, Silvia
Leal, Fátima
Malheiro, Benedita
Burguillo, Juan C.
dc.subject.por.fl_str_mv Wiki data streams
Ciências Naturais - Ciências da Computação e da Informação
topic Wiki data streams
Ciências Naturais - Ciências da Computação e da Informação
description Social media platforms, increasingly used as news sources for varied data analytics, have transformed how information is generated and disseminated. However, the unverified nature of this content raises concerns about trustworthiness and accuracy, potentially negatively impacting readers’ critical judgment due to disinformation. This work aims to contribute to the automatic data quality validation field, addressing the rapid growth of online content on wiki pages. Our scalable solution includes stream-based data processing with feature engineering, feature analysis and selection, stream-based classification, and real-time explanation of prediction outcomes. The explainability dashboard is designed for the general public, who may need more specialized knowledge to interpret the model’s prediction. Experimental results on two datasets attain approximately 90% values across all evaluation metrics, demonstrating robust and competitive performance compared to works in the literature. In summary, the system assists editors by reducing their effort and time in detecting disinformation.
publishDate 2025
dc.date.none.fl_str_mv 2025-02-19T11:41:48Z
2025-02-19
2025-02-02T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149
https://hdl.handle.net/11328/6149
Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149
https://hdl.handle.net/11328/6149
identifier_str_mv Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149
url https://hdl.handle.net/11328/6149
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1069-2509
1875-8835
https://doi.org/10.1177/10692509241306
dc.rights.driver.fl_str_mv http://creativecommons.org/licenses/by/4.0/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Sage
publisher.none.fl_str_mv Sage
dc.source.none.fl_str_mv reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron:RCAAP
instname_str FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str RCAAP
institution RCAAP
reponame_str Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv info@rcaap.pt
_version_ 1833598765197426688