Identification and explanation of disinformation in wiki data streams

Arriba-Pérez, Francisco de; García-Méndez, Silvia; Leal, Fátima; Malheiro, Benedita; Burguillo, Juan C.

Bibliographic Details
Main Author:	Arriba-Pérez, Francisco de
Publication Date:	2025
Other Authors:	García-Méndez, Silvia; Leal, Fátima; Malheiro, Benedita; Burguillo, Juan C.
Format:	Article
Language:	eng
Source:	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
Download full:	https://hdl.handle.net/11328/6149
Summary:	Social media platforms, increasingly used as news sources for varied data analytics, have transformed how information is generated and disseminated. However, the unverified nature of this content raises concerns about trustworthiness and accuracy, since disinformation can impair readers’ critical judgment. This work contributes to the field of automatic data quality validation, addressing the rapid growth of online content on wiki pages. Our scalable solution comprises stream-based data processing with feature engineering, feature analysis and selection, stream-based classification, and real-time explanation of prediction outcomes. The explainability dashboard is designed for the general public, who may lack the specialized knowledge needed to interpret the model’s prediction. Experimental results on two datasets reach approximately 90% across all evaluation metrics, demonstrating robust performance that is competitive with works in the literature. In summary, the system assists editors by reducing the effort and time they spend detecting disinformation.

Item metadata

id	RCAP_bbf92e21853818ee535c38cb9e305d73
oai_identifier_str	oai:repositorio.upt.pt:11328/6149
network_acronym_str	RCAP
network_name_str	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository_id_str	https://opendoar.ac.uk/repository/7160
dc.title.none.fl_str_mv	Identification and explanation of disinformation in wiki data streams
title	Identification and explanation of disinformation in wiki data streams
spellingShingle	Identification and explanation of disinformation in wiki data streams Arriba-Pérez, Francisco de Wiki data streams Ciências Naturais - Ciências da Computação e da Informação
title_short	Identification and explanation of disinformation in wiki data streams
title_full	Identification and explanation of disinformation in wiki data streams
title_fullStr	Identification and explanation of disinformation in wiki data streams
title_full_unstemmed	Identification and explanation of disinformation in wiki data streams
title_sort	Identification and explanation of disinformation in wiki data streams
author	Arriba-Pérez, Francisco de
author_facet	Arriba-Pérez, Francisco de García-Méndez, Silvia Leal, Fátima Malheiro, Benedita Burguillo, Juan C.
author_role	author
author2	García-Méndez, Silvia Leal, Fátima Malheiro, Benedita Burguillo, Juan C.
author2_role	author author author author
dc.contributor.author.fl_str_mv	Arriba-Pérez, Francisco de García-Méndez, Silvia Leal, Fátima Malheiro, Benedita Burguillo, Juan C.
dc.subject.por.fl_str_mv	Wiki data streams Ciências Naturais - Ciências da Computação e da Informação
topic	Wiki data streams Ciências Naturais - Ciências da Computação e da Informação
description	Social media platforms, increasingly used as news sources for varied data analytics, have transformed how information is generated and disseminated. However, the unverified nature of this content raises concerns about trustworthiness and accuracy, since disinformation can impair readers’ critical judgment. This work contributes to the field of automatic data quality validation, addressing the rapid growth of online content on wiki pages. Our scalable solution comprises stream-based data processing with feature engineering, feature analysis and selection, stream-based classification, and real-time explanation of prediction outcomes. The explainability dashboard is designed for the general public, who may lack the specialized knowledge needed to interpret the model’s prediction. Experimental results on two datasets reach approximately 90% across all evaluation metrics, demonstrating robust performance that is competitive with works in the literature. In summary, the system assists editors by reducing the effort and time they spend detecting disinformation.
publishDate	2025
dc.date.none.fl_str_mv	2025-02-19T11:41:48Z 2025-02-19 2025-02-02T00:00:00Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149
identifier_str_mv	Arriba-Pérez, F., García-Méndez, S., Leal, F., Malheiro, B., & Burguillo, J. C. (2025). Identification and explanation of disinformation in wiki data streams. Integrated Computer-Aided Engineering, (published online: 02 February 2025), 1-17. https://doi.org/10.1177/1069250924130658. Repositório Institucional UPT. https://hdl.handle.net/11328/6149
url	https://hdl.handle.net/11328/6149
dc.language.iso.fl_str_mv	eng
language	eng
dc.relation.none.fl_str_mv	1069-2509 1875-8835 https://doi.org/10.1177/10692509241306
dc.rights.driver.fl_str_mv	http://creativecommons.org/licenses/by/4.0/ info:eu-repo/semantics/openAccess
rights_invalid_str_mv	http://creativecommons.org/licenses/by/4.0/
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Sage
publisher.none.fl_str_mv	Sage
dc.source.none.fl_str_mv	reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP
instname_str	FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
collection	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)
repository.name.fl_str_mv	Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia
repository.mail.fl_str_mv	info@rcaap.pt
_version_	1833598765197426688

Similar Items