Automated Fake News detection using computational Forensic Linguistics
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
Texto Completo: | https://hdl.handle.net/10216/135505 |
Resumo: | In our society, everyone has access to the internet and can post anything about any topic at any time. Despite its many advantages, this possibility brought along a serious problem: Fake News. Fake News is news that is not real for not following journalism principles. Instead, Fake News try to mimic the look and feel of real news with the intent to disinform the reader. However, what makes Fake News a real problem is the influence that it can have on our society. Lay people are attracted to this kind of news and often give more attention to them than truthful accounts. Despite the development of systems to detect Fake News, most are based on fact-checking methods, which are unfit when the news's truth is distorted, exaggerated, or even placed out of context. We aim to detect Portuguese Fake News using machine learning techniques with a Forensic Linguistic approach. Contrary to previous approaches, our approach builds upon linguistic and stylistic analysis methods that have been tried and tested in Forensic Linguistic analysis. After collecting the corpus from multiple sources, we formulated the task as a text classification problem and demonstrated the proposed classifier's capability for detecting Fake News. The results reported are promising, achieving high accuracies on the test data. |
id |
RCAP_33f3f98e1c9ae3aaf68bf2205a41f0ac |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/135505 |
network_acronym_str |
RCAP |
network_name_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository_id_str |
https://opendoar.ac.uk/repository/7160 |
spelling |
Automated Fake News detection using computational Forensic LinguisticsEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringIn our society, everyone has access to the internet and can post anything about any topic at any time. Despite its many advantages, this possibility brought along a serious problem: Fake News. Fake News is news that is not real for not following journalism principles. Instead, Fake News try to mimic the look and feel of real news with the intent to disinform the reader. However, what makes Fake News a real problem is the influence that it can have on our society. Lay people are attracted to this kind of news and often give more attention to them than truthful accounts. Despite the development of systems to detect Fake News, most are based on fact-checking methods, which are unfit when the news's truth is distorted, exaggerated, or even placed out of context. We aim to detect Portuguese Fake News using machine learning techniques with a Forensic Linguistic approach. Contrary to previous approaches, our approach builds upon linguistic and stylistic analysis methods that have been tried and tested in Forensic Linguistic analysis. After collecting the corpus from multiple sources, we formulated the task as a text classification problem and demonstrated the proposed classifier's capability for detecting Fake News. The results reported are promising, achieving high accuracies on the test data.2021-07-092021-07-09T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/135505TID:202825051engRicardo Ribeiro Sanfins Mourainfo:eu-repo/semantics/openAccessreponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP)instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiainstacron:RCAAP2025-02-27T18:30:25Zoai:repositorio-aberto.up.pt:10216/135505Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireinfo@rcaap.ptopendoar:https://opendoar.ac.uk/repository/71602025-05-28T22:50:51.988221Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologiafalse |
dc.title.none.fl_str_mv |
Automated Fake News detection using computational Forensic Linguistics |
title |
Automated Fake News detection using computational Forensic Linguistics |
spellingShingle |
Automated Fake News detection using computational Forensic Linguistics Ricardo Ribeiro Sanfins Moura Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Automated Fake News detection using computational Forensic Linguistics |
title_full |
Automated Fake News detection using computational Forensic Linguistics |
title_fullStr |
Automated Fake News detection using computational Forensic Linguistics |
title_full_unstemmed |
Automated Fake News detection using computational Forensic Linguistics |
title_sort |
Automated Fake News detection using computational Forensic Linguistics |
author |
Ricardo Ribeiro Sanfins Moura |
author_facet |
Ricardo Ribeiro Sanfins Moura |
author_role |
author |
dc.contributor.author.fl_str_mv |
Ricardo Ribeiro Sanfins Moura |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
In our society, everyone has access to the internet and can post anything about any topic at any time. Despite its many advantages, this possibility brought along a serious problem: Fake News. Fake News is news that is not real for not following journalism principles. Instead, Fake News try to mimic the look and feel of real news with the intent to disinform the reader. However, what makes Fake News a real problem is the influence that it can have on our society. Lay people are attracted to this kind of news and often give more attention to them than truthful accounts. Despite the development of systems to detect Fake News, most are based on fact-checking methods, which are unfit when the news's truth is distorted, exaggerated, or even placed out of context. We aim to detect Portuguese Fake News using machine learning techniques with a Forensic Linguistic approach. Contrary to previous approaches, our approach builds upon linguistic and stylistic analysis methods that have been tried and tested in Forensic Linguistic analysis. After collecting the corpus from multiple sources, we formulated the task as a text classification problem and demonstrated the proposed classifier's capability for detecting Fake News. The results reported are promising, achieving high accuracies on the test data. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-07-09 2021-07-09T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/135505 TID:202825051 |
url |
https://hdl.handle.net/10216/135505 |
identifier_str_mv |
TID:202825051 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) instname:FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia instacron:RCAAP |
instname_str |
FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
collection |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) |
repository.name.fl_str_mv |
Repositórios Científicos de Acesso Aberto de Portugal (RCAAP) - FCCN, serviços digitais da FCT – Fundação para a Ciência e a Tecnologia |
repository.mail.fl_str_mv |
info@rcaap.pt |
_version_ |
1833599894493855744 |