Automated Fake News detection using computational Forensic Linguistics
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/135505 |
Resumo: | In our society, everyone has access to the internet and can post anything about any topic at any time. Despite its many advantages, this possibility brought along a serious problem: Fake News. Fake News is news that is not real for not following journalism principles. Instead, Fake News try to mimic the look and feel of real news with the intent to disinform the reader. However, what makes Fake News a real problem is the influence that it can have on our society. Lay people are attracted to this kind of news and often give more attention to them than truthful accounts. Despite the development of systems to detect Fake News, most are based on fact-checking methods, which are unfit when the news's truth is distorted, exaggerated, or even placed out of context. We aim to detect Portuguese Fake News using machine learning techniques with a Forensic Linguistic approach. Contrary to previous approaches, our approach builds upon linguistic and stylistic analysis methods that have been tried and tested in Forensic Linguistic analysis. After collecting the corpus from multiple sources, we formulated the task as a text classification problem and demonstrated the proposed classifier's capability for detecting Fake News. The results reported are promising, achieving high accuracies on the test data. |
id |
RCAP_33f3f98e1c9ae3aaf68bf2205a41f0ac |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/135505 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Automated Fake News detection using computational Forensic LinguisticsEngenharia electrotécnica, electrónica e informáticaElectrical engineering, Electronic engineering, Information engineeringIn our society, everyone has access to the internet and can post anything about any topic at any time. Despite its many advantages, this possibility brought along a serious problem: Fake News. Fake News is news that is not real for not following journalism principles. Instead, Fake News try to mimic the look and feel of real news with the intent to disinform the reader. However, what makes Fake News a real problem is the influence that it can have on our society. Lay people are attracted to this kind of news and often give more attention to them than truthful accounts. Despite the development of systems to detect Fake News, most are based on fact-checking methods, which are unfit when the news's truth is distorted, exaggerated, or even placed out of context. We aim to detect Portuguese Fake News using machine learning techniques with a Forensic Linguistic approach. Contrary to previous approaches, our approach builds upon linguistic and stylistic analysis methods that have been tried and tested in Forensic Linguistic analysis. After collecting the corpus from multiple sources, we formulated the task as a text classification problem and demonstrated the proposed classifier's capability for detecting Fake News. The results reported are promising, achieving high accuracies on the test data.2021-07-092021-07-09T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/135505TID:202825051engRicardo Ribeiro Sanfins Mourainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T14:12:58Zoai:repositorio-aberto.up.pt:10216/135505Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T23:57:10.979951Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Automated Fake News detection using computational Forensic Linguistics |
title |
Automated Fake News detection using computational Forensic Linguistics |
spellingShingle |
Automated Fake News detection using computational Forensic Linguistics Ricardo Ribeiro Sanfins Moura Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
title_short |
Automated Fake News detection using computational Forensic Linguistics |
title_full |
Automated Fake News detection using computational Forensic Linguistics |
title_fullStr |
Automated Fake News detection using computational Forensic Linguistics |
title_full_unstemmed |
Automated Fake News detection using computational Forensic Linguistics |
title_sort |
Automated Fake News detection using computational Forensic Linguistics |
author |
Ricardo Ribeiro Sanfins Moura |
author_facet |
Ricardo Ribeiro Sanfins Moura |
author_role |
author |
dc.contributor.author.fl_str_mv |
Ricardo Ribeiro Sanfins Moura |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática Electrical engineering, Electronic engineering, Information engineering |
description |
In our society, everyone has access to the internet and can post anything about any topic at any time. Despite its many advantages, this possibility brought along a serious problem: Fake News. Fake News is news that is not real for not following journalism principles. Instead, Fake News try to mimic the look and feel of real news with the intent to disinform the reader. However, what makes Fake News a real problem is the influence that it can have on our society. Lay people are attracted to this kind of news and often give more attention to them than truthful accounts. Despite the development of systems to detect Fake News, most are based on fact-checking methods, which are unfit when the news's truth is distorted, exaggerated, or even placed out of context. We aim to detect Portuguese Fake News using machine learning techniques with a Forensic Linguistic approach. Contrary to previous approaches, our approach builds upon linguistic and stylistic analysis methods that have been tried and tested in Forensic Linguistic analysis. After collecting the corpus from multiple sources, we formulated the task as a text classification problem and demonstrated the proposed classifier's capability for detecting Fake News. The results reported are promising, achieving high accuracies on the test data. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-07-09 2021-07-09T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/135505 TID:202825051 |
url |
https://hdl.handle.net/10216/135505 |
identifier_str_mv |
TID:202825051 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799135892830420992 |