Framework for the Discovery of Newsworthy Events in Social Media
Autor(a) principal: | |
---|---|
Data de Publicação: | 2019 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10773/26037 |
Resumo: | The new communication paradigm established by social media along with its growing popularity in recent years contributed to attract an increasing interest of several research fields. One such research field is the field of event detection in social media. The contribution of this article is to implement a system to detect newsworthy events in Twitter. The proposed pipeline first splits the tweets into segments. These segments are then ranked. The top k segments in this ranking are then grouped together. Finally, the resulting candidate events are filtered in order to retain only those related to realworld newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets written in Portuguese. In terms of performance, the proposed approach achieved an overall precision of 88% and a recall of 38%. |
id |
RCAP_48457ae73bb4f9de52ac08175c7a38f2 |
---|---|
oai_identifier_str |
oai:ria.ua.pt:10773/26037 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Framework for the Discovery of Newsworthy Events in Social MediaDirected Acyclic GraphDynamic ProgrammingEvent DetectionJarvis-Patrick ClusteringKNN NeighborsLearningMachine XGBoostNaïve BayesRandom ForestSVMThe new communication paradigm established by social media along with its growing popularity in recent years contributed to attract an increasing interest of several research fields. One such research field is the field of event detection in social media. The contribution of this article is to implement a system to detect newsworthy events in Twitter. The proposed pipeline first splits the tweets into segments. These segments are then ranked. The top k segments in this ranking are then grouped together. Finally, the resulting candidate events are filtered in order to retain only those related to realworld newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets written in Portuguese. In terms of performance, the proposed approach achieved an overall precision of 88% and a recall of 38%.IGI Global2019-05-13T14:00:42Z2019-07-01T00:00:00Z2019-07-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10773/26037eng1947-934410.4018/IJOCI.2019070103Duarte, Fernando José FradiquePereira, Óscar MortáguaAguiar, Rui L.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-05-06T04:20:24Zoai:ria.ua.pt:10773/26037Portal AgregadorONGhttps://www.rcaap.pt/oai/openairemluisa.alvim@gmail.comopendoar:71602024-05-06T04:20:24Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Framework for the Discovery of Newsworthy Events in Social Media |
title |
Framework for the Discovery of Newsworthy Events in Social Media |
spellingShingle |
Framework for the Discovery of Newsworthy Events in Social Media Duarte, Fernando José Fradique Directed Acyclic Graph Dynamic Programming Event Detection Jarvis-Patrick Clustering KNN Neighbors Learning Machine XGBoost Naïve Bayes Random Forest SVM |
title_short |
Framework for the Discovery of Newsworthy Events in Social Media |
title_full |
Framework for the Discovery of Newsworthy Events in Social Media |
title_fullStr |
Framework for the Discovery of Newsworthy Events in Social Media |
title_full_unstemmed |
Framework for the Discovery of Newsworthy Events in Social Media |
title_sort |
Framework for the Discovery of Newsworthy Events in Social Media |
author |
Duarte, Fernando José Fradique |
author_facet |
Duarte, Fernando José Fradique Pereira, Óscar Mortágua Aguiar, Rui L. |
author_role |
author |
author2 |
Pereira, Óscar Mortágua Aguiar, Rui L. |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Duarte, Fernando José Fradique Pereira, Óscar Mortágua Aguiar, Rui L. |
dc.subject.por.fl_str_mv |
Directed Acyclic Graph Dynamic Programming Event Detection Jarvis-Patrick Clustering KNN Neighbors Learning Machine XGBoost Naïve Bayes Random Forest SVM |
topic |
Directed Acyclic Graph Dynamic Programming Event Detection Jarvis-Patrick Clustering KNN Neighbors Learning Machine XGBoost Naïve Bayes Random Forest SVM |
description |
The new communication paradigm established by social media along with its growing popularity in recent years contributed to attract an increasing interest of several research fields. One such research field is the field of event detection in social media. The contribution of this article is to implement a system to detect newsworthy events in Twitter. The proposed pipeline first splits the tweets into segments. These segments are then ranked. The top k segments in this ranking are then grouped together. Finally, the resulting candidate events are filtered in order to retain only those related to realworld newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets written in Portuguese. In terms of performance, the proposed approach achieved an overall precision of 88% and a recall of 38%. |
publishDate |
2019 |
dc.date.none.fl_str_mv |
2019-05-13T14:00:42Z 2019-07-01T00:00:00Z 2019-07-01 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10773/26037 |
url |
http://hdl.handle.net/10773/26037 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
1947-9344 10.4018/IJOCI.2019070103 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
IGI Global |
publisher.none.fl_str_mv |
IGI Global |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
mluisa.alvim@gmail.com |
_version_ |
1817543709209657344 |