Framework for the Discovery of Newsworthy Events in Social Media

Detalhes bibliográficos
Autor(a) principal: Duarte, Fernando José Fradique
Data de Publicação: 2019
Outros Autores: Pereira, Óscar Mortágua, Aguiar, Rui L.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10773/26037
Resumo: The new communication paradigm established by social media along with its growing popularity in recent years contributed to attract an increasing interest of several research fields. One such research field is the field of event detection in social media. The contribution of this article is to implement a system to detect newsworthy events in Twitter. The proposed pipeline first splits the tweets into segments. These segments are then ranked. The top k segments in this ranking are then grouped together. Finally, the resulting candidate events are filtered in order to retain only those related to realworld newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets written in Portuguese. In terms of performance, the proposed approach achieved an overall precision of 88% and a recall of 38%.
id RCAP_48457ae73bb4f9de52ac08175c7a38f2
oai_identifier_str oai:ria.ua.pt:10773/26037
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Framework for the Discovery of Newsworthy Events in Social MediaDirected Acyclic GraphDynamic ProgrammingEvent DetectionJarvis-Patrick ClusteringKNN NeighborsLearningMachine XGBoostNaïve BayesRandom ForestSVMThe new communication paradigm established by social media along with its growing popularity in recent years contributed to attract an increasing interest of several research fields. One such research field is the field of event detection in social media. The contribution of this article is to implement a system to detect newsworthy events in Twitter. The proposed pipeline first splits the tweets into segments. These segments are then ranked. The top k segments in this ranking are then grouped together. Finally, the resulting candidate events are filtered in order to retain only those related to realworld newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets written in Portuguese. In terms of performance, the proposed approach achieved an overall precision of 88% and a recall of 38%.IGI Global2019-05-13T14:00:42Z2019-07-01T00:00:00Z2019-07-01info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/10773/26037eng1947-934410.4018/IJOCI.2019070103Duarte, Fernando José FradiquePereira, Óscar MortáguaAguiar, Rui L.info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-05-06T04:20:24Zoai:ria.ua.pt:10773/26037Portal AgregadorONGhttps://www.rcaap.pt/oai/openairemluisa.alvim@gmail.comopendoar:71602024-05-06T04:20:24Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Framework for the Discovery of Newsworthy Events in Social Media
title Framework for the Discovery of Newsworthy Events in Social Media
spellingShingle Framework for the Discovery of Newsworthy Events in Social Media
Duarte, Fernando José Fradique
Directed Acyclic Graph
Dynamic Programming
Event Detection
Jarvis-Patrick Clustering
KNN Neighbors
Learning
Machine XGBoost
Naïve Bayes
Random Forest
SVM
title_short Framework for the Discovery of Newsworthy Events in Social Media
title_full Framework for the Discovery of Newsworthy Events in Social Media
title_fullStr Framework for the Discovery of Newsworthy Events in Social Media
title_full_unstemmed Framework for the Discovery of Newsworthy Events in Social Media
title_sort Framework for the Discovery of Newsworthy Events in Social Media
author Duarte, Fernando José Fradique
author_facet Duarte, Fernando José Fradique
Pereira, Óscar Mortágua
Aguiar, Rui L.
author_role author
author2 Pereira, Óscar Mortágua
Aguiar, Rui L.
author2_role author
author
dc.contributor.author.fl_str_mv Duarte, Fernando José Fradique
Pereira, Óscar Mortágua
Aguiar, Rui L.
dc.subject.por.fl_str_mv Directed Acyclic Graph
Dynamic Programming
Event Detection
Jarvis-Patrick Clustering
KNN Neighbors
Learning
Machine XGBoost
Naïve Bayes
Random Forest
SVM
topic Directed Acyclic Graph
Dynamic Programming
Event Detection
Jarvis-Patrick Clustering
KNN Neighbors
Learning
Machine XGBoost
Naïve Bayes
Random Forest
SVM
description The new communication paradigm established by social media along with its growing popularity in recent years contributed to attract an increasing interest of several research fields. One such research field is the field of event detection in social media. The contribution of this article is to implement a system to detect newsworthy events in Twitter. The proposed pipeline first splits the tweets into segments. These segments are then ranked. The top k segments in this ranking are then grouped together. Finally, the resulting candidate events are filtered in order to retain only those related to realworld newsworthy events. The implemented system was tested with three months of data, representing a total of 4,770,636 tweets written in Portuguese. In terms of performance, the proposed approach achieved an overall precision of 88% and a recall of 38%.
publishDate 2019
dc.date.none.fl_str_mv 2019-05-13T14:00:42Z
2019-07-01T00:00:00Z
2019-07-01
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10773/26037
url http://hdl.handle.net/10773/26037
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 1947-9344
10.4018/IJOCI.2019070103
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv IGI Global
publisher.none.fl_str_mv IGI Global
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv mluisa.alvim@gmail.com
_version_ 1817543709209657344