TweeProfiles: deteção de padrões espácio-temporais no Twitter

Detalhes bibliográficos
Autor(a) principal: Tiago Daniel Sá Cunha
Data de Publicação: 2013
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: https://hdl.handle.net/10216/68545
Resumo: Online social networks present themselves as valuable information sources about their users and their respective interests. Such information has been subjected to many studies conducted by Data Mining scholars throughout the world in order to discover users' behaviours and patterns. Besides, there has also been also investment applied in creating platforms for the continuous information extraction and for their data visualization. This dissertation aims to identify tweet profiles by analysing multiple types of data: spatial, temporal, social and content. The goals set are to develop a information extraction approach to validate dimensional combination and to develop a visualization tool capable of displaying the patterns found using state of the art representations. The data mining process is composed by dissimilarity matrices computation, normalization and combination. Each dissimilarity matrix is then subjected to a clustering algorithm that retrieves the information. This dissertation studies in depth appropriate distance functions for the different types of data, the normalization and combination methods available for different dimensions and the clustering algorithms existent. The visualization platform is designed for a dynamic and intuitive usage, aimed at revealing the discovered patterns in the data mining process in an understandable and interactive manner. In order to accomplish such, various visualization patterns were studied and widgets chosen to better represent the information retrieved. The study case on which it will be applied is the geo-referenced data from TwitterEcho, although it will be developed to use any geo-referenced tweets extracted form Twitter itself.
id RCAP_32ee64213e667925acdfb96d8253449f
oai_identifier_str oai:repositorio-aberto.up.pt:10216/68545
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling TweeProfiles: deteção de padrões espácio-temporais no TwitterCiências da engenharia e tecnologiasEngineering and technologyOnline social networks present themselves as valuable information sources about their users and their respective interests. Such information has been subjected to many studies conducted by Data Mining scholars throughout the world in order to discover users' behaviours and patterns. Besides, there has also been also investment applied in creating platforms for the continuous information extraction and for their data visualization. This dissertation aims to identify tweet profiles by analysing multiple types of data: spatial, temporal, social and content. The goals set are to develop a information extraction approach to validate dimensional combination and to develop a visualization tool capable of displaying the patterns found using state of the art representations. The data mining process is composed by dissimilarity matrices computation, normalization and combination. Each dissimilarity matrix is then subjected to a clustering algorithm that retrieves the information. This dissertation studies in depth appropriate distance functions for the different types of data, the normalization and combination methods available for different dimensions and the clustering algorithms existent. The visualization platform is designed for a dynamic and intuitive usage, aimed at revealing the discovered patterns in the data mining process in an understandable and interactive manner. In order to accomplish such, various visualization patterns were studied and widgets chosen to better represent the information retrieved. The study case on which it will be applied is the geo-referenced data from TwitterEcho, although it will be developed to use any geo-referenced tweets extracted form Twitter itself.2013-07-082013-07-08T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/68545engTiago Daniel Sá Cunhainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T15:14:19Zoai:repositorio-aberto.up.pt:10216/68545Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:18:42.940817Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv TweeProfiles: deteção de padrões espácio-temporais no Twitter
title TweeProfiles: deteção de padrões espácio-temporais no Twitter
spellingShingle TweeProfiles: deteção de padrões espácio-temporais no Twitter
Tiago Daniel Sá Cunha
Ciências da engenharia e tecnologias
Engineering and technology
title_short TweeProfiles: deteção de padrões espácio-temporais no Twitter
title_full TweeProfiles: deteção de padrões espácio-temporais no Twitter
title_fullStr TweeProfiles: deteção de padrões espácio-temporais no Twitter
title_full_unstemmed TweeProfiles: deteção de padrões espácio-temporais no Twitter
title_sort TweeProfiles: deteção de padrões espácio-temporais no Twitter
author Tiago Daniel Sá Cunha
author_facet Tiago Daniel Sá Cunha
author_role author
dc.contributor.author.fl_str_mv Tiago Daniel Sá Cunha
dc.subject.por.fl_str_mv Ciências da engenharia e tecnologias
Engineering and technology
topic Ciências da engenharia e tecnologias
Engineering and technology
description Online social networks present themselves as valuable information sources about their users and their respective interests. Such information has been subjected to many studies conducted by Data Mining scholars throughout the world in order to discover users' behaviours and patterns. Besides, there has also been also investment applied in creating platforms for the continuous information extraction and for their data visualization. This dissertation aims to identify tweet profiles by analysing multiple types of data: spatial, temporal, social and content. The goals set are to develop a information extraction approach to validate dimensional combination and to develop a visualization tool capable of displaying the patterns found using state of the art representations. The data mining process is composed by dissimilarity matrices computation, normalization and combination. Each dissimilarity matrix is then subjected to a clustering algorithm that retrieves the information. This dissertation studies in depth appropriate distance functions for the different types of data, the normalization and combination methods available for different dimensions and the clustering algorithms existent. The visualization platform is designed for a dynamic and intuitive usage, aimed at revealing the discovered patterns in the data mining process in an understandable and interactive manner. In order to accomplish such, various visualization patterns were studied and widgets chosen to better represent the information retrieved. The study case on which it will be applied is the geo-referenced data from TwitterEcho, although it will be developed to use any geo-referenced tweets extracted form Twitter itself.
publishDate 2013
dc.date.none.fl_str_mv 2013-07-08
2013-07-08T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://hdl.handle.net/10216/68545
url https://hdl.handle.net/10216/68545
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136105580199937