TweeProfiles: deteção de padrões espácio-temporais no Twitter
Autor(a) principal: | |
---|---|
Data de Publicação: | 2013 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | https://hdl.handle.net/10216/68545 |
Resumo: | Online social networks present themselves as valuable information sources about their users and their respective interests. Such information has been subjected to many studies conducted by Data Mining scholars throughout the world in order to discover users' behaviours and patterns. Besides, there has also been also investment applied in creating platforms for the continuous information extraction and for their data visualization. This dissertation aims to identify tweet profiles by analysing multiple types of data: spatial, temporal, social and content. The goals set are to develop a information extraction approach to validate dimensional combination and to develop a visualization tool capable of displaying the patterns found using state of the art representations. The data mining process is composed by dissimilarity matrices computation, normalization and combination. Each dissimilarity matrix is then subjected to a clustering algorithm that retrieves the information. This dissertation studies in depth appropriate distance functions for the different types of data, the normalization and combination methods available for different dimensions and the clustering algorithms existent. The visualization platform is designed for a dynamic and intuitive usage, aimed at revealing the discovered patterns in the data mining process in an understandable and interactive manner. In order to accomplish such, various visualization patterns were studied and widgets chosen to better represent the information retrieved. The study case on which it will be applied is the geo-referenced data from TwitterEcho, although it will be developed to use any geo-referenced tweets extracted form Twitter itself. |
id |
RCAP_32ee64213e667925acdfb96d8253449f |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/68545 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
TweeProfiles: deteção de padrões espácio-temporais no TwitterCiências da engenharia e tecnologiasEngineering and technologyOnline social networks present themselves as valuable information sources about their users and their respective interests. Such information has been subjected to many studies conducted by Data Mining scholars throughout the world in order to discover users' behaviours and patterns. Besides, there has also been also investment applied in creating platforms for the continuous information extraction and for their data visualization. This dissertation aims to identify tweet profiles by analysing multiple types of data: spatial, temporal, social and content. The goals set are to develop a information extraction approach to validate dimensional combination and to develop a visualization tool capable of displaying the patterns found using state of the art representations. The data mining process is composed by dissimilarity matrices computation, normalization and combination. Each dissimilarity matrix is then subjected to a clustering algorithm that retrieves the information. This dissertation studies in depth appropriate distance functions for the different types of data, the normalization and combination methods available for different dimensions and the clustering algorithms existent. The visualization platform is designed for a dynamic and intuitive usage, aimed at revealing the discovered patterns in the data mining process in an understandable and interactive manner. In order to accomplish such, various visualization patterns were studied and widgets chosen to better represent the information retrieved. The study case on which it will be applied is the geo-referenced data from TwitterEcho, although it will be developed to use any geo-referenced tweets extracted form Twitter itself.2013-07-082013-07-08T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://hdl.handle.net/10216/68545engTiago Daniel Sá Cunhainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-11-29T15:14:19Zoai:repositorio-aberto.up.pt:10216/68545Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:18:42.940817Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
title |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
spellingShingle |
TweeProfiles: deteção de padrões espácio-temporais no Twitter Tiago Daniel Sá Cunha Ciências da engenharia e tecnologias Engineering and technology |
title_short |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
title_full |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
title_fullStr |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
title_full_unstemmed |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
title_sort |
TweeProfiles: deteção de padrões espácio-temporais no Twitter |
author |
Tiago Daniel Sá Cunha |
author_facet |
Tiago Daniel Sá Cunha |
author_role |
author |
dc.contributor.author.fl_str_mv |
Tiago Daniel Sá Cunha |
dc.subject.por.fl_str_mv |
Ciências da engenharia e tecnologias Engineering and technology |
topic |
Ciências da engenharia e tecnologias Engineering and technology |
description |
Online social networks present themselves as valuable information sources about their users and their respective interests. Such information has been subjected to many studies conducted by Data Mining scholars throughout the world in order to discover users' behaviours and patterns. Besides, there has also been also investment applied in creating platforms for the continuous information extraction and for their data visualization. This dissertation aims to identify tweet profiles by analysing multiple types of data: spatial, temporal, social and content. The goals set are to develop a information extraction approach to validate dimensional combination and to develop a visualization tool capable of displaying the patterns found using state of the art representations. The data mining process is composed by dissimilarity matrices computation, normalization and combination. Each dissimilarity matrix is then subjected to a clustering algorithm that retrieves the information. This dissertation studies in depth appropriate distance functions for the different types of data, the normalization and combination methods available for different dimensions and the clustering algorithms existent. The visualization platform is designed for a dynamic and intuitive usage, aimed at revealing the discovered patterns in the data mining process in an understandable and interactive manner. In order to accomplish such, various visualization patterns were studied and widgets chosen to better represent the information retrieved. The study case on which it will be applied is the geo-referenced data from TwitterEcho, although it will be developed to use any geo-referenced tweets extracted form Twitter itself. |
publishDate |
2013 |
dc.date.none.fl_str_mv |
2013-07-08 2013-07-08T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://hdl.handle.net/10216/68545 |
url |
https://hdl.handle.net/10216/68545 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136105580199937 |