Identification of health concepts in social networks

Detalhes bibliográficos
Autor(a) principal: Matos, André Pereira de
Data de Publicação: 2014
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10773/14716
Resumo: Social Networks are, without doubt, one of the fastest growing trends on the Internet nowadays and the amount of information generated is huge. On the other hand, search engines such as Google, Yahoo and Bing became global interfaces for information search. From an enterprise point of view, this phenomenon has lead to changes in strategies to deal with information representation, indexing and searching. Engines like Lucene, Nutch, Solr, Sphinx, ElasticSearch, among others, were born and became important platforms of software engineering. The amount of information available on the Internet on a daily basis, can be utterly valuable for multiple goals - social, scientific, politic, economic. For instance, in the case of health related subjects, institutions keep information on patients such as pathologies, conditions, treatments, exam results, pathways, etc. Also, governmental institutions divulge more and more information on health subjects in the most varied aspects. On the other hand, health research is one of the most active areas, resulting on network publishing of new results. Adding to the aforementioned, the way people share information on their health status can be valuable to detect pandemics and also to identify clinical conditions and their causes. In this project we proposed to develop a system capable of processing information retrieved from social networks, identifying symptoms, disorders and drugs. From an engineering point of view, the process consisted in accessing varied data sources to obtain information, create dictionaries and models of natural language processing to identify text patterns and associations between concepts. In the future, these associations can be valuable, for instance, to detect adverse reaction to drugs.
id RCAP_723f05d3d5cdf1559a11c87a02898914
oai_identifier_str oai:ria.ua.pt:10773/14716
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Identification of health concepts in social networksEngenharia de computadoresRedes sociais - Análise de dadosRecuperação da informaçãoMedicamentos - ConsumoSocial Networks are, without doubt, one of the fastest growing trends on the Internet nowadays and the amount of information generated is huge. On the other hand, search engines such as Google, Yahoo and Bing became global interfaces for information search. From an enterprise point of view, this phenomenon has lead to changes in strategies to deal with information representation, indexing and searching. Engines like Lucene, Nutch, Solr, Sphinx, ElasticSearch, among others, were born and became important platforms of software engineering. The amount of information available on the Internet on a daily basis, can be utterly valuable for multiple goals - social, scientific, politic, economic. For instance, in the case of health related subjects, institutions keep information on patients such as pathologies, conditions, treatments, exam results, pathways, etc. Also, governmental institutions divulge more and more information on health subjects in the most varied aspects. On the other hand, health research is one of the most active areas, resulting on network publishing of new results. Adding to the aforementioned, the way people share information on their health status can be valuable to detect pandemics and also to identify clinical conditions and their causes. In this project we proposed to develop a system capable of processing information retrieved from social networks, identifying symptoms, disorders and drugs. From an engineering point of view, the process consisted in accessing varied data sources to obtain information, create dictionaries and models of natural language processing to identify text patterns and associations between concepts. In the future, these associations can be valuable, for instance, to detect adverse reaction to drugs.No últimos anos, blogs e redes sociais tiveram um crescimento imensurável, tanto em número de utilizadores como em receita gerada. Por outro lado, portais como o Google, Yahoo e Bing tornaram-se interfaces privilegiadas de procura de informação a nível global. Sob um ponto de vista empresarial, este fenómeno tem igualmente levado à mudança de estratégias de representação, indexação e pesquisa de informação. Surgiram motores como Lucene, Nutch, Solr, Sphinx, ElasticSearch, entre muitos outros, que são hoje sistemas incontornáveis na engenharia do software. A informação que diariamente é disponibilizada na Internet pode ter um enorme valor em múltiplas vertentes - social, científica, política, económica. Por exemplo, na área da saúde, as instituições registam cada vez mais informação sobre utentes, procedimentos, exames, diagnósticos, etc. Instituições governamentais divulgam cada vez mais informação sobre saúde nos seus mais variados aspectos. Por outro lado, a investigação em saúde é uma das áreas mais ativas, resultando na publicação em rede de novos resultados. Para além disto, a forma como os cidadãos trocam informação sobre o seu estado de saúde pode igualmente ajudar a detectar pandemias bem como a identificar condições clínicas e as suas causas. Neste projeto pretendeu-se desenvolver um sistema capaz de processar informação proveniente de redes sociais, identificando sinais, sintomas, doenças e medicamentos. Do ponto de vista de engenharia, o processo consistiu em aceder a diversas fontes de dados para obter informação, criar dicionários e modelos de processamento de linguagem natural para identificar padrões no texto e associações entre conceitos. Associações essas que, futuramente, poderão ser um passo para a detecção de reacções adversas a medicamentos.Universidade de Aveiro2015-09-24T11:13:34Z2014-01-01T00:00:00Z2014info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/14716TID:201565684engMatos, André Pereira deinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T11:26:58Zoai:ria.ua.pt:10773/14716Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T02:50:14.532254Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Identification of health concepts in social networks
title Identification of health concepts in social networks
spellingShingle Identification of health concepts in social networks
Matos, André Pereira de
Engenharia de computadores
Redes sociais - Análise de dados
Recuperação da informação
Medicamentos - Consumo
title_short Identification of health concepts in social networks
title_full Identification of health concepts in social networks
title_fullStr Identification of health concepts in social networks
title_full_unstemmed Identification of health concepts in social networks
title_sort Identification of health concepts in social networks
author Matos, André Pereira de
author_facet Matos, André Pereira de
author_role author
dc.contributor.author.fl_str_mv Matos, André Pereira de
dc.subject.por.fl_str_mv Engenharia de computadores
Redes sociais - Análise de dados
Recuperação da informação
Medicamentos - Consumo
topic Engenharia de computadores
Redes sociais - Análise de dados
Recuperação da informação
Medicamentos - Consumo
description Social Networks are, without doubt, one of the fastest growing trends on the Internet nowadays and the amount of information generated is huge. On the other hand, search engines such as Google, Yahoo and Bing became global interfaces for information search. From an enterprise point of view, this phenomenon has lead to changes in strategies to deal with information representation, indexing and searching. Engines like Lucene, Nutch, Solr, Sphinx, ElasticSearch, among others, were born and became important platforms of software engineering. The amount of information available on the Internet on a daily basis, can be utterly valuable for multiple goals - social, scientific, politic, economic. For instance, in the case of health related subjects, institutions keep information on patients such as pathologies, conditions, treatments, exam results, pathways, etc. Also, governmental institutions divulge more and more information on health subjects in the most varied aspects. On the other hand, health research is one of the most active areas, resulting on network publishing of new results. Adding to the aforementioned, the way people share information on their health status can be valuable to detect pandemics and also to identify clinical conditions and their causes. In this project we proposed to develop a system capable of processing information retrieved from social networks, identifying symptoms, disorders and drugs. From an engineering point of view, the process consisted in accessing varied data sources to obtain information, create dictionaries and models of natural language processing to identify text patterns and associations between concepts. In the future, these associations can be valuable, for instance, to detect adverse reaction to drugs.
publishDate 2014
dc.date.none.fl_str_mv 2014-01-01T00:00:00Z
2014
2015-09-24T11:13:34Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10773/14716
TID:201565684
url http://hdl.handle.net/10773/14716
identifier_str_mv TID:201565684
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade de Aveiro
publisher.none.fl_str_mv Universidade de Aveiro
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799137553009344512