Identification of health concepts in social networks

Matos, André Pereira de

Identification of health concepts in social networks

Detalhes bibliográficos
Autor(a) principal:	Matos, André Pereira de
Data de Publicação:	2014
Tipo de documento:	Dissertação
Idioma:	eng
Título da fonte:	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo:	http://hdl.handle.net/10773/14716
Resumo:	Social Networks are, without doubt, one of the fastest growing trends on the Internet nowadays and the amount of information generated is huge. On the other hand, search engines such as Google, Yahoo and Bing became global interfaces for information search. From an enterprise point of view, this phenomenon has lead to changes in strategies to deal with information representation, indexing and searching. Engines like Lucene, Nutch, Solr, Sphinx, ElasticSearch, among others, were born and became important platforms of software engineering. The amount of information available on the Internet on a daily basis, can be utterly valuable for multiple goals - social, scientific, politic, economic. For instance, in the case of health related subjects, institutions keep information on patients such as pathologies, conditions, treatments, exam results, pathways, etc. Also, governmental institutions divulge more and more information on health subjects in the most varied aspects. On the other hand, health research is one of the most active areas, resulting on network publishing of new results. Adding to the aforementioned, the way people share information on their health status can be valuable to detect pandemics and also to identify clinical conditions and their causes. In this project we proposed to develop a system capable of processing information retrieved from social networks, identifying symptoms, disorders and drugs. From an engineering point of view, the process consisted in accessing varied data sources to obtain information, create dictionaries and models of natural language processing to identify text patterns and associations between concepts. In the future, these associations can be valuable, for instance, to detect adverse reaction to drugs.

Metadados do item

id	RCAP_723f05d3d5cdf1559a11c87a02898914
oai_identifier_str	oai:ria.ua.pt:10773/14716
network_acronym_str	RCAP
network_name_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str	7160
spelling	Identification of health concepts in social networksEngenharia de computadoresRedes sociais - Análise de dadosRecuperação da informaçãoMedicamentos - ConsumoSocial Networks are, without doubt, one of the fastest growing trends on the Internet nowadays and the amount of information generated is huge. On the other hand, search engines such as Google, Yahoo and Bing became global interfaces for information search. From an enterprise point of view, this phenomenon has lead to changes in strategies to deal with information representation, indexing and searching. Engines like Lucene, Nutch, Solr, Sphinx, ElasticSearch, among others, were born and became important platforms of software engineering. The amount of information available on the Internet on a daily basis, can be utterly valuable for multiple goals - social, scientific, politic, economic. For instance, in the case of health related subjects, institutions keep information on patients such as pathologies, conditions, treatments, exam results, pathways, etc. Also, governmental institutions divulge more and more information on health subjects in the most varied aspects. On the other hand, health research is one of the most active areas, resulting on network publishing of new results. Adding to the aforementioned, the way people share information on their health status can be valuable to detect pandemics and also to identify clinical conditions and their causes. In this project we proposed to develop a system capable of processing information retrieved from social networks, identifying symptoms, disorders and drugs. From an engineering point of view, the process consisted in accessing varied data sources to obtain information, create dictionaries and models of natural language processing to identify text patterns and associations between concepts. In the future, these associations can be valuable, for instance, to detect adverse reaction to drugs.No últimos anos, blogs e redes sociais tiveram um crescimento imensurável, tanto em número de utilizadores como em receita gerada. Por outro lado, portais como o Google, Yahoo e Bing tornaram-se interfaces privilegiadas de procura de informação a nível global. Sob um ponto de vista empresarial, este fenómeno tem igualmente levado à mudança de estratégias de representação, indexação e pesquisa de informação. Surgiram motores como Lucene, Nutch, Solr, Sphinx, ElasticSearch, entre muitos outros, que são hoje sistemas incontornáveis na engenharia do software. A informação que diariamente é disponibilizada na Internet pode ter um enorme valor em múltiplas vertentes - social, científica, política, económica. Por exemplo, na área da saúde, as instituições registam cada vez mais informação sobre utentes, procedimentos, exames, diagnósticos, etc. Instituições governamentais divulgam cada vez mais informação sobre saúde nos seus mais variados aspectos. Por outro lado, a investigação em saúde é uma das áreas mais ativas, resultando na publicação em rede de novos resultados. Para além disto, a forma como os cidadãos trocam informação sobre o seu estado de saúde pode igualmente ajudar a detectar pandemias bem como a identificar condições clínicas e as suas causas. Neste projeto pretendeu-se desenvolver um sistema capaz de processar informação proveniente de redes sociais, identificando sinais, sintomas, doenças e medicamentos. Do ponto de vista de engenharia, o processo consistiu em aceder a diversas fontes de dados para obter informação, criar dicionários e modelos de processamento de linguagem natural para identificar padrões no texto e associações entre conceitos. Associações essas que, futuramente, poderão ser um passo para a detecção de reacções adversas a medicamentos.Universidade de Aveiro2015-09-24T11:13:34Z2014-01-01T00:00:00Z2014info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10773/14716TID:201565684engMatos, André Pereira deinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-02-22T11:26:58Zoai:ria.ua.pt:10773/14716Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T02:50:14.532254Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv	Identification of health concepts in social networks
title	Identification of health concepts in social networks
spellingShingle	Identification of health concepts in social networks Matos, André Pereira de Engenharia de computadores Redes sociais - Análise de dados Recuperação da informação Medicamentos - Consumo
title_short	Identification of health concepts in social networks
title_full	Identification of health concepts in social networks
title_fullStr	Identification of health concepts in social networks
title_full_unstemmed	Identification of health concepts in social networks
title_sort	Identification of health concepts in social networks
author	Matos, André Pereira de
author_facet	Matos, André Pereira de
author_role	author
dc.contributor.author.fl_str_mv	Matos, André Pereira de
dc.subject.por.fl_str_mv	Engenharia de computadores Redes sociais - Análise de dados Recuperação da informação Medicamentos - Consumo
topic	Engenharia de computadores Redes sociais - Análise de dados Recuperação da informação Medicamentos - Consumo
description	Social Networks are, without doubt, one of the fastest growing trends on the Internet nowadays and the amount of information generated is huge. On the other hand, search engines such as Google, Yahoo and Bing became global interfaces for information search. From an enterprise point of view, this phenomenon has lead to changes in strategies to deal with information representation, indexing and searching. Engines like Lucene, Nutch, Solr, Sphinx, ElasticSearch, among others, were born and became important platforms of software engineering. The amount of information available on the Internet on a daily basis, can be utterly valuable for multiple goals - social, scientific, politic, economic. For instance, in the case of health related subjects, institutions keep information on patients such as pathologies, conditions, treatments, exam results, pathways, etc. Also, governmental institutions divulge more and more information on health subjects in the most varied aspects. On the other hand, health research is one of the most active areas, resulting on network publishing of new results. Adding to the aforementioned, the way people share information on their health status can be valuable to detect pandemics and also to identify clinical conditions and their causes. In this project we proposed to develop a system capable of processing information retrieved from social networks, identifying symptoms, disorders and drugs. From an engineering point of view, the process consisted in accessing varied data sources to obtain information, create dictionaries and models of natural language processing to identify text patterns and associations between concepts. In the future, these associations can be valuable, for instance, to detect adverse reaction to drugs.
publishDate	2014
dc.date.none.fl_str_mv	2014-01-01T00:00:00Z 2014 2015-09-24T11:13:34Z
dc.type.status.fl_str_mv	info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv	info:eu-repo/semantics/masterThesis
format	masterThesis
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	http://hdl.handle.net/10773/14716 TID:201565684
url	http://hdl.handle.net/10773/14716
identifier_str_mv	TID:201565684
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.format.none.fl_str_mv	application/pdf
dc.publisher.none.fl_str_mv	Universidade de Aveiro
publisher.none.fl_str_mv	Universidade de Aveiro
dc.source.none.fl_str_mv	reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP
instname_str	Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str	RCAAP
institution	RCAAP
reponame_str	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv	Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_	1799137553009344512

Identification of health concepts in social networks

Registros relacionados