Identificação de Bioprocessos em textos

Bibliographic details
Main author: Vânia Alice Sousa Leite
Publication date: 2017
Document type: Dissertation
Language: eng
Source title: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Full text: https://repositorio-aberto.up.pt/handle/10216/106604
Abstract: Due to the large diversity, heterogeneity and ever-growing rate of publications made electronically available in databases such as PubMed, biomedical researchers spend a lot of time and effort searching for the available information in their area of research. Several issues cause this difficulty, among them the fact that the same object or activity in the biomedical field can be expressed in various forms, including orthographic variants and abbreviations, a variety that most standard publication search engines cannot handle. Biomedical Text Mining (BTM), the field that deals with automatic retrieval and processing of biomedical literature, is therefore a very promising research field, namely in the retrieval of biological elements or concepts, working towards automated curation tools that help researchers cope with this information overload. This dissertation aims to develop a tool to automatically extract biological processes from texts, making use of state-of-the-art BTM tasks such as named entity recognition (NER), ontological knowledge and classification, and integrating different tools for knowledge discovery in texts, for example Genia Tagger (which provides text information such as base forms, lemmas, chunks, part-of-speech tags (POST) and named entities) and UMLS MetaMap (a program developed to discover UMLS Metathesaurus concepts referred to in texts). It experiments with different settings and tools to find the best way to combine them and help researchers find relevant information for their studies more quickly.
id RCAP_99ffcd64d23177a3c56cd2582c94933f
oai_identifier_str oai:repositorio-aberto.up.pt:10216/106604
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
dc.title.none.fl_str_mv Identificação de Bioprocessos em textos
author Vânia Alice Sousa Leite
author_facet Vânia Alice Sousa Leite
author_role author
dc.contributor.author.fl_str_mv Vânia Alice Sousa Leite
dc.subject.por.fl_str_mv Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
topic Engenharia electrotécnica, electrónica e informática
Electrical engineering, Electronic engineering, Information engineering
publishDate 2017
dc.date.none.fl_str_mv 2017-07-13
2017-07-13T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio-aberto.up.pt/handle/10216/106604
TID:201803895
url https://repositorio-aberto.up.pt/handle/10216/106604
identifier_str_mv TID:201803895
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação