Identificação de Bioprocessos em textos
Main author: | Vânia Alice Sousa Leite |
---|---|
Publication date: | 2017 |
Document type: | Dissertation |
Language: | eng |
Source title: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
Full text: | https://repositorio-aberto.up.pt/handle/10216/106604 |
Abstract: | Due to the large diversity, heterogeneity and ever-growing rate of publications made electronically available in databases such as PubMed, biomedical researchers spend a lot of time and effort searching for the available information in their area of research. Several issues cause this difficulty, among them the fact that the same object or activity in the biomedical field can be expressed in various forms, including orthographic variants and abbreviations, which most standard publication search engines cannot handle. Biomedical Text Mining (BTM), the field that deals with automatic retrieval and processing of biomedical literature, is therefore a very promising research field, namely in the retrieval of biological elements or concepts, working towards automated curation tools that help researchers cope with this information overload. This dissertation aims to develop a tool to automatically extract biological processes from texts, making use of state-of-the-art BTM tasks such as NER, ontological knowledge and classification, and integrating different tools for knowledge discovery in texts, for example Genia Tagger (which provides text information such as base forms, lemma, chunk, part-of-speech tag (POST) and named entities) and UMLS MetaMap (a program developed to discover UMLS Metathesaurus concepts referred to in texts). Different settings and tools are tested to find the best way to combine them and help researchers find relevant information for their studies more quickly. |
id |
RCAP_99ffcd64d23177a3c56cd2582c94933f |
---|---|
oai_identifier_str |
oai:repositorio-aberto.up.pt:10216/106604 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository_id_str |
7160 |
dc.title.none.fl_str_mv |
Identificação de Bioprocessos em textos |
title |
Identificação de Bioprocessos em textos |
author |
Vânia Alice Sousa Leite |
author_facet |
Vânia Alice Sousa Leite |
author_role |
author |
dc.contributor.author.fl_str_mv |
Vânia Alice Sousa Leite |
dc.subject.por.fl_str_mv |
Engenharia electrotécnica, electrónica e informática; Electrical engineering, Electronic engineering, Information engineering |
topic |
Engenharia electrotécnica, electrónica e informática; Electrical engineering, Electronic engineering, Information engineering |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-07-13 2017-07-13T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://repositorio-aberto.up.pt/handle/10216/106604 TID:201803895 |
url |
https://repositorio-aberto.up.pt/handle/10216/106604 |
identifier_str_mv |
TID:201803895 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Científicos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
_version_ |
1799136064909082624 |