Unsupervised learning of ontology for the medical domain
Autor(a) principal: | |
---|---|
Data de Publicação: | 2009 |
Tipo de documento: | Dissertação |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10400.6/3873 |
Resumo: | Tom Gruber (1993) defines Ontology as ”an explicit specification of a conceptualization.” Due two the enormous quantity of information available, there is a growing number of applications that perform tasks where lexical-semantic resources are needed, like Information Retrieval, intelligent search or machine translation. This shows that Natural Language Processing is becoming more dependent on semantic information. One of the main motivations in ontology building is the possibility of knowledge sharing and reuse across different applications. The start point is to fixed a particular domain (like medicine), which is expected to be the base of domain knowledge for a variety of applications. This is a difficult task as the domain knowledge strongly depends on the particular task at hand. This paper is an approach on ontology learning, for which it was selected the Medical Domain, so that we could have a base to compare and evaluate the resulting ontology. In our approach, we use different techniques, like Asymmetric Association Measures, clustering algorithm and text rank algorithm, so that we can obtain relations between a set of terms, which are rank by the degree of generality, like the cluster obtained by applying clustering algorithms, with the confidence measure as the values for the similarity matrix, to the set of terms, the generality clusters. Those clusters are then submitted to clustering algorithm, but with Symmetric Conditional Probability values in the similarity matrix, to obtain domain clusters within the generality clusters. In the future, this ontology may be used in acquisition of Lexical Chains for Text Summarization, as in other Natural Language Processing applications. |
id |
RCAP_ac9264e9ce3c886467c69288de54034e |
---|---|
oai_identifier_str |
oai:ubibliorum.ubi.pt:10400.6/3873 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Unsupervised learning of ontology for the medical domainCiência da computação - Ciências da informação - OntologiaWeb semântica - OntologiaUMLS (Unified Medical Language System)Domínio/Área Científica:Engenharia e TecnologiaTom Gruber (1993) defines Ontology as ”an explicit specification of a conceptualization.” Due two the enormous quantity of information available, there is a growing number of applications that perform tasks where lexical-semantic resources are needed, like Information Retrieval, intelligent search or machine translation. This shows that Natural Language Processing is becoming more dependent on semantic information. One of the main motivations in ontology building is the possibility of knowledge sharing and reuse across different applications. The start point is to fixed a particular domain (like medicine), which is expected to be the base of domain knowledge for a variety of applications. This is a difficult task as the domain knowledge strongly depends on the particular task at hand. This paper is an approach on ontology learning, for which it was selected the Medical Domain, so that we could have a base to compare and evaluate the resulting ontology. In our approach, we use different techniques, like Asymmetric Association Measures, clustering algorithm and text rank algorithm, so that we can obtain relations between a set of terms, which are rank by the degree of generality, like the cluster obtained by applying clustering algorithms, with the confidence measure as the values for the similarity matrix, to the set of terms, the generality clusters. Those clusters are then submitted to clustering algorithm, but with Symmetric Conditional Probability values in the similarity matrix, to obtain domain clusters within the generality clusters. In the future, this ontology may be used in acquisition of Lexical Chains for Text Summarization, as in other Natural Language Processing applications.Dias, Gaël Harry Adélio AndréuBibliorumBastos, Sónia Margarida Ferreira de2015-10-29T10:26:42Z20092009-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.6/3873enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-12-15T09:40:28Zoai:ubibliorum.ubi.pt:10400.6/3873Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:45:12.142449Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Unsupervised learning of ontology for the medical domain |
title |
Unsupervised learning of ontology for the medical domain |
spellingShingle |
Unsupervised learning of ontology for the medical domain Bastos, Sónia Margarida Ferreira de Ciência da computação - Ciências da informação - Ontologia Web semântica - Ontologia UMLS (Unified Medical Language System) Domínio/Área Científica:Engenharia e Tecnologia |
title_short |
Unsupervised learning of ontology for the medical domain |
title_full |
Unsupervised learning of ontology for the medical domain |
title_fullStr |
Unsupervised learning of ontology for the medical domain |
title_full_unstemmed |
Unsupervised learning of ontology for the medical domain |
title_sort |
Unsupervised learning of ontology for the medical domain |
author |
Bastos, Sónia Margarida Ferreira de |
author_facet |
Bastos, Sónia Margarida Ferreira de |
author_role |
author |
dc.contributor.none.fl_str_mv |
Dias, Gaël Harry Adélio André uBibliorum |
dc.contributor.author.fl_str_mv |
Bastos, Sónia Margarida Ferreira de |
dc.subject.por.fl_str_mv |
Ciência da computação - Ciências da informação - Ontologia Web semântica - Ontologia UMLS (Unified Medical Language System) Domínio/Área Científica:Engenharia e Tecnologia |
topic |
Ciência da computação - Ciências da informação - Ontologia Web semântica - Ontologia UMLS (Unified Medical Language System) Domínio/Área Científica:Engenharia e Tecnologia |
description |
Tom Gruber (1993) defines Ontology as ”an explicit specification of a conceptualization.” Due two the enormous quantity of information available, there is a growing number of applications that perform tasks where lexical-semantic resources are needed, like Information Retrieval, intelligent search or machine translation. This shows that Natural Language Processing is becoming more dependent on semantic information. One of the main motivations in ontology building is the possibility of knowledge sharing and reuse across different applications. The start point is to fixed a particular domain (like medicine), which is expected to be the base of domain knowledge for a variety of applications. This is a difficult task as the domain knowledge strongly depends on the particular task at hand. This paper is an approach on ontology learning, for which it was selected the Medical Domain, so that we could have a base to compare and evaluate the resulting ontology. In our approach, we use different techniques, like Asymmetric Association Measures, clustering algorithm and text rank algorithm, so that we can obtain relations between a set of terms, which are rank by the degree of generality, like the cluster obtained by applying clustering algorithms, with the confidence measure as the values for the similarity matrix, to the set of terms, the generality clusters. Those clusters are then submitted to clustering algorithm, but with Symmetric Conditional Probability values in the similarity matrix, to obtain domain clusters within the generality clusters. In the future, this ontology may be used in acquisition of Lexical Chains for Text Summarization, as in other Natural Language Processing applications. |
publishDate |
2009 |
dc.date.none.fl_str_mv |
2009 2009-01-01T00:00:00Z 2015-10-29T10:26:42Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10400.6/3873 |
url |
http://hdl.handle.net/10400.6/3873 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136348276260864 |