Unsupervised learning of ontology for the medical domain

Detalhes bibliográficos
Autor(a) principal: Bastos, Sónia Margarida Ferreira de
Data de Publicação: 2009
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10400.6/3873
Resumo: Tom Gruber (1993) defines Ontology as ”an explicit specification of a conceptualization.” Due two the enormous quantity of information available, there is a growing number of applications that perform tasks where lexical-semantic resources are needed, like Information Retrieval, intelligent search or machine translation. This shows that Natural Language Processing is becoming more dependent on semantic information. One of the main motivations in ontology building is the possibility of knowledge sharing and reuse across different applications. The start point is to fixed a particular domain (like medicine), which is expected to be the base of domain knowledge for a variety of applications. This is a difficult task as the domain knowledge strongly depends on the particular task at hand. This paper is an approach on ontology learning, for which it was selected the Medical Domain, so that we could have a base to compare and evaluate the resulting ontology. In our approach, we use different techniques, like Asymmetric Association Measures, clustering algorithm and text rank algorithm, so that we can obtain relations between a set of terms, which are rank by the degree of generality, like the cluster obtained by applying clustering algorithms, with the confidence measure as the values for the similarity matrix, to the set of terms, the generality clusters. Those clusters are then submitted to clustering algorithm, but with Symmetric Conditional Probability values in the similarity matrix, to obtain domain clusters within the generality clusters. In the future, this ontology may be used in acquisition of Lexical Chains for Text Summarization, as in other Natural Language Processing applications.
id RCAP_ac9264e9ce3c886467c69288de54034e
oai_identifier_str oai:ubibliorum.ubi.pt:10400.6/3873
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Unsupervised learning of ontology for the medical domainCiência da computação - Ciências da informação - OntologiaWeb semântica - OntologiaUMLS (Unified Medical Language System)Domínio/Área Científica:Engenharia e TecnologiaTom Gruber (1993) defines Ontology as ”an explicit specification of a conceptualization.” Due two the enormous quantity of information available, there is a growing number of applications that perform tasks where lexical-semantic resources are needed, like Information Retrieval, intelligent search or machine translation. This shows that Natural Language Processing is becoming more dependent on semantic information. One of the main motivations in ontology building is the possibility of knowledge sharing and reuse across different applications. The start point is to fixed a particular domain (like medicine), which is expected to be the base of domain knowledge for a variety of applications. This is a difficult task as the domain knowledge strongly depends on the particular task at hand. This paper is an approach on ontology learning, for which it was selected the Medical Domain, so that we could have a base to compare and evaluate the resulting ontology. In our approach, we use different techniques, like Asymmetric Association Measures, clustering algorithm and text rank algorithm, so that we can obtain relations between a set of terms, which are rank by the degree of generality, like the cluster obtained by applying clustering algorithms, with the confidence measure as the values for the similarity matrix, to the set of terms, the generality clusters. Those clusters are then submitted to clustering algorithm, but with Symmetric Conditional Probability values in the similarity matrix, to obtain domain clusters within the generality clusters. In the future, this ontology may be used in acquisition of Lexical Chains for Text Summarization, as in other Natural Language Processing applications.Dias, Gaël Harry Adélio AndréuBibliorumBastos, Sónia Margarida Ferreira de2015-10-29T10:26:42Z20092009-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttp://hdl.handle.net/10400.6/3873enginfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-12-15T09:40:28Zoai:ubibliorum.ubi.pt:10400.6/3873Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T00:45:12.142449Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Unsupervised learning of ontology for the medical domain
title Unsupervised learning of ontology for the medical domain
spellingShingle Unsupervised learning of ontology for the medical domain
Bastos, Sónia Margarida Ferreira de
Ciência da computação - Ciências da informação - Ontologia
Web semântica - Ontologia
UMLS (Unified Medical Language System)
Domínio/Área Científica:Engenharia e Tecnologia
title_short Unsupervised learning of ontology for the medical domain
title_full Unsupervised learning of ontology for the medical domain
title_fullStr Unsupervised learning of ontology for the medical domain
title_full_unstemmed Unsupervised learning of ontology for the medical domain
title_sort Unsupervised learning of ontology for the medical domain
author Bastos, Sónia Margarida Ferreira de
author_facet Bastos, Sónia Margarida Ferreira de
author_role author
dc.contributor.none.fl_str_mv Dias, Gaël Harry Adélio André
uBibliorum
dc.contributor.author.fl_str_mv Bastos, Sónia Margarida Ferreira de
dc.subject.por.fl_str_mv Ciência da computação - Ciências da informação - Ontologia
Web semântica - Ontologia
UMLS (Unified Medical Language System)
Domínio/Área Científica:Engenharia e Tecnologia
topic Ciência da computação - Ciências da informação - Ontologia
Web semântica - Ontologia
UMLS (Unified Medical Language System)
Domínio/Área Científica:Engenharia e Tecnologia
description Tom Gruber (1993) defines Ontology as ”an explicit specification of a conceptualization.” Due two the enormous quantity of information available, there is a growing number of applications that perform tasks where lexical-semantic resources are needed, like Information Retrieval, intelligent search or machine translation. This shows that Natural Language Processing is becoming more dependent on semantic information. One of the main motivations in ontology building is the possibility of knowledge sharing and reuse across different applications. The start point is to fixed a particular domain (like medicine), which is expected to be the base of domain knowledge for a variety of applications. This is a difficult task as the domain knowledge strongly depends on the particular task at hand. This paper is an approach on ontology learning, for which it was selected the Medical Domain, so that we could have a base to compare and evaluate the resulting ontology. In our approach, we use different techniques, like Asymmetric Association Measures, clustering algorithm and text rank algorithm, so that we can obtain relations between a set of terms, which are rank by the degree of generality, like the cluster obtained by applying clustering algorithms, with the confidence measure as the values for the similarity matrix, to the set of terms, the generality clusters. Those clusters are then submitted to clustering algorithm, but with Symmetric Conditional Probability values in the similarity matrix, to obtain domain clusters within the generality clusters. In the future, this ontology may be used in acquisition of Lexical Chains for Text Summarization, as in other Natural Language Processing applications.
publishDate 2009
dc.date.none.fl_str_mv 2009
2009-01-01T00:00:00Z
2015-10-29T10:26:42Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10400.6/3873
url http://hdl.handle.net/10400.6/3873
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136348276260864