Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1

Detalhes bibliográficos
Autor(a) principal: Schiessl, Marcelo
Data de Publicação: 2017
Outros Autores: Bräscher, Marisa
Tipo de documento: Artigo
Idioma: por
Título da fonte: Transinformação (Online)
Texto Completo: https://periodicos.puc-campinas.edu.br/transinfo/article/view/5980
Resumo: The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, thesemi-automatic creation of a lexical databasein Brazilian Portuguese containing morphological, syntactic, and semantic informationthat can be read by machines was proposed, allowing the link between structured and unstructured data and its integration intoan information retrieval model to improve precision. The results obtained demonstrated that the methodology can be used in therisco financeiro (financial risk) domain in Portuguese for the construction of an ontology and the lexical-semantic database andthe proposal of a semantic information retrieval model. In order to evaluate the performance of the proposed model, documentscontaining the main definitions of the financial risk domain were selected and indexed with and without semantic annotation. Toenable the comparison between the approaches, two databases were created based on the texts with the semantic annotationsto represent the semantic search. The first one represents the traditional search and the second contained the index built based onthe texts with the semantic annotations to represent the semantic search. The evaluation of the proposal was based on recall andprecision. The queries submitted to the model showed that the semantic search outperforms the traditional search and validatesthe methodology used. Although more complex, the procedure proposed can be used in all kinds of domains.
id PUC_CAMP-4_bd339d2cac29a91f3ee5a48853192b2d
oai_identifier_str oai:ojs.periodicos.puc-campinas.edu.br:article/5980
network_acronym_str PUC_CAMP-4
network_name_str Transinformação (Online)
repository_id_str
spelling Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, thesemi-automatic creation of a lexical databasein Brazilian Portuguese containing morphological, syntactic, and semantic informationthat can be read by machines was proposed, allowing the link between structured and unstructured data and its integration intoan information retrieval model to improve precision. The results obtained demonstrated that the methodology can be used in therisco financeiro (financial risk) domain in Portuguese for the construction of an ontology and the lexical-semantic database andthe proposal of a semantic information retrieval model. In order to evaluate the performance of the proposed model, documentscontaining the main definitions of the financial risk domain were selected and indexed with and without semantic annotation. Toenable the comparison between the approaches, two databases were created based on the texts with the semantic annotationsto represent the semantic search. The first one represents the traditional search and the second contained the index built based onthe texts with the semantic annotations to represent the semantic search. The evaluation of the proposal was based on recall andprecision. The queries submitted to the model showed that the semantic search outperforms the traditional search and validatesthe methodology used. Although more complex, the procedure proposed can be used in all kinds of domains.Esta proposta visa representar a linguagem natural na forma adequada às ontologias e vice-versa. Para tanto, propõe-se à criaçãosemiautomática de base de léxicos em português brasileiro, contendo informações morfológicas, sintáticas e semânticas apropriadas para a leitura por máquinas, permitindo vincular dados estruturados e não estruturados, bem como integrar a leitura em modelo de recuperação da informação para aumentar a precisão. Os resultados alcançados demonstram a utilização da metodologia, no domínio de risco financeiro em português, para a elaboração da ontologia, da base léxico-semântica e da proposta do modelo de recuperação da informação semântica. Para avaliar a performance do modelo proposto, foram selecionados documentos contendo as principais definições do domínio de risco financeiro. Esses foram indexados com e sem anotação semântica. Para possibilitar a comparação entre as abordagens, foram criadas duas bases, a primeira representando a busca tradicional, e a segunda contendo o índice construído, a partir dos textos com as anotações semânticas para representar a busca semântica. A avaliação da proposta é baseada na revocação e na precisão. As consultas submetidas ao modelo mostram que a busca semântica supera o desempenho da tradicional e validam a metodologia empregada. O procedimento, embora adicione complexidade em sua elaboração, pode ser reproduzido em qualquer outro domínio.Núcleo de Editoração - PUC-Campinas2017-03-25info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionPeer-reviewed ArticleArtículo revisado por paresAvaliado pelos Paresapplication/pdfhttps://periodicos.puc-campinas.edu.br/transinfo/article/view/5980Transinformação; Vol. 29 No. 1 (2017)Transinformação; Vol. 29 Núm. 1 (2017)Transinformação; v. 29 n. 1 (2017)2318-08890103-3786reponame:Transinformação (Online)instname:Pontifícia Universidade Católica de Campinas (PUC-CAMPINAS)instacron:PUC_CAMPporhttps://periodicos.puc-campinas.edu.br/transinfo/article/view/5980/3709Copyright (c) 2022 Transinformaçãohttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessSchiessl, Marcelo Bräscher, Marisa 2024-04-02T11:38:48Zoai:ojs.periodicos.puc-campinas.edu.br:article/5980Revistahttp://periodicos.puc-campinas.edu.br/seer/index.php/transinfo/indexPRIhttps://old.scielo.br/oai/scielo-oai.phpsbi.nucleodeeditoracao@puc-campinas.edu.br||transinfo@puc-campinas.edu.br2318-08890103-3786opendoar:2024-04-02T11:38:48Transinformação (Online) - Pontifícia Universidade Católica de Campinas (PUC-CAMPINAS)false
dc.title.none.fl_str_mv Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
title Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
spellingShingle Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
Schiessl, Marcelo
title_short Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
title_full Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
title_fullStr Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
title_full_unstemmed Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
title_sort Ontology lexicalization: Relationship between content and meaning in the context of Information Retrieval1
author Schiessl, Marcelo
author_facet Schiessl, Marcelo
Bräscher, Marisa
author_role author
author2 Bräscher, Marisa
author2_role author
dc.contributor.author.fl_str_mv Schiessl, Marcelo
Bräscher, Marisa
description The proposal presented in this study seeks to properly represent natural language to ontologies and vice-versa. Therefore, thesemi-automatic creation of a lexical databasein Brazilian Portuguese containing morphological, syntactic, and semantic informationthat can be read by machines was proposed, allowing the link between structured and unstructured data and its integration intoan information retrieval model to improve precision. The results obtained demonstrated that the methodology can be used in therisco financeiro (financial risk) domain in Portuguese for the construction of an ontology and the lexical-semantic database andthe proposal of a semantic information retrieval model. In order to evaluate the performance of the proposed model, documentscontaining the main definitions of the financial risk domain were selected and indexed with and without semantic annotation. Toenable the comparison between the approaches, two databases were created based on the texts with the semantic annotationsto represent the semantic search. The first one represents the traditional search and the second contained the index built based onthe texts with the semantic annotations to represent the semantic search. The evaluation of the proposal was based on recall andprecision. The queries submitted to the model showed that the semantic search outperforms the traditional search and validatesthe methodology used. Although more complex, the procedure proposed can be used in all kinds of domains.
publishDate 2017
dc.date.none.fl_str_mv 2017-03-25
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Peer-reviewed Article
Artículo revisado por pares
Avaliado pelos Pares
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://periodicos.puc-campinas.edu.br/transinfo/article/view/5980
url https://periodicos.puc-campinas.edu.br/transinfo/article/view/5980
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://periodicos.puc-campinas.edu.br/transinfo/article/view/5980/3709
dc.rights.driver.fl_str_mv Copyright (c) 2022 Transinformação
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2022 Transinformação
https://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Núcleo de Editoração - PUC-Campinas
publisher.none.fl_str_mv Núcleo de Editoração - PUC-Campinas
dc.source.none.fl_str_mv Transinformação; Vol. 29 No. 1 (2017)
Transinformação; Vol. 29 Núm. 1 (2017)
Transinformação; v. 29 n. 1 (2017)
2318-0889
0103-3786
reponame:Transinformação (Online)
instname:Pontifícia Universidade Católica de Campinas (PUC-CAMPINAS)
instacron:PUC_CAMP
instname_str Pontifícia Universidade Católica de Campinas (PUC-CAMPINAS)
instacron_str PUC_CAMP
institution PUC_CAMP
reponame_str Transinformação (Online)
collection Transinformação (Online)
repository.name.fl_str_mv Transinformação (Online) - Pontifícia Universidade Católica de Campinas (PUC-CAMPINAS)
repository.mail.fl_str_mv sbi.nucleodeeditoracao@puc-campinas.edu.br||transinfo@puc-campinas.edu.br
_version_ 1821141875028918272