Turning text into research networks: information retrieval and computational ontologies in the creation of scientific databases

Bibliographic details
Main author: Ceci, Flavio
Publication date: 2012
Other authors: Pietrobon, Ricardo; Gonçalves, Alexandre
Document type: Article
Language: eng
Source title: Repositório Universitário da Ânima (RUNA)
Full text: https://repositorio.animaeducacao.com.br/handle/ANIMA/2649
Abstract: Background: The number of web-based, free-text documents on science and technology has been growing. However, most of these documents are not immediately processable by computers, which slows down the acquisition of useful information. Computational ontologies might represent a possible solution by enabling semantically machine-readable data sets. However, the process of ontology creation, instantiation, and maintenance is still based on manual methodologies and is thus time- and cost-intensive. Method: We focused on a large corpus containing information on researchers, research fields, and institutions. We based our strategy on traditional entity recognition, social computing, and correlation. We devised a semi-automatic approach for the recognition, correlation, and extraction of named entities and relations from textual documents, which are then used to create, instantiate, and maintain an ontology. Results: We present a prototype demonstrating the applicability of the proposed strategy, along with a case study describing how direct and indirect relations can be extracted from academic and professional activities registered in a database of curricula vitae in free-text format. We present evidence that this system can identify entities to assist in the process of knowledge extraction and representation to support ontology maintenance. We also demonstrate the extraction of relationships among ontology classes and their instances. Conclusion: We have demonstrated that our system can be used to convert research information in free-text format into a database with a semantic structure. Future studies should test this system using the growing volume of free-text information available at the institutional and national levels.
id Ânima_f1e021ad5699155e49be11938d6b5b3d
oai_identifier_str oai:repositorio.animaeducacao.com.br:ANIMA/2649
network_acronym_str Ânima
network_name_str Repositório Universitário da Ânima (RUNA)
repository_id_str
dc.title.none.fl_str_mv Turning text into research networks: information retrieval and computational ontologies in the creation of scientific databases
dc.contributor.author.fl_str_mv Ceci, Flavio
Pietrobon, Ricardo
Gonçalves, Alexandre
dc.subject.por.fl_str_mv Ontology maintenance
Named entity recognition
Knowledge engineering
Computer Science
publishDate 2012
dc.date.none.fl_str_mv 2012
2017-08-10T13:09:39Z
2017-08-10T13:09:39Z
2020-11-26T17:37:56Z
2020-11-26T17:37:56Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv January
1932-6203
1
https://repositorio.animaeducacao.com.br/handle/ANIMA/2649
url https://repositorio.animaeducacao.com.br/handle/ANIMA/2649
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 7
dc.rights.driver.fl_str_mv Attribution-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nd/3.0/br/
info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv p.1-9
application/pdf
dc.coverage.none.fl_str_mv San Francisco
dc.source.none.fl_str_mv reponame:Repositório Universitário da Ânima (RUNA)
instname:Ânima Educação
instacron:Ânima
repository.name.fl_str_mv Repositório Universitário da Ânima (RUNA) - Ânima Educação
repository.mail.fl_str_mv contato@animaeducacao.com.br