Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases
Autor(a) principal: | |
---|---|
Data de Publicação: | 2012 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Universitário da Ânima (RUNA) |
Texto Completo: | https://repositorio.animaeducacao.com.br/handle/ANIMA/2649 |
Resumo: | Background: Web-based, free-text documents on science and technology have been increasing growing on the web. However, most of these documents are not immediately processable by computers slowing down the acquisition of useful information. Computational ontologies might represent a possible solution by enabling semantically machine readable data sets. But, the process of ontology creation, instantiation and maintenance is still based on manual methodologies and thus time and cost intensive. Method: We focused on a large corpus containing information on researchers, research fields, and institutions. We ased our strategy on traditional entity recognition, social computing and correlation. We devised a semi automatic approach for the recognition, correlation and extraction of named entities and relations from textual documents which are then used to create, instantiate, and maintain an ontology. Results: We present a prototype demonstrating the applicability of the proposed strategy, along with a case study describing how direct and indirect relations can be extracted from academic and professional activities registered in a database of curriculum vitae in free-text format. We present evidence that this system can identify entities to assist in the process of knowledge extraction and representation to support ontology maintenance. We also demonstrate the extraction of relationships among ontology classes and their instances. Conclusion: We have demonstrated that our system can be used for the conversion of research information in free text format into database with a semantic structure. Future studies should test this system using the growing number of freetext information available at the institutional and national levels. |
id |
Ânima_f1e021ad5699155e49be11938d6b5b3d |
---|---|
oai_identifier_str |
oai:repositorio.animaeducacao.com.br:ANIMA/2649 |
network_acronym_str |
Ânima |
network_name_str |
Repositório Universitário da Ânima (RUNA) |
repository_id_str |
|
spelling |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific DatabasesOntology maintenanceNamed entity recognitionKnowledge engineeringCiência da ComputaçãoBackground: Web-based, free-text documents on science and technology have been increasing growing on the web. However, most of these documents are not immediately processable by computers slowing down the acquisition of useful information. Computational ontologies might represent a possible solution by enabling semantically machine readable data sets. But, the process of ontology creation, instantiation and maintenance is still based on manual methodologies and thus time and cost intensive. Method: We focused on a large corpus containing information on researchers, research fields, and institutions. We ased our strategy on traditional entity recognition, social computing and correlation. We devised a semi automatic approach for the recognition, correlation and extraction of named entities and relations from textual documents which are then used to create, instantiate, and maintain an ontology. Results: We present a prototype demonstrating the applicability of the proposed strategy, along with a case study describing how direct and indirect relations can be extracted from academic and professional activities registered in a database of curriculum vitae in free-text format. We present evidence that this system can identify entities to assist in the process of knowledge extraction and representation to support ontology maintenance. We also demonstrate the extraction of relationships among ontology classes and their instances. Conclusion: We have demonstrated that our system can be used for the conversion of research information in free text format into database with a semantic structure. Future studies should test this system using the growing number of freetext information available at the institutional and national levels.2017-08-10T13:09:39Z2020-11-26T17:37:56Z2017-08-10T13:09:39Z2020-11-26T17:37:56Z2012info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlep.1-9application/pdfJaneiro1932-62031https://repositorio.animaeducacao.com.br/handle/ANIMA/26497San FranciscoAttribution-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nd/3.0/br/info:eu-repo/semantics/openAccessCeci, FlavioPietrobon, RicardoGonçalves, Alexandreengreponame:Repositório Universitário da Ânima (RUNA)instname:Ânima Educaçãoinstacron:Ânima2021-08-11T21:15:22Zoai:repositorio.animaeducacao.com.br:ANIMA/2649Repositório InstitucionalPRIhttps://repositorio.animaeducacao.com.br/oai/requestcontato@animaeducacao.com.bropendoar:2021-08-11T21:15:22Repositório Universitário da Ânima (RUNA) - Ânima Educaçãofalse |
dc.title.none.fl_str_mv |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
title |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
spellingShingle |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases Ceci, Flavio Ontology maintenance Named entity recognition Knowledge engineering Ciência da Computação |
title_short |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
title_full |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
title_fullStr |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
title_full_unstemmed |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
title_sort |
Turning text into research networks: information retrieval and computational ontologies in the creation of scientific Databases |
author |
Ceci, Flavio |
author_facet |
Ceci, Flavio Pietrobon, Ricardo Gonçalves, Alexandre |
author_role |
author |
author2 |
Pietrobon, Ricardo Gonçalves, Alexandre |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Ceci, Flavio Pietrobon, Ricardo Gonçalves, Alexandre |
dc.subject.por.fl_str_mv |
Ontology maintenance Named entity recognition Knowledge engineering Ciência da Computação |
topic |
Ontology maintenance Named entity recognition Knowledge engineering Ciência da Computação |
description |
Background: Web-based, free-text documents on science and technology have been increasing growing on the web. However, most of these documents are not immediately processable by computers slowing down the acquisition of useful information. Computational ontologies might represent a possible solution by enabling semantically machine readable data sets. But, the process of ontology creation, instantiation and maintenance is still based on manual methodologies and thus time and cost intensive. Method: We focused on a large corpus containing information on researchers, research fields, and institutions. We ased our strategy on traditional entity recognition, social computing and correlation. We devised a semi automatic approach for the recognition, correlation and extraction of named entities and relations from textual documents which are then used to create, instantiate, and maintain an ontology. Results: We present a prototype demonstrating the applicability of the proposed strategy, along with a case study describing how direct and indirect relations can be extracted from academic and professional activities registered in a database of curriculum vitae in free-text format. We present evidence that this system can identify entities to assist in the process of knowledge extraction and representation to support ontology maintenance. We also demonstrate the extraction of relationships among ontology classes and their instances. Conclusion: We have demonstrated that our system can be used for the conversion of research information in free text format into database with a semantic structure. Future studies should test this system using the growing number of freetext information available at the institutional and national levels. |
publishDate |
2012 |
dc.date.none.fl_str_mv |
2012 2017-08-10T13:09:39Z 2017-08-10T13:09:39Z 2020-11-26T17:37:56Z 2020-11-26T17:37:56Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
Janeiro 1932-6203 1 https://repositorio.animaeducacao.com.br/handle/ANIMA/2649 |
identifier_str_mv |
Janeiro 1932-6203 1 |
url |
https://repositorio.animaeducacao.com.br/handle/ANIMA/2649 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
7 |
dc.rights.driver.fl_str_mv |
Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Attribution-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nd/3.0/br/ |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
p.1-9 application/pdf |
dc.coverage.none.fl_str_mv |
San Francisco |
dc.source.none.fl_str_mv |
reponame:Repositório Universitário da Ânima (RUNA) instname:Ânima Educação instacron:Ânima |
instname_str |
Ânima Educação |
instacron_str |
Ânima |
institution |
Ânima |
reponame_str |
Repositório Universitário da Ânima (RUNA) |
collection |
Repositório Universitário da Ânima (RUNA) |
repository.name.fl_str_mv |
Repositório Universitário da Ânima (RUNA) - Ânima Educação |
repository.mail.fl_str_mv |
contato@animaeducacao.com.br |
_version_ |
1767415818186915840 |