Annotating diverse scientific data with hasco

Detalhes bibliográficos
Autor(a) principal: Paulo Pinheiro
Data de Publicação: 2018
Outros Autores: Marcello Peixoto Bax, Henrique Santos, Sabbir Rashid, Zhicheng Liang, Yue Liu, James Mccusker, Deborah Mcguinness, Yarden Ne’eman
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UFMG
Texto Completo: http://hdl.handle.net/1843/51523
https://orcid.org/0000-0001-8469-4043
https://orcid.org/0000-0003-0503-3031
https://orcid.org/0000-0002-4162-8334
https://orcid.org/0000-0001-9646-1183
https://orcid.org/0000-0003-1085-6059
https://orcid.org/0000-0001-7037-4567
https://orcid.org/0000-0002-3017-2722
Resumo: Ontologies are being widely used across many scientific fields, most notably in roles related to acquiring, preparing, integrating and managing data resources. Data acquisition and preparation activities are often difficult to reuse since they tend to be domain dependent, as well as dependent on how data is acquired: through measurement, subject-elicitation, and/or model-generation activities. Therefore, tools developed for preparing data from one scientific activity often cannot be easily adapted to prepare data from other scientific activities. We introduce the Human-Aware Science Ontology (HAScO) that integrates a collection of well-established science-related ontologies, and aims to address issues related to data annotation for large data ecosystem, where data can come from diverse data sources including sensors, lab results, and questionnaires. The work reported in the paper is based on our experience developing HAScO, using it to annotate data collections to facilitate data exploration and analysis for numerous scientific projects, three of which will be described. Data files produced by scientific studies are processed to identify and annotate the objects (a gene, for instance) with the appropriate ontological terms. One benefit we realized (of preserving scientific data provenance) is that software platforms can support scientists in their exploration and preparation of data for analysis since the meaning of and interrelationships between the data is explicit.
id UFMG_35fc0e3eda07cf413190251dd3dfb0d9
oai_identifier_str oai:repositorio.ufmg.br:1843/51523
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling 2023-04-03T21:00:28Z2023-04-03T21:00:28Z20181180911613-0073http://hdl.handle.net/1843/51523https://orcid.org/0000-0001-8469-4043https://orcid.org/0000-0003-0503-3031https://orcid.org/0000-0002-4162-8334https://orcid.org/0000-0001-9646-1183https://orcid.org/0000-0003-1085-6059https://orcid.org/0000-0001-7037-4567https://orcid.org/0000-0002-3017-2722Ontologies are being widely used across many scientific fields, most notably in roles related to acquiring, preparing, integrating and managing data resources. Data acquisition and preparation activities are often difficult to reuse since they tend to be domain dependent, as well as dependent on how data is acquired: through measurement, subject-elicitation, and/or model-generation activities. Therefore, tools developed for preparing data from one scientific activity often cannot be easily adapted to prepare data from other scientific activities. We introduce the Human-Aware Science Ontology (HAScO) that integrates a collection of well-established science-related ontologies, and aims to address issues related to data annotation for large data ecosystem, where data can come from diverse data sources including sensors, lab results, and questionnaires. The work reported in the paper is based on our experience developing HAScO, using it to annotate data collections to facilitate data exploration and analysis for numerous scientific projects, three of which will be described. Data files produced by scientific studies are processed to identify and annotate the objects (a gene, for instance) with the appropriate ontological terms. One benefit we realized (of preserving scientific data provenance) is that software platforms can support scientists in their exploration and preparation of data for analysis since the meaning of and interrelationships between the data is explicit.engUniversidade Federal de Minas GeraisUFMGBrasilECI - DEPARTAMENTO DE TEORIA E GESTÃO INFORMAÇÃOSeminar on Ontology Research in BrazilOntologias (Recuperação da informação)Gestão da InformaçãoHAScoOntologiasAnnotating diverse scientific data with hascoinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjecthttps://ceur-ws.org/Vol-2228/Paulo PinheiroMarcello Peixoto BaxHenrique SantosSabbir RashidZhicheng LiangYue LiuJames MccuskerDeborah McguinnessYarden Ne’emanapplication/pdfinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGLICENSELicense.txtLicense.txttext/plain; charset=utf-82042https://repositorio.ufmg.br/bitstream/1843/51523/1/License.txtfa505098d172de0bc8864fc1287ffe22MD51ORIGINALAnnotating diverse scientific data with hascoAnnotating diverse scientific data with hascoapplication/pdf756269https://repositorio.ufmg.br/bitstream/1843/51523/2/Annotating%20diverse%20scientific%20data%20with%20hasco2dd775d4843cfed0ad6639229782ee80MD521843/515232023-04-03 18:02:37.003oai:repositorio.ufmg.br:1843/51523TElDRU7vv71BIERFIERJU1RSSUJVSe+/ve+/vU8gTu+/vU8tRVhDTFVTSVZBIERPIFJFUE9TSVTvv71SSU8gSU5TVElUVUNJT05BTCBEQSBVRk1HCiAKCkNvbSBhIGFwcmVzZW50Ye+/ve+/vW8gZGVzdGEgbGljZW7vv71hLCB2b2Pvv70gKG8gYXV0b3IgKGVzKSBvdSBvIHRpdHVsYXIgZG9zIGRpcmVpdG9zIGRlIGF1dG9yKSBjb25jZWRlIGFvIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIChSSS1VRk1HKSBvIGRpcmVpdG8gbu+/vW8gZXhjbHVzaXZvIGUgaXJyZXZvZ++/vXZlbCBkZSByZXByb2R1emlyIGUvb3UgZGlzdHJpYnVpciBhIHN1YSBwdWJsaWNh77+977+9byAoaW5jbHVpbmRvIG8gcmVzdW1vKSBwb3IgdG9kbyBvIG11bmRvIG5vIGZvcm1hdG8gaW1wcmVzc28gZSBlbGV0cu+/vW5pY28gZSBlbSBxdWFscXVlciBtZWlvLCBpbmNsdWluZG8gb3MgZm9ybWF0b3Mg77+9dWRpbyBvdSB277+9ZGVvLgoKVm9j77+9IGRlY2xhcmEgcXVlIGNvbmhlY2UgYSBwb2zvv710aWNhIGRlIGNvcHlyaWdodCBkYSBlZGl0b3JhIGRvIHNldSBkb2N1bWVudG8gZSBxdWUgY29uaGVjZSBlIGFjZWl0YSBhcyBEaXJldHJpemVzIGRvIFJJLVVGTUcuCgpWb2Pvv70gY29uY29yZGEgcXVlIG8gUmVwb3NpdO+/vXJpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGTUcgcG9kZSwgc2VtIGFsdGVyYXIgbyBjb250Ze+/vWRvLCB0cmFuc3BvciBhIHN1YSBwdWJsaWNh77+977+9byBwYXJhIHF1YWxxdWVyIG1laW8gb3UgZm9ybWF0byBwYXJhIGZpbnMgZGUgcHJlc2VydmHvv73vv71vLgoKVm9j77+9IHRhbWLvv71tIGNvbmNvcmRhIHF1ZSBvIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIHBvZGUgbWFudGVyIG1haXMgZGUgdW1hIGPvv71waWEgZGUgc3VhIHB1YmxpY2Hvv73vv71vIHBhcmEgZmlucyBkZSBzZWd1cmFu77+9YSwgYmFjay11cCBlIHByZXNlcnZh77+977+9by4KClZvY++/vSBkZWNsYXJhIHF1ZSBhIHN1YSBwdWJsaWNh77+977+9byDvv70gb3JpZ2luYWwgZSBxdWUgdm9j77+9IHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vu77+9YS4gVm9j77+9IHRhbWLvv71tIGRlY2xhcmEgcXVlIG8gZGVw77+9c2l0byBkZSBzdWEgcHVibGljYe+/ve+/vW8gbu+/vW8sIHF1ZSBzZWphIGRlIHNldSBjb25oZWNpbWVudG8sIGluZnJpbmdlIGRpcmVpdG9zIGF1dG9yYWlzIGRlIG5pbmd177+9bS4KCkNhc28gYSBzdWEgcHVibGljYe+/ve+/vW8gY29udGVuaGEgbWF0ZXJpYWwgcXVlIHZvY++/vSBu77+9byBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2Pvv70gZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc++/vW8gaXJyZXN0cml0YSBkbyBkZXRlbnRvciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgcGFyYSBjb25jZWRlciBhbyBSZXBvc2l077+9cmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7vv71hLCBlIHF1ZSBlc3NlIG1hdGVyaWFsIGRlIHByb3ByaWVkYWRlIGRlIHRlcmNlaXJvcyBlc3Tvv70gY2xhcmFtZW50ZSBpZGVudGlmaWNhZG8gZSByZWNvbmhlY2lkbyBubyB0ZXh0byBvdSBubyBjb250Ze+/vWRvIGRhIHB1YmxpY2Hvv73vv71vIG9yYSBkZXBvc2l0YWRhLgoKQ0FTTyBBIFBVQkxJQ0Hvv73vv71PIE9SQSBERVBPU0lUQURBIFRFTkhBIFNJRE8gUkVTVUxUQURPIERFIFVNIFBBVFJPQ++/vU5JTyBPVSBBUE9JTyBERSBVTUEgQUfvv71OQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PLCBWT0Pvv70gREVDTEFSQSBRVUUgUkVTUEVJVE9VIFRPRE9TIEUgUVVBSVNRVUVSIERJUkVJVE9TIERFIFJFVklT77+9TyBDT01PIFRBTULvv71NIEFTIERFTUFJUyBPQlJJR0Hvv73vv71FUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKTyBSZXBvc2l077+9cmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lKHMpIG91IG8ocykgbm9tZXMocykgZG8ocykgZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSBwdWJsaWNh77+977+9bywgZSBu77+9byBmYXLvv70gcXVhbHF1ZXIgYWx0ZXJh77+977+9bywgYWzvv71tIGRhcXVlbGFzIGNvbmNlZGlkYXMgcG9yIGVzdGEgbGljZW7vv71hLgo=Repositório de PublicaçõesPUBhttps://repositorio.ufmg.br/oaiopendoar:2023-04-03T21:02:37Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.pt_BR.fl_str_mv Annotating diverse scientific data with hasco
title Annotating diverse scientific data with hasco
spellingShingle Annotating diverse scientific data with hasco
Paulo Pinheiro
HASco
Ontologias
Ontologias (Recuperação da informação)
Gestão da Informação
title_short Annotating diverse scientific data with hasco
title_full Annotating diverse scientific data with hasco
title_fullStr Annotating diverse scientific data with hasco
title_full_unstemmed Annotating diverse scientific data with hasco
title_sort Annotating diverse scientific data with hasco
author Paulo Pinheiro
author_facet Paulo Pinheiro
Marcello Peixoto Bax
Henrique Santos
Sabbir Rashid
Zhicheng Liang
Yue Liu
James Mccusker
Deborah Mcguinness
Yarden Ne’eman
author_role author
author2 Marcello Peixoto Bax
Henrique Santos
Sabbir Rashid
Zhicheng Liang
Yue Liu
James Mccusker
Deborah Mcguinness
Yarden Ne’eman
author2_role author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Paulo Pinheiro
Marcello Peixoto Bax
Henrique Santos
Sabbir Rashid
Zhicheng Liang
Yue Liu
James Mccusker
Deborah Mcguinness
Yarden Ne’eman
dc.subject.por.fl_str_mv HASco
Ontologias
topic HASco
Ontologias
Ontologias (Recuperação da informação)
Gestão da Informação
dc.subject.other.pt_BR.fl_str_mv Ontologias (Recuperação da informação)
Gestão da Informação
description Ontologies are being widely used across many scientific fields, most notably in roles related to acquiring, preparing, integrating and managing data resources. Data acquisition and preparation activities are often difficult to reuse since they tend to be domain dependent, as well as dependent on how data is acquired: through measurement, subject-elicitation, and/or model-generation activities. Therefore, tools developed for preparing data from one scientific activity often cannot be easily adapted to prepare data from other scientific activities. We introduce the Human-Aware Science Ontology (HAScO) that integrates a collection of well-established science-related ontologies, and aims to address issues related to data annotation for large data ecosystem, where data can come from diverse data sources including sensors, lab results, and questionnaires. The work reported in the paper is based on our experience developing HAScO, using it to annotate data collections to facilitate data exploration and analysis for numerous scientific projects, three of which will be described. Data files produced by scientific studies are processed to identify and annotate the objects (a gene, for instance) with the appropriate ontological terms. One benefit we realized (of preserving scientific data provenance) is that software platforms can support scientists in their exploration and preparation of data for analysis since the meaning of and interrelationships between the data is explicit.
publishDate 2018
dc.date.issued.fl_str_mv 2018
dc.date.accessioned.fl_str_mv 2023-04-03T21:00:28Z
dc.date.available.fl_str_mv 2023-04-03T21:00:28Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1843/51523
dc.identifier.issn.pt_BR.fl_str_mv 1613-0073
dc.identifier.orcid.pt_BR.fl_str_mv https://orcid.org/0000-0001-8469-4043
https://orcid.org/0000-0003-0503-3031
https://orcid.org/0000-0002-4162-8334
https://orcid.org/0000-0001-9646-1183
https://orcid.org/0000-0003-1085-6059
https://orcid.org/0000-0001-7037-4567
https://orcid.org/0000-0002-3017-2722
identifier_str_mv 1613-0073
url http://hdl.handle.net/1843/51523
https://orcid.org/0000-0001-8469-4043
https://orcid.org/0000-0003-0503-3031
https://orcid.org/0000-0002-4162-8334
https://orcid.org/0000-0001-9646-1183
https://orcid.org/0000-0003-1085-6059
https://orcid.org/0000-0001-7037-4567
https://orcid.org/0000-0002-3017-2722
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartof.pt_BR.fl_str_mv Seminar on Ontology Research in Brazil
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.publisher.initials.fl_str_mv UFMG
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv ECI - DEPARTAMENTO DE TEORIA E GESTÃO INFORMAÇÃO
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
bitstream.url.fl_str_mv https://repositorio.ufmg.br/bitstream/1843/51523/1/License.txt
https://repositorio.ufmg.br/bitstream/1843/51523/2/Annotating%20diverse%20scientific%20data%20with%20hasco
bitstream.checksum.fl_str_mv fa505098d172de0bc8864fc1287ffe22
2dd775d4843cfed0ad6639229782ee80
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv
_version_ 1803589521479041024