Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration

Detalhes bibliográficos
Autor(a) principal: Silva, Diego
Data de Publicação: 2023
Outros Autores: Frohlich, William, Mello, Blanda, Vieira, Renata, Rigo, Sandro
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10174/35792
https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381
https://doi.org/10.1016/j.imu.2023.101381
Resumo: The available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals.
id RCAP_06d46be3aec094d71d5b1eda635edcba
oai_identifier_str oai:dspace.uevora.pt:10174/35792
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integrationThe available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals.Elsevier2023-12-12T14:32:47Z2023-12-122023-10-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/35792https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381https://doi.org/10.1016/j.imu.2023.101381http://hdl.handle.net/10174/35792https://doi.org/10.1016/j.imu.2023.101381enghttps://www.sciencedirect.com/science/article/pii/S2352914823002277?via%3Dihubndndndrenatav@uevora.ptnd299Silva, DiegoFrohlich, WilliamMello, BlandaVieira, RenataRigo, Sandroinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:39:28Zoai:dspace.uevora.pt:10174/35792Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:24:00.691249Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
title Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
spellingShingle Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
Silva, Diego
title_short Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
title_full Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
title_fullStr Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
title_full_unstemmed Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
title_sort Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
author Silva, Diego
author_facet Silva, Diego
Frohlich, William
Mello, Blanda
Vieira, Renata
Rigo, Sandro
author_role author
author2 Frohlich, William
Mello, Blanda
Vieira, Renata
Rigo, Sandro
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Silva, Diego
Frohlich, William
Mello, Blanda
Vieira, Renata
Rigo, Sandro
description The available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals.
publishDate 2023
dc.date.none.fl_str_mv 2023-12-12T14:32:47Z
2023-12-12
2023-10-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/35792
https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381
https://doi.org/10.1016/j.imu.2023.101381
http://hdl.handle.net/10174/35792
https://doi.org/10.1016/j.imu.2023.101381
url http://hdl.handle.net/10174/35792
https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381
https://doi.org/10.1016/j.imu.2023.101381
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://www.sciencedirect.com/science/article/pii/S2352914823002277?via%3Dihub
nd
nd
nd
renatav@uevora.pt
nd
299
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Elsevier
publisher.none.fl_str_mv Elsevier
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136722366234624