Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10174/35792 https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381 https://doi.org/10.1016/j.imu.2023.101381 |
Resumo: | The available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals. |
id |
RCAP_06d46be3aec094d71d5b1eda635edcba |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/35792 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integrationThe available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals.Elsevier2023-12-12T14:32:47Z2023-12-122023-10-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/35792https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381https://doi.org/10.1016/j.imu.2023.101381http://hdl.handle.net/10174/35792https://doi.org/10.1016/j.imu.2023.101381enghttps://www.sciencedirect.com/science/article/pii/S2352914823002277?via%3Dihubndndndrenatav@uevora.ptnd299Silva, DiegoFrohlich, WilliamMello, BlandaVieira, RenataRigo, Sandroinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:39:28Zoai:dspace.uevora.pt:10174/35792Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:24:00.691249Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
title |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
spellingShingle |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration Silva, Diego |
title_short |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
title_full |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
title_fullStr |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
title_full_unstemmed |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
title_sort |
Exploring Named Entity Recognition and Relation Extraction for ontology and medical records integration |
author |
Silva, Diego |
author_facet |
Silva, Diego Frohlich, William Mello, Blanda Vieira, Renata Rigo, Sandro |
author_role |
author |
author2 |
Frohlich, William Mello, Blanda Vieira, Renata Rigo, Sandro |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Silva, Diego Frohlich, William Mello, Blanda Vieira, Renata Rigo, Sandro |
description |
The available natural language data in electronic health records is of noteworthy interest to health research and development. Nevertheless, their manual analysis is not feasible and poses a challenge to accessing valuable information in these records. This paper presents an approach to automatically extract information from these unstructured medical records using Domain Entity Recognition and Relation Extraction, structuring the results through a domain ontology. We developed our work in the oncology domain, an attention-demanding field. The main contribution of this work lies in integrating multiple resources in a complete methodology to accomplish this task. We developed a new entity and relation annotated dataset of medical evolutions in Brazilian Portuguese, containing 1622 documents, 146,769 entities, and 111,716 relations. We attained 78.24 % accuracy for entity and relation extraction in the exams domain. Healthcare specialists evaluated the approach regarding entity recognition and relation extraction positively and considered the methodology valuable to health professionals. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-12-12T14:32:47Z 2023-12-12 2023-10-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/35792 https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381 https://doi.org/10.1016/j.imu.2023.101381 http://hdl.handle.net/10174/35792 https://doi.org/10.1016/j.imu.2023.101381 |
url |
http://hdl.handle.net/10174/35792 https://doi.org/da Silva DP, da Rosa Fröhlich W, de Mello BH, Vieira R, Rigo SandroJosé, Exploring named entity recognition and relation extraction for ontology and medical records integration, Informatics in Medicine Unlocked (2023), doi: https://doi.org/10.1016/j.imu.2023.101381 https://doi.org/10.1016/j.imu.2023.101381 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://www.sciencedirect.com/science/article/pii/S2352914823002277?via%3Dihub nd nd nd renatav@uevora.pt nd 299 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Elsevier |
publisher.none.fl_str_mv |
Elsevier |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136722366234624 |