Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records
Autor(a) principal: | |
---|---|
Data de Publicação: | 2023 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10174/35773 https://doi.org/BANDEIRA, Lucas T.; CONSOLI, Bernardo S.; VIEIRA, Renata; BORDIN, Rafael H.. Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records. In: SIMPÓSIO BRASILEIRO DE TECNOLOGIA DA INFORMAÇÃO E DA LINGUAGEM HUMANA (STIL), 14. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 224-228. DOI: https://doi.org/10.5753/stil.2023.234200. https://doi.org/10.5753/stil.2023.234200 |
Resumo: | With the growing importance of the use of information from electronic patient records in the development of machine learning models, there is also a need for a holistic understanding of those records, in particular abridging the clinical notes so that important information is used in the training process without the repetition that is commonly found in such notes. This paper presents the pre-processing of clinical notes from the BRATECA Dataset, a Brazilian tertiary care data collection, aiming at removing repeated information resulting from the interaction between healthcare providers and patients, considering assigned values of semantic similarity between sentences in clinical notes. |
id |
RCAP_86e725f0589b66ab54a5a36463b18b36 |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/35773 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health RecordsSemantic similarityEletronic Health RecordsWith the growing importance of the use of information from electronic patient records in the development of machine learning models, there is also a need for a holistic understanding of those records, in particular abridging the clinical notes so that important information is used in the training process without the repetition that is commonly found in such notes. This paper presents the pre-processing of clinical notes from the BRATECA Dataset, a Brazilian tertiary care data collection, aiming at removing repeated information resulting from the interaction between healthcare providers and patients, considering assigned values of semantic similarity between sentences in clinical notes.CEECIND/01997/2017SBC Open Library2023-12-12T14:04:45Z2023-12-122023-09-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/35773https://doi.org/BANDEIRA, Lucas T.; CONSOLI, Bernardo S.; VIEIRA, Renata; BORDIN, Rafael H.. Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records. In: SIMPÓSIO BRASILEIRO DE TECNOLOGIA DA INFORMAÇÃO E DA LINGUAGEM HUMANA (STIL), 14. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 224-228. DOI: https://doi.org/10.5753/stil.2023.234200.http://hdl.handle.net/10174/35773https://doi.org/10.5753/stil.2023.234200enghttps://sol.sbc.org.br/index.php/stil/article/view/25454ndndrenatav@uevora.ptnd299Bandeira, LucasConsoli, BernardoVieira, RenataBordini, Rafaelinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:39:25Zoai:dspace.uevora.pt:10174/35773Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:23:59.019416Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
title |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
spellingShingle |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records Bandeira, Lucas Semantic similarity Eletronic Health Records |
title_short |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
title_full |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
title_fullStr |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
title_full_unstemmed |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
title_sort |
Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records |
author |
Bandeira, Lucas |
author_facet |
Bandeira, Lucas Consoli, Bernardo Vieira, Renata Bordini, Rafael |
author_role |
author |
author2 |
Consoli, Bernardo Vieira, Renata Bordini, Rafael |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Bandeira, Lucas Consoli, Bernardo Vieira, Renata Bordini, Rafael |
dc.subject.por.fl_str_mv |
Semantic similarity Eletronic Health Records |
topic |
Semantic similarity Eletronic Health Records |
description |
With the growing importance of the use of information from electronic patient records in the development of machine learning models, there is also a need for a holistic understanding of those records, in particular abridging the clinical notes so that important information is used in the training process without the repetition that is commonly found in such notes. This paper presents the pre-processing of clinical notes from the BRATECA Dataset, a Brazilian tertiary care data collection, aiming at removing repeated information resulting from the interaction between healthcare providers and patients, considering assigned values of semantic similarity between sentences in clinical notes. |
publishDate |
2023 |
dc.date.none.fl_str_mv |
2023-12-12T14:04:45Z 2023-12-12 2023-09-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/35773 https://doi.org/BANDEIRA, Lucas T.; CONSOLI, Bernardo S.; VIEIRA, Renata; BORDIN, Rafael H.. Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records. In: SIMPÓSIO BRASILEIRO DE TECNOLOGIA DA INFORMAÇÃO E DA LINGUAGEM HUMANA (STIL), 14. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 224-228. DOI: https://doi.org/10.5753/stil.2023.234200. http://hdl.handle.net/10174/35773 https://doi.org/10.5753/stil.2023.234200 |
url |
http://hdl.handle.net/10174/35773 https://doi.org/BANDEIRA, Lucas T.; CONSOLI, Bernardo S.; VIEIRA, Renata; BORDIN, Rafael H.. Semantic Textual Similarity for Abridging Clinical Notes in Brazilian Electronic Health Records. In: SIMPÓSIO BRASILEIRO DE TECNOLOGIA DA INFORMAÇÃO E DA LINGUAGEM HUMANA (STIL), 14. , 2023, Belo Horizonte/MG. Anais [...]. Porto Alegre: Sociedade Brasileira de Computação, 2023 . p. 224-228. DOI: https://doi.org/10.5753/stil.2023.234200. https://doi.org/10.5753/stil.2023.234200 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://sol.sbc.org.br/index.php/stil/article/view/25454 nd nd renatav@uevora.pt nd 299 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
SBC Open Library |
publisher.none.fl_str_mv |
SBC Open Library |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136722331631616 |