PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction

Detalhes bibliográficos
Autor(a) principal: Thiago Castro Ferreira
Data de Publicação: 2021
Outros Autores: João Vitor Andrioli de Souza, Lucas Ferro Antunes de Oliveira, Lucas Emanuel Silva e Oliveira, Claudia Moro Barra, Emerson Cabrera Paraíso, Yohan Bonescki Gumiel, Adriana Silvina Pagano, Giovanni Pazini Meneghel Paiva, Elisa Terumi Rubel Schneider, Lucas Pavanelli
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UFMG
Texto Completo: http://hdl.handle.net/1843/66341
https://orcid.org/0000-0003-0200-3646
https://orcid.org/0000-0002-8950-0890
https://orcid.org/0000-0003-4052-7993
https://orcid.org/0000-0002-8921-5598
https://orcid.org/0000-0003-2228-7965
https://orcid.org/0000-0002-3150-3503
https://orcid.org/0000-0002-6740-7855
https://orcid.org/0000-0003-2637-3086
https://orcid.org/0000-0003-1811-5087
https://orcid.org/0000-0002-9789-9547
https://orcid.org/0000-0001-8239-2930
Resumo: This study introduces the system submitted to the eHealthKD Challenge 2021 by the PUCRJ-PUCPR-UFMG team. We proposed a multilingual BERT-based system for joint entity recognition and relation extraction in multidomain texts. Our end-to-end multitasking model benefits from the transformer architecture, which has proved to capture better the global dependencies of the input text. Also, the use of a multilingual model contributed to our system to perform well even in the set of tests containing non-Spanish sentences. Our system ranked first in the entity recognition task and second in the Main scenario, where both tasks of entity recognition and relation extraction had to be solved. The full code of our approach and more details of the implementation are publicly available
id UFMG_c27e2c31ebcb9d13a4804188478a367e
oai_identifier_str oai:repositorio.ufmg.br:1843/66341
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling 2024-03-21T18:20:27Z2024-03-21T18:20:27Z20216836911613-0073http://hdl.handle.net/1843/66341https://orcid.org/0000-0003-0200-3646https://orcid.org/0000-0002-8950-0890https://orcid.org/0000-0003-4052-7993https://orcid.org/0000-0002-8921-5598https://orcid.org/0000-0003-2228-7965https://orcid.org/0000-0002-3150-3503https://orcid.org/0000-0002-6740-7855https://orcid.org/0000-0003-2637-3086https://orcid.org/0000-0003-1811-5087https://orcid.org/0000-0002-9789-9547https://orcid.org/0000-0001-8239-2930This study introduces the system submitted to the eHealthKD Challenge 2021 by the PUCRJ-PUCPR-UFMG team. We proposed a multilingual BERT-based system for joint entity recognition and relation extraction in multidomain texts. Our end-to-end multitasking model benefits from the transformer architecture, which has proved to capture better the global dependencies of the input text. Also, the use of a multilingual model contributed to our system to perform well even in the set of tests containing non-Spanish sentences. Our system ranked first in the entity recognition task and second in the Main scenario, where both tasks of entity recognition and relation extraction had to be solved. The full code of our approach and more details of the implementation are publicly availableengUniversidade Federal de Minas GeraisUFMGBrasilFALE - FACULDADE DE LETRASIberian Languages Evaluation ForumProcessamento da linguagem natural (Computação)Informática na medicinaeHealthEntity recognitionRelation extractionBERTDeep learningPUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extractioninfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjecthttps://ceur-ws.org/Vol-2943/ehealth_paper3.pdfThiago Castro FerreiraJoão Vitor Andrioli de SouzaLucas Ferro Antunes de OliveiraLucas Emanuel Silva e OliveiraClaudia Moro BarraEmerson Cabrera ParaísoYohan Bonescki GumielAdriana Silvina PaganoGiovanni Pazini Meneghel PaivaElisa Terumi Rubel SchneiderLucas Pavanelliapplication/pdfinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGLICENSELicense.txtLicense.txttext/plain; charset=utf-82042https://repositorio.ufmg.br/bitstream/1843/66341/1/License.txtfa505098d172de0bc8864fc1287ffe22MD51ORIGINALPUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021 a multilingual BERT-based system for joint entity recognition and relation extraction.pdfPUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021 a multilingual BERT-based system for joint entity recognition and relation extraction.pdfapplication/pdf619758https://repositorio.ufmg.br/bitstream/1843/66341/2/PUCRJ-PUCPR-UFMG%20at%20eHealth-KD%20Challenge%202021%20a%20multilingual%20BERT-based%20system%20for%20joint%20entity%20recognition%20and%20relation%20extraction.pdfab2e86fc636493c13bb3ef17fac783daMD521843/663412024-03-21 15:20:28.143oai:repositorio.ufmg.br:1843/66341TElDRU7vv71BIERFIERJU1RSSUJVSe+/ve+/vU8gTu+/vU8tRVhDTFVTSVZBIERPIFJFUE9TSVTvv71SSU8gSU5TVElUVUNJT05BTCBEQSBVRk1HCiAKCkNvbSBhIGFwcmVzZW50Ye+/ve+/vW8gZGVzdGEgbGljZW7vv71hLCB2b2Pvv70gKG8gYXV0b3IgKGVzKSBvdSBvIHRpdHVsYXIgZG9zIGRpcmVpdG9zIGRlIGF1dG9yKSBjb25jZWRlIGFvIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIChSSS1VRk1HKSBvIGRpcmVpdG8gbu+/vW8gZXhjbHVzaXZvIGUgaXJyZXZvZ++/vXZlbCBkZSByZXByb2R1emlyIGUvb3UgZGlzdHJpYnVpciBhIHN1YSBwdWJsaWNh77+977+9byAoaW5jbHVpbmRvIG8gcmVzdW1vKSBwb3IgdG9kbyBvIG11bmRvIG5vIGZvcm1hdG8gaW1wcmVzc28gZSBlbGV0cu+/vW5pY28gZSBlbSBxdWFscXVlciBtZWlvLCBpbmNsdWluZG8gb3MgZm9ybWF0b3Mg77+9dWRpbyBvdSB277+9ZGVvLgoKVm9j77+9IGRlY2xhcmEgcXVlIGNvbmhlY2UgYSBwb2zvv710aWNhIGRlIGNvcHlyaWdodCBkYSBlZGl0b3JhIGRvIHNldSBkb2N1bWVudG8gZSBxdWUgY29uaGVjZSBlIGFjZWl0YSBhcyBEaXJldHJpemVzIGRvIFJJLVVGTUcuCgpWb2Pvv70gY29uY29yZGEgcXVlIG8gUmVwb3NpdO+/vXJpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGTUcgcG9kZSwgc2VtIGFsdGVyYXIgbyBjb250Ze+/vWRvLCB0cmFuc3BvciBhIHN1YSBwdWJsaWNh77+977+9byBwYXJhIHF1YWxxdWVyIG1laW8gb3UgZm9ybWF0byBwYXJhIGZpbnMgZGUgcHJlc2VydmHvv73vv71vLgoKVm9j77+9IHRhbWLvv71tIGNvbmNvcmRhIHF1ZSBvIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIHBvZGUgbWFudGVyIG1haXMgZGUgdW1hIGPvv71waWEgZGUgc3VhIHB1YmxpY2Hvv73vv71vIHBhcmEgZmlucyBkZSBzZWd1cmFu77+9YSwgYmFjay11cCBlIHByZXNlcnZh77+977+9by4KClZvY++/vSBkZWNsYXJhIHF1ZSBhIHN1YSBwdWJsaWNh77+977+9byDvv70gb3JpZ2luYWwgZSBxdWUgdm9j77+9IHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vu77+9YS4gVm9j77+9IHRhbWLvv71tIGRlY2xhcmEgcXVlIG8gZGVw77+9c2l0byBkZSBzdWEgcHVibGljYe+/ve+/vW8gbu+/vW8sIHF1ZSBzZWphIGRlIHNldSBjb25oZWNpbWVudG8sIGluZnJpbmdlIGRpcmVpdG9zIGF1dG9yYWlzIGRlIG5pbmd177+9bS4KCkNhc28gYSBzdWEgcHVibGljYe+/ve+/vW8gY29udGVuaGEgbWF0ZXJpYWwgcXVlIHZvY++/vSBu77+9byBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2Pvv70gZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc++/vW8gaXJyZXN0cml0YSBkbyBkZXRlbnRvciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgcGFyYSBjb25jZWRlciBhbyBSZXBvc2l077+9cmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7vv71hLCBlIHF1ZSBlc3NlIG1hdGVyaWFsIGRlIHByb3ByaWVkYWRlIGRlIHRlcmNlaXJvcyBlc3Tvv70gY2xhcmFtZW50ZSBpZGVudGlmaWNhZG8gZSByZWNvbmhlY2lkbyBubyB0ZXh0byBvdSBubyBjb250Ze+/vWRvIGRhIHB1YmxpY2Hvv73vv71vIG9yYSBkZXBvc2l0YWRhLgoKQ0FTTyBBIFBVQkxJQ0Hvv73vv71PIE9SQSBERVBPU0lUQURBIFRFTkhBIFNJRE8gUkVTVUxUQURPIERFIFVNIFBBVFJPQ++/vU5JTyBPVSBBUE9JTyBERSBVTUEgQUfvv71OQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PLCBWT0Pvv70gREVDTEFSQSBRVUUgUkVTUEVJVE9VIFRPRE9TIEUgUVVBSVNRVUVSIERJUkVJVE9TIERFIFJFVklT77+9TyBDT01PIFRBTULvv71NIEFTIERFTUFJUyBPQlJJR0Hvv73vv71FUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKTyBSZXBvc2l077+9cmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lKHMpIG91IG8ocykgbm9tZXMocykgZG8ocykgZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSBwdWJsaWNh77+977+9bywgZSBu77+9byBmYXLvv70gcXVhbHF1ZXIgYWx0ZXJh77+977+9bywgYWzvv71tIGRhcXVlbGFzIGNvbmNlZGlkYXMgcG9yIGVzdGEgbGljZW7vv71hLgo=Repositório de PublicaçõesPUBhttps://repositorio.ufmg.br/oaiopendoar:2024-03-21T18:20:28Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.pt_BR.fl_str_mv PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
title PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
spellingShingle PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
Thiago Castro Ferreira
eHealth
Entity recognition
Relation extraction
BERT
Deep learning
Processamento da linguagem natural (Computação)
Informática na medicina
title_short PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
title_full PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
title_fullStr PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
title_full_unstemmed PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
title_sort PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
author Thiago Castro Ferreira
author_facet Thiago Castro Ferreira
João Vitor Andrioli de Souza
Lucas Ferro Antunes de Oliveira
Lucas Emanuel Silva e Oliveira
Claudia Moro Barra
Emerson Cabrera Paraíso
Yohan Bonescki Gumiel
Adriana Silvina Pagano
Giovanni Pazini Meneghel Paiva
Elisa Terumi Rubel Schneider
Lucas Pavanelli
author_role author
author2 João Vitor Andrioli de Souza
Lucas Ferro Antunes de Oliveira
Lucas Emanuel Silva e Oliveira
Claudia Moro Barra
Emerson Cabrera Paraíso
Yohan Bonescki Gumiel
Adriana Silvina Pagano
Giovanni Pazini Meneghel Paiva
Elisa Terumi Rubel Schneider
Lucas Pavanelli
author2_role author
author
author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Thiago Castro Ferreira
João Vitor Andrioli de Souza
Lucas Ferro Antunes de Oliveira
Lucas Emanuel Silva e Oliveira
Claudia Moro Barra
Emerson Cabrera Paraíso
Yohan Bonescki Gumiel
Adriana Silvina Pagano
Giovanni Pazini Meneghel Paiva
Elisa Terumi Rubel Schneider
Lucas Pavanelli
dc.subject.por.fl_str_mv eHealth
Entity recognition
Relation extraction
BERT
Deep learning
topic eHealth
Entity recognition
Relation extraction
BERT
Deep learning
Processamento da linguagem natural (Computação)
Informática na medicina
dc.subject.other.pt_BR.fl_str_mv Processamento da linguagem natural (Computação)
Informática na medicina
description This study introduces the system submitted to the eHealthKD Challenge 2021 by the PUCRJ-PUCPR-UFMG team. We proposed a multilingual BERT-based system for joint entity recognition and relation extraction in multidomain texts. Our end-to-end multitasking model benefits from the transformer architecture, which has proved to capture better the global dependencies of the input text. Also, the use of a multilingual model contributed to our system to perform well even in the set of tests containing non-Spanish sentences. Our system ranked first in the entity recognition task and second in the Main scenario, where both tasks of entity recognition and relation extraction had to be solved. The full code of our approach and more details of the implementation are publicly available
publishDate 2021
dc.date.issued.fl_str_mv 2021
dc.date.accessioned.fl_str_mv 2024-03-21T18:20:27Z
dc.date.available.fl_str_mv 2024-03-21T18:20:27Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1843/66341
dc.identifier.issn.pt_BR.fl_str_mv 1613-0073
dc.identifier.orcid.pt_BR.fl_str_mv https://orcid.org/0000-0003-0200-3646
https://orcid.org/0000-0002-8950-0890
https://orcid.org/0000-0003-4052-7993
https://orcid.org/0000-0002-8921-5598
https://orcid.org/0000-0003-2228-7965
https://orcid.org/0000-0002-3150-3503
https://orcid.org/0000-0002-6740-7855
https://orcid.org/0000-0003-2637-3086
https://orcid.org/0000-0003-1811-5087
https://orcid.org/0000-0002-9789-9547
https://orcid.org/0000-0001-8239-2930
identifier_str_mv 1613-0073
url http://hdl.handle.net/1843/66341
https://orcid.org/0000-0003-0200-3646
https://orcid.org/0000-0002-8950-0890
https://orcid.org/0000-0003-4052-7993
https://orcid.org/0000-0002-8921-5598
https://orcid.org/0000-0003-2228-7965
https://orcid.org/0000-0002-3150-3503
https://orcid.org/0000-0002-6740-7855
https://orcid.org/0000-0003-2637-3086
https://orcid.org/0000-0003-1811-5087
https://orcid.org/0000-0002-9789-9547
https://orcid.org/0000-0001-8239-2930
dc.language.iso.fl_str_mv eng
language eng
dc.relation.ispartof.pt_BR.fl_str_mv Iberian Languages Evaluation Forum
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.publisher.initials.fl_str_mv UFMG
dc.publisher.country.fl_str_mv Brasil
dc.publisher.department.fl_str_mv FALE - FACULDADE DE LETRAS
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
bitstream.url.fl_str_mv https://repositorio.ufmg.br/bitstream/1843/66341/1/License.txt
https://repositorio.ufmg.br/bitstream/1843/66341/2/PUCRJ-PUCPR-UFMG%20at%20eHealth-KD%20Challenge%202021%20a%20multilingual%20BERT-based%20system%20for%20joint%20entity%20recognition%20and%20relation%20extraction.pdf
bitstream.checksum.fl_str_mv fa505098d172de0bc8864fc1287ffe22
ab2e86fc636493c13bb3ef17fac783da
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv
_version_ 1803589320002502656