PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction
Autor(a) principal: | |
---|---|
Data de Publicação: | 2021 |
Outros Autores: | , , , , , , , , , |
Tipo de documento: | Artigo de conferência |
Idioma: | eng |
Título da fonte: | Repositório Institucional da UFMG |
Texto Completo: | http://hdl.handle.net/1843/66341 https://orcid.org/0000-0003-0200-3646 https://orcid.org/0000-0002-8950-0890 https://orcid.org/0000-0003-4052-7993 https://orcid.org/0000-0002-8921-5598 https://orcid.org/0000-0003-2228-7965 https://orcid.org/0000-0002-3150-3503 https://orcid.org/0000-0002-6740-7855 https://orcid.org/0000-0003-2637-3086 https://orcid.org/0000-0003-1811-5087 https://orcid.org/0000-0002-9789-9547 https://orcid.org/0000-0001-8239-2930 |
Resumo: | This study introduces the system submitted to the eHealthKD Challenge 2021 by the PUCRJ-PUCPR-UFMG team. We proposed a multilingual BERT-based system for joint entity recognition and relation extraction in multidomain texts. Our end-to-end multitasking model benefits from the transformer architecture, which has proved to capture better the global dependencies of the input text. Also, the use of a multilingual model contributed to our system to perform well even in the set of tests containing non-Spanish sentences. Our system ranked first in the entity recognition task and second in the Main scenario, where both tasks of entity recognition and relation extraction had to be solved. The full code of our approach and more details of the implementation are publicly available |
id |
UFMG_c27e2c31ebcb9d13a4804188478a367e |
---|---|
oai_identifier_str |
oai:repositorio.ufmg.br:1843/66341 |
network_acronym_str |
UFMG |
network_name_str |
Repositório Institucional da UFMG |
repository_id_str |
|
spelling |
2024-03-21T18:20:27Z2024-03-21T18:20:27Z20216836911613-0073http://hdl.handle.net/1843/66341https://orcid.org/0000-0003-0200-3646https://orcid.org/0000-0002-8950-0890https://orcid.org/0000-0003-4052-7993https://orcid.org/0000-0002-8921-5598https://orcid.org/0000-0003-2228-7965https://orcid.org/0000-0002-3150-3503https://orcid.org/0000-0002-6740-7855https://orcid.org/0000-0003-2637-3086https://orcid.org/0000-0003-1811-5087https://orcid.org/0000-0002-9789-9547https://orcid.org/0000-0001-8239-2930This study introduces the system submitted to the eHealthKD Challenge 2021 by the PUCRJ-PUCPR-UFMG team. We proposed a multilingual BERT-based system for joint entity recognition and relation extraction in multidomain texts. Our end-to-end multitasking model benefits from the transformer architecture, which has proved to capture better the global dependencies of the input text. Also, the use of a multilingual model contributed to our system to perform well even in the set of tests containing non-Spanish sentences. Our system ranked first in the entity recognition task and second in the Main scenario, where both tasks of entity recognition and relation extraction had to be solved. The full code of our approach and more details of the implementation are publicly availableengUniversidade Federal de Minas GeraisUFMGBrasilFALE - FACULDADE DE LETRASIberian Languages Evaluation ForumProcessamento da linguagem natural (Computação)Informática na medicinaeHealthEntity recognitionRelation extractionBERTDeep learningPUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extractioninfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObjecthttps://ceur-ws.org/Vol-2943/ehealth_paper3.pdfThiago Castro FerreiraJoão Vitor Andrioli de SouzaLucas Ferro Antunes de OliveiraLucas Emanuel Silva e OliveiraClaudia Moro BarraEmerson Cabrera ParaísoYohan Bonescki GumielAdriana Silvina PaganoGiovanni Pazini Meneghel PaivaElisa Terumi Rubel SchneiderLucas Pavanelliapplication/pdfinfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMGLICENSELicense.txtLicense.txttext/plain; charset=utf-82042https://repositorio.ufmg.br/bitstream/1843/66341/1/License.txtfa505098d172de0bc8864fc1287ffe22MD51ORIGINALPUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021 a multilingual BERT-based system for joint entity recognition and relation extraction.pdfPUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021 a multilingual BERT-based system for joint entity recognition and relation extraction.pdfapplication/pdf619758https://repositorio.ufmg.br/bitstream/1843/66341/2/PUCRJ-PUCPR-UFMG%20at%20eHealth-KD%20Challenge%202021%20a%20multilingual%20BERT-based%20system%20for%20joint%20entity%20recognition%20and%20relation%20extraction.pdfab2e86fc636493c13bb3ef17fac783daMD521843/663412024-03-21 15:20:28.143oai:repositorio.ufmg.br:1843/66341TElDRU7vv71BIERFIERJU1RSSUJVSe+/ve+/vU8gTu+/vU8tRVhDTFVTSVZBIERPIFJFUE9TSVTvv71SSU8gSU5TVElUVUNJT05BTCBEQSBVRk1HCiAKCkNvbSBhIGFwcmVzZW50Ye+/ve+/vW8gZGVzdGEgbGljZW7vv71hLCB2b2Pvv70gKG8gYXV0b3IgKGVzKSBvdSBvIHRpdHVsYXIgZG9zIGRpcmVpdG9zIGRlIGF1dG9yKSBjb25jZWRlIGFvIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIChSSS1VRk1HKSBvIGRpcmVpdG8gbu+/vW8gZXhjbHVzaXZvIGUgaXJyZXZvZ++/vXZlbCBkZSByZXByb2R1emlyIGUvb3UgZGlzdHJpYnVpciBhIHN1YSBwdWJsaWNh77+977+9byAoaW5jbHVpbmRvIG8gcmVzdW1vKSBwb3IgdG9kbyBvIG11bmRvIG5vIGZvcm1hdG8gaW1wcmVzc28gZSBlbGV0cu+/vW5pY28gZSBlbSBxdWFscXVlciBtZWlvLCBpbmNsdWluZG8gb3MgZm9ybWF0b3Mg77+9dWRpbyBvdSB277+9ZGVvLgoKVm9j77+9IGRlY2xhcmEgcXVlIGNvbmhlY2UgYSBwb2zvv710aWNhIGRlIGNvcHlyaWdodCBkYSBlZGl0b3JhIGRvIHNldSBkb2N1bWVudG8gZSBxdWUgY29uaGVjZSBlIGFjZWl0YSBhcyBEaXJldHJpemVzIGRvIFJJLVVGTUcuCgpWb2Pvv70gY29uY29yZGEgcXVlIG8gUmVwb3NpdO+/vXJpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGTUcgcG9kZSwgc2VtIGFsdGVyYXIgbyBjb250Ze+/vWRvLCB0cmFuc3BvciBhIHN1YSBwdWJsaWNh77+977+9byBwYXJhIHF1YWxxdWVyIG1laW8gb3UgZm9ybWF0byBwYXJhIGZpbnMgZGUgcHJlc2VydmHvv73vv71vLgoKVm9j77+9IHRhbWLvv71tIGNvbmNvcmRhIHF1ZSBvIFJlcG9zaXTvv71yaW8gSW5zdGl0dWNpb25hbCBkYSBVRk1HIHBvZGUgbWFudGVyIG1haXMgZGUgdW1hIGPvv71waWEgZGUgc3VhIHB1YmxpY2Hvv73vv71vIHBhcmEgZmlucyBkZSBzZWd1cmFu77+9YSwgYmFjay11cCBlIHByZXNlcnZh77+977+9by4KClZvY++/vSBkZWNsYXJhIHF1ZSBhIHN1YSBwdWJsaWNh77+977+9byDvv70gb3JpZ2luYWwgZSBxdWUgdm9j77+9IHRlbSBvIHBvZGVyIGRlIGNvbmNlZGVyIG9zIGRpcmVpdG9zIGNvbnRpZG9zIG5lc3RhIGxpY2Vu77+9YS4gVm9j77+9IHRhbWLvv71tIGRlY2xhcmEgcXVlIG8gZGVw77+9c2l0byBkZSBzdWEgcHVibGljYe+/ve+/vW8gbu+/vW8sIHF1ZSBzZWphIGRlIHNldSBjb25oZWNpbWVudG8sIGluZnJpbmdlIGRpcmVpdG9zIGF1dG9yYWlzIGRlIG5pbmd177+9bS4KCkNhc28gYSBzdWEgcHVibGljYe+/ve+/vW8gY29udGVuaGEgbWF0ZXJpYWwgcXVlIHZvY++/vSBu77+9byBwb3NzdWkgYSB0aXR1bGFyaWRhZGUgZG9zIGRpcmVpdG9zIGF1dG9yYWlzLCB2b2Pvv70gZGVjbGFyYSBxdWUgb2J0ZXZlIGEgcGVybWlzc++/vW8gaXJyZXN0cml0YSBkbyBkZXRlbnRvciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMgcGFyYSBjb25jZWRlciBhbyBSZXBvc2l077+9cmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBvcyBkaXJlaXRvcyBhcHJlc2VudGFkb3MgbmVzdGEgbGljZW7vv71hLCBlIHF1ZSBlc3NlIG1hdGVyaWFsIGRlIHByb3ByaWVkYWRlIGRlIHRlcmNlaXJvcyBlc3Tvv70gY2xhcmFtZW50ZSBpZGVudGlmaWNhZG8gZSByZWNvbmhlY2lkbyBubyB0ZXh0byBvdSBubyBjb250Ze+/vWRvIGRhIHB1YmxpY2Hvv73vv71vIG9yYSBkZXBvc2l0YWRhLgoKQ0FTTyBBIFBVQkxJQ0Hvv73vv71PIE9SQSBERVBPU0lUQURBIFRFTkhBIFNJRE8gUkVTVUxUQURPIERFIFVNIFBBVFJPQ++/vU5JTyBPVSBBUE9JTyBERSBVTUEgQUfvv71OQ0lBIERFIEZPTUVOVE8gT1UgT1VUUk8gT1JHQU5JU01PLCBWT0Pvv70gREVDTEFSQSBRVUUgUkVTUEVJVE9VIFRPRE9TIEUgUVVBSVNRVUVSIERJUkVJVE9TIERFIFJFVklT77+9TyBDT01PIFRBTULvv71NIEFTIERFTUFJUyBPQlJJR0Hvv73vv71FUyBFWElHSURBUyBQT1IgQ09OVFJBVE8gT1UgQUNPUkRPLgoKTyBSZXBvc2l077+9cmlvIEluc3RpdHVjaW9uYWwgZGEgVUZNRyBzZSBjb21wcm9tZXRlIGEgaWRlbnRpZmljYXIgY2xhcmFtZW50ZSBvIHNldSBub21lKHMpIG91IG8ocykgbm9tZXMocykgZG8ocykgZGV0ZW50b3IoZXMpIGRvcyBkaXJlaXRvcyBhdXRvcmFpcyBkYSBwdWJsaWNh77+977+9bywgZSBu77+9byBmYXLvv70gcXVhbHF1ZXIgYWx0ZXJh77+977+9bywgYWzvv71tIGRhcXVlbGFzIGNvbmNlZGlkYXMgcG9yIGVzdGEgbGljZW7vv71hLgo=Repositório de PublicaçõesPUBhttps://repositorio.ufmg.br/oaiopendoar:2024-03-21T18:20:28Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false |
dc.title.pt_BR.fl_str_mv |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
title |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
spellingShingle |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction Thiago Castro Ferreira eHealth Entity recognition Relation extraction BERT Deep learning Processamento da linguagem natural (Computação) Informática na medicina |
title_short |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
title_full |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
title_fullStr |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
title_full_unstemmed |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
title_sort |
PUCRJ-PUCPR-UFMG at eHealth-KD Challenge 2021: a multilingual BERT-based system for joint entity recognition and relation extraction |
author |
Thiago Castro Ferreira |
author_facet |
Thiago Castro Ferreira João Vitor Andrioli de Souza Lucas Ferro Antunes de Oliveira Lucas Emanuel Silva e Oliveira Claudia Moro Barra Emerson Cabrera Paraíso Yohan Bonescki Gumiel Adriana Silvina Pagano Giovanni Pazini Meneghel Paiva Elisa Terumi Rubel Schneider Lucas Pavanelli |
author_role |
author |
author2 |
João Vitor Andrioli de Souza Lucas Ferro Antunes de Oliveira Lucas Emanuel Silva e Oliveira Claudia Moro Barra Emerson Cabrera Paraíso Yohan Bonescki Gumiel Adriana Silvina Pagano Giovanni Pazini Meneghel Paiva Elisa Terumi Rubel Schneider Lucas Pavanelli |
author2_role |
author author author author author author author author author author |
dc.contributor.author.fl_str_mv |
Thiago Castro Ferreira João Vitor Andrioli de Souza Lucas Ferro Antunes de Oliveira Lucas Emanuel Silva e Oliveira Claudia Moro Barra Emerson Cabrera Paraíso Yohan Bonescki Gumiel Adriana Silvina Pagano Giovanni Pazini Meneghel Paiva Elisa Terumi Rubel Schneider Lucas Pavanelli |
dc.subject.por.fl_str_mv |
eHealth Entity recognition Relation extraction BERT Deep learning |
topic |
eHealth Entity recognition Relation extraction BERT Deep learning Processamento da linguagem natural (Computação) Informática na medicina |
dc.subject.other.pt_BR.fl_str_mv |
Processamento da linguagem natural (Computação) Informática na medicina |
description |
This study introduces the system submitted to the eHealthKD Challenge 2021 by the PUCRJ-PUCPR-UFMG team. We proposed a multilingual BERT-based system for joint entity recognition and relation extraction in multidomain texts. Our end-to-end multitasking model benefits from the transformer architecture, which has proved to capture better the global dependencies of the input text. Also, the use of a multilingual model contributed to our system to perform well even in the set of tests containing non-Spanish sentences. Our system ranked first in the entity recognition task and second in the Main scenario, where both tasks of entity recognition and relation extraction had to be solved. The full code of our approach and more details of the implementation are publicly available |
publishDate |
2021 |
dc.date.issued.fl_str_mv |
2021 |
dc.date.accessioned.fl_str_mv |
2024-03-21T18:20:27Z |
dc.date.available.fl_str_mv |
2024-03-21T18:20:27Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/1843/66341 |
dc.identifier.issn.pt_BR.fl_str_mv |
1613-0073 |
dc.identifier.orcid.pt_BR.fl_str_mv |
https://orcid.org/0000-0003-0200-3646 https://orcid.org/0000-0002-8950-0890 https://orcid.org/0000-0003-4052-7993 https://orcid.org/0000-0002-8921-5598 https://orcid.org/0000-0003-2228-7965 https://orcid.org/0000-0002-3150-3503 https://orcid.org/0000-0002-6740-7855 https://orcid.org/0000-0003-2637-3086 https://orcid.org/0000-0003-1811-5087 https://orcid.org/0000-0002-9789-9547 https://orcid.org/0000-0001-8239-2930 |
identifier_str_mv |
1613-0073 |
url |
http://hdl.handle.net/1843/66341 https://orcid.org/0000-0003-0200-3646 https://orcid.org/0000-0002-8950-0890 https://orcid.org/0000-0003-4052-7993 https://orcid.org/0000-0002-8921-5598 https://orcid.org/0000-0003-2228-7965 https://orcid.org/0000-0002-3150-3503 https://orcid.org/0000-0002-6740-7855 https://orcid.org/0000-0003-2637-3086 https://orcid.org/0000-0003-1811-5087 https://orcid.org/0000-0002-9789-9547 https://orcid.org/0000-0001-8239-2930 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.ispartof.pt_BR.fl_str_mv |
Iberian Languages Evaluation Forum |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
dc.publisher.initials.fl_str_mv |
UFMG |
dc.publisher.country.fl_str_mv |
Brasil |
dc.publisher.department.fl_str_mv |
FALE - FACULDADE DE LETRAS |
publisher.none.fl_str_mv |
Universidade Federal de Minas Gerais |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFMG instname:Universidade Federal de Minas Gerais (UFMG) instacron:UFMG |
instname_str |
Universidade Federal de Minas Gerais (UFMG) |
instacron_str |
UFMG |
institution |
UFMG |
reponame_str |
Repositório Institucional da UFMG |
collection |
Repositório Institucional da UFMG |
bitstream.url.fl_str_mv |
https://repositorio.ufmg.br/bitstream/1843/66341/1/License.txt https://repositorio.ufmg.br/bitstream/1843/66341/2/PUCRJ-PUCPR-UFMG%20at%20eHealth-KD%20Challenge%202021%20a%20multilingual%20BERT-based%20system%20for%20joint%20entity%20recognition%20and%20relation%20extraction.pdf |
bitstream.checksum.fl_str_mv |
fa505098d172de0bc8864fc1287ffe22 ab2e86fc636493c13bb3ef17fac783da |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG) |
repository.mail.fl_str_mv |
|
_version_ |
1803589320002502656 |