Named entity annotation of an 18th-century transcribed corpus: problems and challenges
Autor(a) principal: | |
---|---|
Data de Publicação: | 2022 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10174/32163 |
Resumo: | This paper reviews a stage of the process of annotating named entities in 18th-century texts to enrich historical research sources and link them to other bases. The categories in question are person, location and organisation, valid categories for historian analysis. We discuss the difficulties observed in the process and point eventual solutions. |
id |
RCAP_596a6c77111e02e900c0bde6ad9fd4a7 |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/32163 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Named entity annotation of an 18th-century transcribed corpus: problems and challengesDigital HumanitiesNamed Entity RecognitionThis paper reviews a stage of the process of annotating named entities in 18th-century texts to enrich historical research sources and link them to other bases. The categories in question are person, location and organisation, valid categories for historian analysis. We discuss the difficulties observed in the process and point eventual solutions.Partially supported by the Portuguese Foundation FCT, under the projects CEECIND/01997/2017 and UIDB/00057/2020.CEUR2022-06-02T10:56:03Z2022-06-022022-04-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/32163http://hdl.handle.net/10174/32163engCameron, H.F., Olival, F., Vieira, R., Neto, J.F.S.: Named entity annotation of an 18th century transcribed corpus: problems, challenges. In: Proceedings of the Second Workshop on Digital Humanities and Natural Language Processing (2nd DHandNLP 2022), co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2022), Virtual Event, Fortaleza, Brazil, 21st March, 2022. CEUR Workshop Proceedings, vol. 3128, pp. 18–25. http://ceur-ws.org/Vol-3128/paper8.pdhttp://ceur-ws.org/Vol-3128/paper8.pdfhelenafc@uevora.ptmfo@uevora.ptrenatav@uevora.ptnd299Cameron, HelenaOlival, FernandaVieira, RenataSantos, Joaquiminfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:32:27Zoai:dspace.uevora.pt:10174/32163Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:21:11.223691Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
spellingShingle |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges Cameron, Helena Digital Humanities Named Entity Recognition |
title_short |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_full |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_fullStr |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_full_unstemmed |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
title_sort |
Named entity annotation of an 18th-century transcribed corpus: problems and challenges |
author |
Cameron, Helena |
author_facet |
Cameron, Helena Olival, Fernanda Vieira, Renata Santos, Joaquim |
author_role |
author |
author2 |
Olival, Fernanda Vieira, Renata Santos, Joaquim |
author2_role |
author author author |
dc.contributor.author.fl_str_mv |
Cameron, Helena Olival, Fernanda Vieira, Renata Santos, Joaquim |
dc.subject.por.fl_str_mv |
Digital Humanities Named Entity Recognition |
topic |
Digital Humanities Named Entity Recognition |
description |
This paper reviews a stage of the process of annotating named entities in 18th-century texts to enrich historical research sources and link them to other bases. The categories in question are person, location and organisation, valid categories for historian analysis. We discuss the difficulties observed in the process and point eventual solutions. |
publishDate |
2022 |
dc.date.none.fl_str_mv |
2022-06-02T10:56:03Z 2022-06-02 2022-04-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/32163 http://hdl.handle.net/10174/32163 |
url |
http://hdl.handle.net/10174/32163 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Cameron, H.F., Olival, F., Vieira, R., Neto, J.F.S.: Named entity annotation of an 18th century transcribed corpus: problems, challenges. In: Proceedings of the Second Workshop on Digital Humanities and Natural Language Processing (2nd DHandNLP 2022), co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2022), Virtual Event, Fortaleza, Brazil, 21st March, 2022. CEUR Workshop Proceedings, vol. 3128, pp. 18–25. http://ceur-ws.org/Vol-3128/paper8.pd http://ceur-ws.org/Vol-3128/paper8.pdf helenafc@uevora.pt mfo@uevora.pt renatav@uevora.pt nd 299 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
CEUR |
publisher.none.fl_str_mv |
CEUR |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136693156052992 |