Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
Main author: | Vieira, Renata |
---|---|
Publication date: | 2021 |
Other authors: | Olival, Fernanda; Cameron, Helena; Santos, Joaquim; Sequeira, Ofelia; Santos, Ivo |
Document type: | Article |
Language: | eng |
Source title: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Full text: | http://hdl.handle.net/10174/30166 https://doi.org/10.5334/johd.43 (Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20.) |
Abstract: | This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source that was manually transcribed. The text is enriched with named-entity annotations of the types PERSON, LOCATION, and ORGANIZATION. The whole collection was annotated automatically, and two researchers manually annotated a portion of it for evaluation purposes. The dataset provides the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians: it allows, for instance, comparative analyses between parishes and regions, or calculation of a locality's area of influence. The paper describes the creation and evaluation of the corpus and discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation. |
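
The abstract mentions tagged texts, entity lists, and frequency counts. As a rough illustration of how such counts could be derived, here is a minimal Python sketch. It assumes a hypothetical inline tag format (`<PERSON>…</PERSON>` and so on); the released corpus may use a different annotation scheme, and the function name `entity_frequencies` and the sample sentence are invented for this example, not taken from the dataset.

```python
import re
from collections import Counter

# Hypothetical inline tag format; the released corpus may use a different scheme.
TAG_PATTERN = re.compile(r"<(PERSON|LOCATION|ORGANIZATION)>(.*?)</\1>", re.DOTALL)


def entity_frequencies(tagged_text):
    """Count (entity type, surface form) pairs found in one tagged text."""
    counts = Counter()
    for entity_type, surface in TAG_PATTERN.findall(tagged_text):
        # Collapse internal whitespace so a line break does not split one entity.
        counts[(entity_type, " ".join(surface.split()))] += 1
    return counts


# Invented sample sentence for illustration only; it is not from the corpus.
sample = (
    "<PERSON>Joze Antonio</PERSON> he prior da freguesia de "
    "<LOCATION>Evora</LOCATION>, sujeita ao <ORGANIZATION>Cabido</ORGANIZATION> "
    "de <LOCATION>Evora</LOCATION>."
)
freqs = entity_frequencies(sample)
```

Keying the counter on the pair of type and surface form keeps a place name and a homonymous person name apart, which matters when comparing parishes by the localities they mention.
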
id |
RCAP_57d64c86064c1b762ba682c75d2d5769 |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/30166 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities; Entidades nomeadas; Memórias Paroquiais; This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source that was manually transcribed. The text is enriched with named-entity annotations of the types PERSON, LOCATION, and ORGANIZATION. The whole collection was annotated automatically, and two researchers manually annotated a portion of it for evaluation purposes. The dataset provides the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians: it allows, for instance, comparative analyses between parishes and regions, or calculation of a locality's area of influence. The paper describes the creation and evaluation of the corpus and discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation.; FCT CEECIND/01997/2017, UIDB/00057/2020; Journal of Open Humanities Data; 2021-09-30T10:04:55Z; 2021-09-30; 2021-09-01T00:00:00Z; info:eu-repo/semantics/publishedVersion; info:eu-repo/semantics/article; http://hdl.handle.net/10174/30166; Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20. DOI: http://doi.org/10.5334/johd.43; https://doi.org/10.5334/johd.43; eng; https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.43/; renatav@uevora.pt; mfo@uevora.pt; helenafc@uevora.pt; d47240@alunos.uevora.pt; floor.sequeira9@hotmail.com; ifs@uevora.pt; 736; Vieira, Renata; Olival, Fernanda; Cameron, Helena; Santos, Joaquim; Sequeira, Ofelia; Santos, Ivo; info:eu-repo/semantics/openAccess; reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos); instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação; instacron:RCAAP; 2024-01-03T19:27:51Z; oai:dspace.uevora.pt:10174/30166; Portal Agregador; ONG; https://www.rcaap.pt/oai/openaire; opendoar:7160; 2024-03-20T01:19:38.947694; Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação; false |
dc.title.none.fl_str_mv |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
title |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
spellingShingle |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities; Vieira, Renata; Entidades nomeadas; Memórias Paroquiais |
title_short |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
title_full |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
title_fullStr |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
title_full_unstemmed |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
title_sort |
Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities |
author |
Vieira, Renata |
author_facet |
Vieira, Renata; Olival, Fernanda; Cameron, Helena; Santos, Joaquim; Sequeira, Ofelia; Santos, Ivo |
author_role |
author |
author2 |
Olival, Fernanda; Cameron, Helena; Santos, Joaquim; Sequeira, Ofelia; Santos, Ivo |
author2_role |
author author author author author |
dc.contributor.author.fl_str_mv |
Vieira, Renata; Olival, Fernanda; Cameron, Helena; Santos, Joaquim; Sequeira, Ofelia; Santos, Ivo |
dc.subject.por.fl_str_mv |
Entidades nomeadas; Memórias Paroquiais |
topic |
Entidades nomeadas; Memórias Paroquiais |
description |
This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source that was manually transcribed. The text is enriched with named-entity annotations of the types PERSON, LOCATION, and ORGANIZATION. The whole collection was annotated automatically, and two researchers manually annotated a portion of it for evaluation purposes. The dataset provides the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians: it allows, for instance, comparative analyses between parishes and regions, or calculation of a locality's area of influence. The paper describes the creation and evaluation of the corpus and discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation. |
publishDate |
2021 |
dc.date.none.fl_str_mv |
2021-09-30T10:04:55Z 2021-09-30 2021-09-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/30166 https://doi.org/10.5334/johd.43 (Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20.) |
url |
http://hdl.handle.net/10174/30166 https://doi.org/10.5334/johd.43 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.43/ renatav@uevora.pt mfo@uevora.pt helenafc@uevora.pt d47240@alunos.uevora.pt floor.sequeira9@hotmail.com ifs@uevora.pt 736 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Journal of Open Humanities Data |
publisher.none.fl_str_mv |
Journal of Open Humanities Data |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136677980012544 |