Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities

Detalhes bibliográficos
Autor(a) principal: Vieira, Renata
Data de Publicação: 2021
Outros Autores: Olival, Fernanda, Cameron, Helena, Santos, Joaquim, Sequeira, Ofelia, Santos, Ivo
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/10174/30166
https://doi.org/Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20. DOI: http://doi.org/10.5334/johd.43org/10.5334/johd.43
https://doi.org/10.5334/johd.43
Resumo: This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection where two researchers annotated a portion of it manually for evaluation purposes. In this dataset, we provide the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians, allowing, for instance, comparative analyses between parishes and regions or to calculate the area of influence of a locality. The paper describes the creation and evaluation of the corpus, discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation.
id RCAP_57d64c86064c1b762ba682c75d2d5769
oai_identifier_str oai:dspace.uevora.pt:10174/30166
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named EntitiesEntidades nomeadasMemórias ParoquiaisThis work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection where two researchers annotated a portion of it manually for evaluation purposes. In this dataset, we provide the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians, allowing, for instance, comparative analyses between parishes and regions or to calculate the area of influence of a locality. The paper describes the creation and evaluation of the corpus, discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation.FCT CEECIND/01997/2017, UIDB/00057/2020Journal of Open Humanities Data2021-09-30T10:04:55Z2021-09-302021-09-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/30166https://doi.org/Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20. DOI: http://doi.org/10.5334/johd.43org/10.5334/johd.43http://hdl.handle.net/10174/30166https://doi.org/10.5334/johd.43enghttps://openhumanitiesdata.metajnl.com/articles/10.5334/johd.43/renatav@uevora.ptmfo@uevora.pthelenafc@uevora.ptd47240@alunos.uevora.ptfloor.sequeira9@hotmail.comifs@uevora.pt736Vieira, RenataOlival, FernandaCameron, HelenaSantos, JoaquimSequeira, OfeliaSantos, Ivoinfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:27:51Zoai:dspace.uevora.pt:10174/30166Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:19:38.947694Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
spellingShingle Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
Vieira, Renata
Entidades nomeadas
Memórias Paroquiais
title_short Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_full Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_fullStr Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_full_unstemmed Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
title_sort Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities
author Vieira, Renata
author_facet Vieira, Renata
Olival, Fernanda
Cameron, Helena
Santos, Joaquim
Sequeira, Ofelia
Santos, Ivo
author_role author
author2 Olival, Fernanda
Cameron, Helena
Santos, Joaquim
Sequeira, Ofelia
Santos, Ivo
author2_role author
author
author
author
author
dc.contributor.author.fl_str_mv Vieira, Renata
Olival, Fernanda
Cameron, Helena
Santos, Joaquim
Sequeira, Ofelia
Santos, Ivo
dc.subject.por.fl_str_mv Entidades nomeadas
Memórias Paroquiais
topic Entidades nomeadas
Memórias Paroquiais
description This work presents an enriched version of the Parish Memories (1758–1761), an essential Portuguese historical source manually transcribed. It is enriched with annotations of named entities of the types PERSON, LOCATION, and ORGANIZATION. The annotation was done automatically for the whole collection where two researchers annotated a portion of it manually for evaluation purposes. In this dataset, we provide the tagged texts, the lists of extracted entities, and frequency counts. The corpus is useful for historians, allowing, for instance, comparative analyses between parishes and regions or to calculate the area of influence of a locality. The paper describes the creation and evaluation of the corpus, discusses its applications and limitations. This first release may be improved by other researchers interested in the historical source itself or in the technology employed in its annotation.
publishDate 2021
dc.date.none.fl_str_mv 2021-09-30T10:04:55Z
2021-09-30
2021-09-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/10174/30166
https://doi.org/Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20. DOI: http://doi.org/10.5334/johd.43org/10.5334/johd.43
http://hdl.handle.net/10174/30166
https://doi.org/10.5334/johd.43
url http://hdl.handle.net/10174/30166
https://doi.org/Vieira, R., Olival, F., Cameron, H.F., Santos, J., Sequeira, O. and Santos, I., 2021. Enriching the 1758 Portuguese Parish Memories (Alentejo) with Named Entities. Journal of Open Humanities Data, 7, p.20. DOI: http://doi.org/10.5334/johd.43org/10.5334/johd.43
https://doi.org/10.5334/johd.43
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv https://openhumanitiesdata.metajnl.com/articles/10.5334/johd.43/
renatav@uevora.pt
mfo@uevora.pt
helenafc@uevora.pt
d47240@alunos.uevora.pt
floor.sequeira9@hotmail.com
ifs@uevora.pt
736
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Journal of Open Humanities Data
publisher.none.fl_str_mv Journal of Open Humanities Data
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799136677980012544