Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/10174/28049 |
Resumo: | The common approach to research in History and Archaeology tends to the continuous development of new databases, completely independent of each other with the consequence of data fragmentation, atomisation of knowledge, and ultimately the creation of data silos. This happens because of academic tradition, but also because these disciplines work with fragmented information to understand historical data, the contexts, which enables the creation of multiple narratives and interpretations. However, for these disciplines, the context is a key aspect that always should be preserved. The Memórias Paroquiais (Parish Memories) correspond to a survey, organized in 3 major parts (land, mountain and river) and are an essential source for obtaining a radiography of Portugal in 1758-1761. We believe that this primary source could reach a new exponent if worked from a different approach: semantically annotated, processed and modeled. We propose that the Portuguese Parish Memories, due to their intrinsic characteristics, should constitute a Knowledge Base (KB) to connect with other historical sources and research outputs. Ultimately, the Parish Memories could be a Gold Standard for the Natural Language Processing with impact on the research on other historical sources of Early Modern History Portugal, regardless of the knowledge domain. |
id |
RCAP_67b4d0e60e81d6feea88aefdc5a6b059 |
---|---|
oai_identifier_str |
oai:dspace.uevora.pt:10174/28049 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standardDigital HumanitiesNatural Language ProcessingMemórias ParoquiaisKnowledge BaseThe common approach to research in History and Archaeology tends to the continuous development of new databases, completely independent of each other with the consequence of data fragmentation, atomisation of knowledge, and ultimately the creation of data silos. This happens because of academic tradition, but also because these disciplines work with fragmented information to understand historical data, the contexts, which enables the creation of multiple narratives and interpretations. However, for these disciplines, the context is a key aspect that always should be preserved. The Memórias Paroquiais (Parish Memories) correspond to a survey, organized in 3 major parts (land, mountain and river) and are an essential source for obtaining a radiography of Portugal in 1758-1761. We believe that this primary source could reach a new exponent if worked from a different approach: semantically annotated, processed and modeled. We propose that the Portuguese Parish Memories, due to their intrinsic characteristics, should constitute a Knowledge Base (KB) to connect with other historical sources and research outputs. Ultimately, the Parish Memories could be a Gold Standard for the Natural Language Processing with impact on the research on other historical sources of Early Modern History Portugal, regardless of the knowledge domain.This work is funded by national funds through the Foundation for Science and Technology, under the project UIDB/00057/20202020-08-10T15:18:57Z2020-08-102020-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlehttp://hdl.handle.net/10174/28049http://hdl.handle.net/10174/28049engSantos, Ivo; Olival, Fernanda; Sequeira, Ofélia, «Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard», in DHandNLP 2020: Digital Humanities and Natural Language Processing: Proceedings of the Workshop on Digital Humanities and Natural Language Processing (DHandNLP 2020) co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2020). ed by M. José Finatto; Renta Vieira; Senja Pollak; Saturnino Luz, Évora, 2020, Vol. 2607, ISSN: 1613-0073, - http://ceur-ws.org/Vol-2607/.Io1613-0073http://ceur-ws.org/Vol-2607/.IoDepartamento de Históriaifs@uevora.ptmfo@uevora.ptosequeira@uevora.pt709Santos, IvoOlival, FernandaSequeira, Oféliainfo:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2024-01-03T19:24:03Zoai:dspace.uevora.pt:10174/28049Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-20T01:18:00.548217Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
title |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
spellingShingle |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard Santos, Ivo Digital Humanities Natural Language Processing Memórias Paroquiais Knowledge Base |
title_short |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
title_full |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
title_fullStr |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
title_full_unstemmed |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
title_sort |
Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard |
author |
Santos, Ivo |
author_facet |
Santos, Ivo Olival, Fernanda Sequeira, Ofélia |
author_role |
author |
author2 |
Olival, Fernanda Sequeira, Ofélia |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Santos, Ivo Olival, Fernanda Sequeira, Ofélia |
dc.subject.por.fl_str_mv |
Digital Humanities Natural Language Processing Memórias Paroquiais Knowledge Base |
topic |
Digital Humanities Natural Language Processing Memórias Paroquiais Knowledge Base |
description |
The common approach to research in History and Archaeology tends to the continuous development of new databases, completely independent of each other with the consequence of data fragmentation, atomisation of knowledge, and ultimately the creation of data silos. This happens because of academic tradition, but also because these disciplines work with fragmented information to understand historical data, the contexts, which enables the creation of multiple narratives and interpretations. However, for these disciplines, the context is a key aspect that always should be preserved. The Memórias Paroquiais (Parish Memories) correspond to a survey, organized in 3 major parts (land, mountain and river) and are an essential source for obtaining a radiography of Portugal in 1758-1761. We believe that this primary source could reach a new exponent if worked from a different approach: semantically annotated, processed and modeled. We propose that the Portuguese Parish Memories, due to their intrinsic characteristics, should constitute a Knowledge Base (KB) to connect with other historical sources and research outputs. Ultimately, the Parish Memories could be a Gold Standard for the Natural Language Processing with impact on the research on other historical sources of Early Modern History Portugal, regardless of the knowledge domain. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020-08-10T15:18:57Z 2020-08-10 2020-01-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/10174/28049 http://hdl.handle.net/10174/28049 |
url |
http://hdl.handle.net/10174/28049 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Santos, Ivo; Olival, Fernanda; Sequeira, Ofélia, «Excavating the data pit: the Portuguese Parish Memories (1758) as a gold standard», in DHandNLP 2020: Digital Humanities and Natural Language Processing: Proceedings of the Workshop on Digital Humanities and Natural Language Processing (DHandNLP 2020) co-located with International Conference on the Computational Processing of Portuguese (PROPOR 2020). ed by M. José Finatto; Renta Vieira; Senja Pollak; Saturnino Luz, Évora, 2020, Vol. 2607, ISSN: 1613-0073, - http://ceur-ws.org/Vol-2607/.Io 1613-0073 http://ceur-ws.org/Vol-2607/.Io Departamento de História ifs@uevora.pt mfo@uevora.pt osequeira@uevora.pt 709 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799136662658220032 |