Beyond relational databases: preserving the data

Detalhes bibliográficos
Autor(a) principal: Ramalho, José Carlos
Data de Publicação: 2020
Outros Autores: Ferreira, Bruno, Faria, Luís, Ferreira, Miguel
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
Texto Completo: http://hdl.handle.net/1822/73404
Resumo: Relational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool that was developed to support this methodology: Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. The DBPTK connects to live relational databases and extracts information into formats more adequate for long-term preservation. Supported preservation formats include the SIARD 2, created by a cooperation between the Swiss Federal Archives and the E-ARK project that is becoming a standard in the area. DBPTK has a flexible plugin-based architecture enabling its use for other purposes like database upgrade and database migration between different systems. Presented real case scenarios demonstrate the usefulness, correctness and performance of the tool.
id RCAP_e855ae9444600d01e5b5a744503b51e0
oai_identifier_str oai:repositorium.sdum.uminho.pt:1822/73404
network_acronym_str RCAP
network_name_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository_id_str 7160
spelling Beyond relational databases: preserving the dataDigital ArchivingeArchivingDatabase preservationE-ARKSIARDDatabasesRelational databasesRDMSDatabase preservation toolkitEngenharia e Tecnologia::Outras Engenharias e TecnologiasPaz, justiça e instituições eficazesRelational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool that was developed to support this methodology: Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. The DBPTK connects to live relational databases and extracts information into formats more adequate for long-term preservation. Supported preservation formats include the SIARD 2, created by a cooperation between the Swiss Federal Archives and the E-ARK project that is becoming a standard in the area. DBPTK has a flexible plugin-based architecture enabling its use for other purposes like database upgrade and database migration between different systems. Presented real case scenarios demonstrate the usefulness, correctness and performance of the tool.The initial E-ARK project was in part supported by the European Commission within the Competitiveness and Innovation Programme 2007–2013, Grant Agreement no. 620998 under the Policy Support Programme.RoutledgeUniversidade do MinhoRamalho, José CarlosFerreira, BrunoFaria, LuísFerreira, Miguel20202020-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/1822/73404engJosé Carlos Ramalho, Bruno Ferreira, Luis Faria & Miguel Ferreira (2020) Beyond Relational Databases: Preserving the Data, New Review of Information Networking, 25:2, 107-118, DOI: 10.1080/13614576.2021.19193981361-45761740-786910.1080/13614576.2021.1919398https://www.tandfonline.com/doi/abs/10.1080/13614576.2021.1919398info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-21T11:58:16Zoai:repositorium.sdum.uminho.pt:1822/73404Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:47:58.757547Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse
dc.title.none.fl_str_mv Beyond relational databases: preserving the data
title Beyond relational databases: preserving the data
spellingShingle Beyond relational databases: preserving the data
Ramalho, José Carlos
Digital Archiving
eArchiving
Database preservation
E-ARK
SIARD
Databases
Relational databases
RDMS
Database preservation toolkit
Engenharia e Tecnologia::Outras Engenharias e Tecnologias
Paz, justiça e instituições eficazes
title_short Beyond relational databases: preserving the data
title_full Beyond relational databases: preserving the data
title_fullStr Beyond relational databases: preserving the data
title_full_unstemmed Beyond relational databases: preserving the data
title_sort Beyond relational databases: preserving the data
author Ramalho, José Carlos
author_facet Ramalho, José Carlos
Ferreira, Bruno
Faria, Luís
Ferreira, Miguel
author_role author
author2 Ferreira, Bruno
Faria, Luís
Ferreira, Miguel
author2_role author
author
author
dc.contributor.none.fl_str_mv Universidade do Minho
dc.contributor.author.fl_str_mv Ramalho, José Carlos
Ferreira, Bruno
Faria, Luís
Ferreira, Miguel
dc.subject.por.fl_str_mv Digital Archiving
eArchiving
Database preservation
E-ARK
SIARD
Databases
Relational databases
RDMS
Database preservation toolkit
Engenharia e Tecnologia::Outras Engenharias e Tecnologias
Paz, justiça e instituições eficazes
topic Digital Archiving
eArchiving
Database preservation
E-ARK
SIARD
Databases
Relational databases
RDMS
Database preservation toolkit
Engenharia e Tecnologia::Outras Engenharias e Tecnologias
Paz, justiça e instituições eficazes
description Relational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool that was developed to support this methodology: Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. The DBPTK connects to live relational databases and extracts information into formats more adequate for long-term preservation. Supported preservation formats include the SIARD 2, created by a cooperation between the Swiss Federal Archives and the E-ARK project that is becoming a standard in the area. DBPTK has a flexible plugin-based architecture enabling its use for other purposes like database upgrade and database migration between different systems. Presented real case scenarios demonstrate the usefulness, correctness and performance of the tool.
publishDate 2020
dc.date.none.fl_str_mv 2020
2020-01-01T00:00:00Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://hdl.handle.net/1822/73404
url http://hdl.handle.net/1822/73404
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv José Carlos Ramalho, Bruno Ferreira, Luis Faria & Miguel Ferreira (2020) Beyond Relational Databases: Preserving the Data, New Review of Information Networking, 25:2, 107-118, DOI: 10.1080/13614576.2021.1919398
1361-4576
1740-7869
10.1080/13614576.2021.1919398
https://www.tandfonline.com/doi/abs/10.1080/13614576.2021.1919398
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Routledge
publisher.none.fl_str_mv Routledge
dc.source.none.fl_str_mv reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron:RCAAP
instname_str Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
instacron_str RCAAP
institution RCAAP
reponame_str Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
collection Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)
repository.name.fl_str_mv Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação
repository.mail.fl_str_mv
_version_ 1799132239808692224