Beyond relational databases: preserving the data
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Outros Autores: | , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
Texto Completo: | http://hdl.handle.net/1822/73404 |
Resumo: | Relational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool that was developed to support this methodology: Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. The DBPTK connects to live relational databases and extracts information into formats more adequate for long-term preservation. Supported preservation formats include the SIARD 2, created by a cooperation between the Swiss Federal Archives and the E-ARK project that is becoming a standard in the area. DBPTK has a flexible plugin-based architecture enabling its use for other purposes like database upgrade and database migration between different systems. Presented real case scenarios demonstrate the usefulness, correctness and performance of the tool. |
id |
RCAP_e855ae9444600d01e5b5a744503b51e0 |
---|---|
oai_identifier_str |
oai:repositorium.sdum.uminho.pt:1822/73404 |
network_acronym_str |
RCAP |
network_name_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository_id_str |
7160 |
spelling |
Beyond relational databases: preserving the dataDigital ArchivingeArchivingDatabase preservationE-ARKSIARDDatabasesRelational databasesRDMSDatabase preservation toolkitEngenharia e Tecnologia::Outras Engenharias e TecnologiasPaz, justiça e instituições eficazesRelational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool that was developed to support this methodology: Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. The DBPTK connects to live relational databases and extracts information into formats more adequate for long-term preservation. Supported preservation formats include the SIARD 2, created by a cooperation between the Swiss Federal Archives and the E-ARK project that is becoming a standard in the area. DBPTK has a flexible plugin-based architecture enabling its use for other purposes like database upgrade and database migration between different systems. Presented real case scenarios demonstrate the usefulness, correctness and performance of the tool.The initial E-ARK project was in part supported by the European Commission within the Competitiveness and Innovation Programme 2007–2013, Grant Agreement no. 620998 under the Policy Support Programme.RoutledgeUniversidade do MinhoRamalho, José CarlosFerreira, BrunoFaria, LuísFerreira, Miguel20202020-01-01T00:00:00Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleapplication/pdfhttp://hdl.handle.net/1822/73404engJosé Carlos Ramalho, Bruno Ferreira, Luis Faria & Miguel Ferreira (2020) Beyond Relational Databases: Preserving the Data, New Review of Information Networking, 25:2, 107-118, DOI: 10.1080/13614576.2021.19193981361-45761740-786910.1080/13614576.2021.1919398https://www.tandfonline.com/doi/abs/10.1080/13614576.2021.1919398info:eu-repo/semantics/openAccessreponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos)instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãoinstacron:RCAAP2023-07-21T11:58:16Zoai:repositorium.sdum.uminho.pt:1822/73404Portal AgregadorONGhttps://www.rcaap.pt/oai/openaireopendoar:71602024-03-19T18:47:58.757547Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informaçãofalse |
dc.title.none.fl_str_mv |
Beyond relational databases: preserving the data |
title |
Beyond relational databases: preserving the data |
spellingShingle |
Beyond relational databases: preserving the data Ramalho, José Carlos Digital Archiving eArchiving Database preservation E-ARK SIARD Databases Relational databases RDMS Database preservation toolkit Engenharia e Tecnologia::Outras Engenharias e Tecnologias Paz, justiça e instituições eficazes |
title_short |
Beyond relational databases: preserving the data |
title_full |
Beyond relational databases: preserving the data |
title_fullStr |
Beyond relational databases: preserving the data |
title_full_unstemmed |
Beyond relational databases: preserving the data |
title_sort |
Beyond relational databases: preserving the data |
author |
Ramalho, José Carlos |
author_facet |
Ramalho, José Carlos Ferreira, Bruno Faria, Luís Ferreira, Miguel |
author_role |
author |
author2 |
Ferreira, Bruno Faria, Luís Ferreira, Miguel |
author2_role |
author author author |
dc.contributor.none.fl_str_mv |
Universidade do Minho |
dc.contributor.author.fl_str_mv |
Ramalho, José Carlos Ferreira, Bruno Faria, Luís Ferreira, Miguel |
dc.subject.por.fl_str_mv |
Digital Archiving eArchiving Database preservation E-ARK SIARD Databases Relational databases RDMS Database preservation toolkit Engenharia e Tecnologia::Outras Engenharias e Tecnologias Paz, justiça e instituições eficazes |
topic |
Digital Archiving eArchiving Database preservation E-ARK SIARD Databases Relational databases RDMS Database preservation toolkit Engenharia e Tecnologia::Outras Engenharias e Tecnologias Paz, justiça e instituições eficazes |
description |
Relational databases are one of the main technologies supporting information assets in today’s organizations. They are designed to store, organize and retrieve digital information, and are such a fundamental part of information systems that most would not be able to function without them. Very often, the information contained in databases is irreplaceable or prohibitively expensive to reacquire; therefore, steps must be taken to ensure that the information within databases is preserved. This paper describes a methodology for long-term preservation of relational databases based on information extraction and format migration to a preservation format. It also presents a tool that was developed to support this methodology: Database Preservation Toolkit (DBPTK), as well as the processes and formats needed to preserve databases. The DBPTK connects to live relational databases and extracts information into formats more adequate for long-term preservation. Supported preservation formats include the SIARD 2, created by a cooperation between the Swiss Federal Archives and the E-ARK project that is becoming a standard in the area. DBPTK has a flexible plugin-based architecture enabling its use for other purposes like database upgrade and database migration between different systems. Presented real case scenarios demonstrate the usefulness, correctness and performance of the tool. |
publishDate |
2020 |
dc.date.none.fl_str_mv |
2020 2020-01-01T00:00:00Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://hdl.handle.net/1822/73404 |
url |
http://hdl.handle.net/1822/73404 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
José Carlos Ramalho, Bruno Ferreira, Luis Faria & Miguel Ferreira (2020) Beyond Relational Databases: Preserving the Data, New Review of Information Networking, 25:2, 107-118, DOI: 10.1080/13614576.2021.1919398 1361-4576 1740-7869 10.1080/13614576.2021.1919398 https://www.tandfonline.com/doi/abs/10.1080/13614576.2021.1919398 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Routledge |
publisher.none.fl_str_mv |
Routledge |
dc.source.none.fl_str_mv |
reponame:Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) instname:Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação instacron:RCAAP |
instname_str |
Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
instacron_str |
RCAAP |
institution |
RCAAP |
reponame_str |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
collection |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) |
repository.name.fl_str_mv |
Repositório Científico de Acesso Aberto de Portugal (Repositórios Cientìficos) - Agência para a Sociedade do Conhecimento (UMIC) - FCT - Sociedade da Informação |
repository.mail.fl_str_mv |
|
_version_ |
1799132239808692224 |