A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses

Detalhes bibliográficos
Autor(a) principal: Valencio, Carlos Roberto [UNESP]
Data de Publicação: 2017
Outros Autores: Neto, Paulo Scarpelini [UNESP], Neves, Leandro Alves [UNESP], Zafalon, Geraldo Francisco Donega [UNESP], Souza, Rogeria Cristiane Gratao De [UNESP], Colombini, Angelo Cesar
Tipo de documento: Artigo de conferência
Idioma: eng
Título da fonte: Repositório Institucional da UNESP
Texto Completo: http://dx.doi.org/10.1109/PDCAT.2016.053
http://hdl.handle.net/11449/179002
Resumo: This work consists of the construction of a strategy called ETL-PoCon to execute Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) with a configurable policy. The original contribution of this work is to provide a strategy that considerably reduces the quantity of data transfers to active DW, besides maintaining a satisfactory level of data freshness. Said reduction is obtained by means of configurable policies of data propagation based on relevance of the data regarding to the information stored in the DW. The strategy was implemented in a database related to health worker that contains more than seventy thousand records of occupational accidents. Experiments have shown that the ETL-PoCon strategy significantly contributes towards a reduction of the overload on the systems involved in the active DW environment, since all results presented a reduction higher than 60% in the amount of DW refreshments.
id UNSP_909c138295078ccecc2a739c31c0c7d1
oai_identifier_str oai:repositorio.unesp.br:11449/179002
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
spelling A configurable strategy for extraction, transformation and load to support data propagation on active data warehousesActive data warehouseData warehouseETLNear real-Time data warehouseThis work consists of the construction of a strategy called ETL-PoCon to execute Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) with a configurable policy. The original contribution of this work is to provide a strategy that considerably reduces the quantity of data transfers to active DW, besides maintaining a satisfactory level of data freshness. Said reduction is obtained by means of configurable policies of data propagation based on relevance of the data regarding to the information stored in the DW. The strategy was implemented in a database related to health worker that contains more than seventy thousand records of occupational accidents. Experiments have shown that the ETL-PoCon strategy significantly contributes towards a reduction of the overload on the systems involved in the active DW environment, since all results presented a reduction higher than 60% in the amount of DW refreshments.Department of Computer Science and Statistics São Paulo State University (UNESP)Department of Computer Science and Statistics Federal University of São Carlos (UFSCar)Department of Computer Science and Statistics São Paulo State University (UNESP)Universidade Estadual Paulista (Unesp)Universidade Federal de São Carlos (UFSCar)Valencio, Carlos Roberto [UNESP]Neto, Paulo Scarpelini [UNESP]Neves, Leandro Alves [UNESP]Zafalon, Geraldo Francisco Donega [UNESP]Souza, Rogeria Cristiane Gratao De [UNESP]Colombini, Angelo Cesar2018-12-11T17:33:06Z2018-12-11T17:33:06Z2017-06-07info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/conferenceObject204-209http://dx.doi.org/10.1109/PDCAT.2016.053Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 204-209.http://hdl.handle.net/11449/17900210.1109/PDCAT.2016.0532-s2.0-85021920269464481225387583221390538148793120000-0002-9325-3159Scopusreponame:Repositório Institucional da UNESPinstname:Universidade Estadual Paulista (UNESP)instacron:UNESPengParallel and Distributed Computing, Applications and Technologies, PDCAT Proceedingsinfo:eu-repo/semantics/openAccess2021-10-23T21:47:03Zoai:repositorio.unesp.br:11449/179002Repositório InstitucionalPUBhttp://repositorio.unesp.br/oai/requestopendoar:29462024-08-05T18:23:08.111977Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)false
dc.title.none.fl_str_mv A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
title A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
spellingShingle A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
Valencio, Carlos Roberto [UNESP]
Active data warehouse
Data warehouse
ETL
Near real-Time data warehouse
title_short A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
title_full A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
title_fullStr A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
title_full_unstemmed A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
title_sort A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
author Valencio, Carlos Roberto [UNESP]
author_facet Valencio, Carlos Roberto [UNESP]
Neto, Paulo Scarpelini [UNESP]
Neves, Leandro Alves [UNESP]
Zafalon, Geraldo Francisco Donega [UNESP]
Souza, Rogeria Cristiane Gratao De [UNESP]
Colombini, Angelo Cesar
author_role author
author2 Neto, Paulo Scarpelini [UNESP]
Neves, Leandro Alves [UNESP]
Zafalon, Geraldo Francisco Donega [UNESP]
Souza, Rogeria Cristiane Gratao De [UNESP]
Colombini, Angelo Cesar
author2_role author
author
author
author
author
dc.contributor.none.fl_str_mv Universidade Estadual Paulista (Unesp)
Universidade Federal de São Carlos (UFSCar)
dc.contributor.author.fl_str_mv Valencio, Carlos Roberto [UNESP]
Neto, Paulo Scarpelini [UNESP]
Neves, Leandro Alves [UNESP]
Zafalon, Geraldo Francisco Donega [UNESP]
Souza, Rogeria Cristiane Gratao De [UNESP]
Colombini, Angelo Cesar
dc.subject.por.fl_str_mv Active data warehouse
Data warehouse
ETL
Near real-Time data warehouse
topic Active data warehouse
Data warehouse
ETL
Near real-Time data warehouse
description This work consists of the construction of a strategy called ETL-PoCon to execute Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) with a configurable policy. The original contribution of this work is to provide a strategy that considerably reduces the quantity of data transfers to active DW, besides maintaining a satisfactory level of data freshness. Said reduction is obtained by means of configurable policies of data propagation based on relevance of the data regarding to the information stored in the DW. The strategy was implemented in a database related to health worker that contains more than seventy thousand records of occupational accidents. Experiments have shown that the ETL-PoCon strategy significantly contributes towards a reduction of the overload on the systems involved in the active DW environment, since all results presented a reduction higher than 60% in the amount of DW refreshments.
publishDate 2017
dc.date.none.fl_str_mv 2017-06-07
2018-12-11T17:33:06Z
2018-12-11T17:33:06Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/conferenceObject
format conferenceObject
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://dx.doi.org/10.1109/PDCAT.2016.053
Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 204-209.
http://hdl.handle.net/11449/179002
10.1109/PDCAT.2016.053
2-s2.0-85021920269
4644812253875832
2139053814879312
0000-0002-9325-3159
url http://dx.doi.org/10.1109/PDCAT.2016.053
http://hdl.handle.net/11449/179002
identifier_str_mv Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 204-209.
10.1109/PDCAT.2016.053
2-s2.0-85021920269
4644812253875832
2139053814879312
0000-0002-9325-3159
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv 204-209
dc.source.none.fl_str_mv Scopus
reponame:Repositório Institucional da UNESP
instname:Universidade Estadual Paulista (UNESP)
instacron:UNESP
instname_str Universidade Estadual Paulista (UNESP)
instacron_str UNESP
institution UNESP
reponame_str Repositório Institucional da UNESP
collection Repositório Institucional da UNESP
repository.name.fl_str_mv Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP)
repository.mail.fl_str_mv
_version_ 1808128925431234560