A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses
Main author: | Valencio, Carlos Roberto [UNESP] |
---|---|
Publication date: | 2017 |
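Other authors: | Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Zafalon, Geraldo Francisco Donega [UNESP]; Souza, Rogeria Cristiane Gratao De [UNESP]; Colombini, Angelo Cesar |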
Document type: | Conference paper |
Language: | eng |
Source title: | Repositório Institucional da UNESP |
Full text: | http://dx.doi.org/10.1109/PDCAT.2016.053 http://hdl.handle.net/11449/179002 |
Abstract: | This work presents ETL-PoCon, a strategy for executing Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) under a configurable policy. Its original contribution is a strategy that considerably reduces the number of data transfers to an active DW while maintaining a satisfactory level of data freshness. The reduction is obtained through configurable data propagation policies based on the relevance of the data to the information stored in the DW. The strategy was implemented on a health worker database containing more than seventy thousand records of occupational accidents. Experiments show that ETL-PoCon significantly reduces the load on the systems involved in the active DW environment: every result showed a reduction of more than 60% in the number of DW refreshes. |
id |
UNSP_909c138295078ccecc2a739c31c0c7d1 |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/179002 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses; Active data warehouse; Data warehouse; ETL; Near real-Time data warehouse; This work presents ETL-PoCon, a strategy for executing Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) under a configurable policy. Its original contribution is a strategy that considerably reduces the number of data transfers to an active DW while maintaining a satisfactory level of data freshness. The reduction is obtained through configurable data propagation policies based on the relevance of the data to the information stored in the DW. The strategy was implemented on a health worker database containing more than seventy thousand records of occupational accidents. Experiments show that ETL-PoCon significantly reduces the load on the systems involved in the active DW environment: every result showed a reduction of more than 60% in the number of DW refreshes.; Department of Computer Science and Statistics São Paulo State University (UNESP); Department of Computer Science and Statistics Federal University of São Carlos (UFSCar); Department of Computer Science and Statistics São Paulo State University (UNESP); Universidade Estadual Paulista (Unesp); Universidade Federal de São Carlos (UFSCar); Valencio, Carlos Roberto [UNESP]; Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Zafalon, Geraldo Francisco Donega [UNESP]; Souza, Rogeria Cristiane Gratao De [UNESP]; Colombini, Angelo Cesar; 2018-12-11T17:33:06Z; 2018-12-11T17:33:06Z; 2017-06-07; info:eu-repo/semantics/publishedVersion; info:eu-repo/semantics/conferenceObject; 204-209; http://dx.doi.org/10.1109/PDCAT.2016.053; Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 204-209.; http://hdl.handle.net/11449/179002; 10.1109/PDCAT.2016.053; 2-s2.0-85021920269; 4644812253875832; 2139053814879312; 0000-0002-9325-3159; Scopus; reponame:Repositório Institucional da UNESP; instname:Universidade Estadual Paulista (UNESP); instacron:UNESP; eng; Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings; info:eu-repo/semantics/openAccess; 2021-10-23T21:47:03Z; oai:repositorio.unesp.br:11449/179002; Repositório Institucional; PUB; http://repositorio.unesp.br/oai/request; opendoar:2946; 2024-08-05T18:23:08.111977; Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP); false |
dc.title.none.fl_str_mv |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
title |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
spellingShingle |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses; Valencio, Carlos Roberto [UNESP]; Active data warehouse; Data warehouse; ETL; Near real-Time data warehouse |
title_short |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
title_full |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
title_fullStr |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
title_full_unstemmed |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
title_sort |
A configurable strategy for extraction, transformation and load to support data propagation on active data warehouses |
author |
Valencio, Carlos Roberto [UNESP] |
author_facet |
Valencio, Carlos Roberto [UNESP]; Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Zafalon, Geraldo Francisco Donega [UNESP]; Souza, Rogeria Cristiane Gratao De [UNESP]; Colombini, Angelo Cesar |
author_role |
author |
author2 |
Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Zafalon, Geraldo Francisco Donega [UNESP]; Souza, Rogeria Cristiane Gratao De [UNESP]; Colombini, Angelo Cesar |
author2_role |
author author author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp); Universidade Federal de São Carlos (UFSCar) |
dc.contributor.author.fl_str_mv |
Valencio, Carlos Roberto [UNESP]; Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Zafalon, Geraldo Francisco Donega [UNESP]; Souza, Rogeria Cristiane Gratao De [UNESP]; Colombini, Angelo Cesar |
dc.subject.por.fl_str_mv |
Active data warehouse; Data warehouse; ETL; Near real-Time data warehouse |
topic |
Active data warehouse; Data warehouse; ETL; Near real-Time data warehouse |
description |
This work presents ETL-PoCon, a strategy for executing Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) under a configurable policy. Its original contribution is a strategy that considerably reduces the number of data transfers to an active DW while maintaining a satisfactory level of data freshness. The reduction is obtained through configurable data propagation policies based on the relevance of the data to the information stored in the DW. The strategy was implemented on a health worker database containing more than seventy thousand records of occupational accidents. Experiments show that ETL-PoCon significantly reduces the load on the systems involved in the active DW environment: every result showed a reduction of more than 60% in the number of DW refreshes. |
publishDate |
2017 |
dc.date.none.fl_str_mv |
2017-06-07; 2018-12-11T17:33:06Z; 2018-12-11T17:33:06Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1109/PDCAT.2016.053; Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 204-209.; http://hdl.handle.net/11449/179002; 10.1109/PDCAT.2016.053; 2-s2.0-85021920269; 4644812253875832; 2139053814879312; 0000-0002-9325-3159 |
url |
http://dx.doi.org/10.1109/PDCAT.2016.053 http://hdl.handle.net/11449/179002 |
identifier_str_mv |
Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings, p. 204-209.; 10.1109/PDCAT.2016.053; 2-s2.0-85021920269; 4644812253875832; 2139053814879312; 0000-0002-9325-3159 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
Parallel and Distributed Computing, Applications and Technologies, PDCAT Proceedings |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
204-209 |
dc.source.none.fl_str_mv |
Scopus; reponame:Repositório Institucional da UNESP; instname:Universidade Estadual Paulista (UNESP); instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |
repository.mail.fl_str_mv |
|
_version_ |
1808128925431234560 |