A Configurable Strategy for Extraction, Transformation and Load to Support Data Propagation on Active Data Warehouses

Bibliographic details
Main author: Valencio, Carlos Roberto [UNESP]
Publication date: 2016
Other authors: Neto, Paulo Scarpelini [UNESP], Neves, Leandro Alves [UNESP], Donega Zafalon, Geraldo Francisco [UNESP], Gratao de Souza, Rogeria Cristiane [UNESP], Colombini, Angelo Cesar, Shen, H., Sang, Y., Tian, H.
Document type: Conference paper
Language: eng
Source title: Repositório Institucional da UNESP
Full text: http://dx.doi.org/10.1109/PDCAT.2016.52
http://hdl.handle.net/11449/165634
Abstract: This work presents ETL-PoCon, a strategy for executing Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) under a configurable policy. Its original contribution is to considerably reduce the quantity of data transfers to the active DW while maintaining a satisfactory level of data freshness. The reduction is obtained through configurable data propagation policies based on the relevance of the data to the information stored in the DW. The strategy was implemented on a health-worker database containing more than seventy thousand records of occupational accidents. Experiments show that ETL-PoCon significantly reduces the overload on the systems involved in the active DW environment: all results showed a reduction of more than 60% in the number of DW refreshes.
id UNSP_500f989866d6430c2b44c9c8371c4b1e
oai_identifier_str oai:repositorio.unesp.br:11449/165634
network_acronym_str UNSP
network_name_str Repositório Institucional da UNESP
repository_id_str 2946
Keywords: data warehouse; ETL; active data warehouse; near real-time data warehouse
Affiliations: Sao Paulo State Univ UNESP, Dept Comp Sci & Stat, Sao Paulo, Brazil; Univ Fed Sao Carlos, Dept Comp Sci & Stat, Sao Carlos, SP, Brazil
Institutions: Universidade Estadual Paulista (Unesp); Universidade Federal de São Carlos (UFSCar)
Citation: 2016 17th International Conference On Parallel And Distributed Computing, Applications And Technologies (PDCAT). New York: IEEE, p. 204-209, 2016.
Publisher: IEEE
Pages: 204-209
DOI: 10.1109/PDCAT.2016.52
Web of Science ID: WOS:000403774200043
Other identifiers: 4644812253875832; 2139053814879312; 0000-0002-9325-3159
Type: info:eu-repo/semantics/conferenceObject
Version: info:eu-repo/semantics/publishedVersion
Rights: info:eu-repo/semantics/openAccess
Source: Web of Science
Repository: Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP), http://repositorio.unesp.br/oai/request (opendoar:2946)
Record dates: 2018-11-28T13:17:40Z; 2021-10-23T21:47:03Z