A Configurable Strategy for Extraction, Transformation and Load to Support Data Propagation on Active Data Warehouses
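Main author: | Valencio, Carlos Roberto [UNESP] |
---|---|
Publication date: | 2016 |
Other authors: | Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Donega Zafalon, Geraldo Francisco [UNESP]; Gratao de Souza, Rogeria Cristiane [UNESP]; Colombini, Angelo Cesar; Shen, H.; Sang, Y.; Tian, H. |
Document type: | Conference paper |
Language: | eng |
Source title: | Repositório Institucional da UNESP |
Full text: | http://dx.doi.org/10.1109/PDCAT.2016.52 http://hdl.handle.net/11449/165634 |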
Abstract: | This work presents ETL-PoCon, a strategy for executing Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) under a configurable policy. Its original contribution is a strategy that considerably reduces the number of data transfers to an active DW while maintaining a satisfactory level of data freshness. The reduction is obtained through configurable data propagation policies based on the relevance of the data to the information stored in the DW. The strategy was implemented on a database of health workers that contains more than seventy thousand records of occupational accidents. Experiments show that ETL-PoCon contributes significantly to reducing the overload on the systems involved in the active DW environment: all results showed a reduction of more than 60% in the number of DW refreshes. |
id |
UNSP_500f989866d6430c2b44c9c8371c4b1e |
---|---|
oai_identifier_str |
oai:repositorio.unesp.br:11449/165634 |
network_acronym_str |
UNSP |
network_name_str |
Repositório Institucional da UNESP |
repository_id_str |
2946 |
spelling |
A Configurable Strategy for Extraction, Transformation and Load to Support Data Propagation on Active Data Warehouses; data warehouse; ETL; active data warehouse; near real-time data warehouse; This work presents ETL-PoCon, a strategy for executing Extraction, Transformation and Load (ETL) processes in active Data Warehouses (DW) under a configurable policy. Its original contribution is a strategy that considerably reduces the number of data transfers to an active DW while maintaining a satisfactory level of data freshness. The reduction is obtained through configurable data propagation policies based on the relevance of the data to the information stored in the DW. The strategy was implemented on a database of health workers that contains more than seventy thousand records of occupational accidents. Experiments show that ETL-PoCon contributes significantly to reducing the overload on the systems involved in the active DW environment: all results showed a reduction of more than 60% in the number of DW refreshes.; Sao Paulo State Univ UNESP, Dept Comp Sci & Stat, Sao Paulo, Brazil; Univ Fed Sao Carlos, Dept Comp Sci & Stat, Sao Carlos, SP, Brazil; IEEE; Universidade Estadual Paulista (Unesp); Universidade Federal de São Carlos (UFSCar); Valencio, Carlos Roberto [UNESP]; Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Donega Zafalon, Geraldo Francisco [UNESP]; Gratao de Souza, Rogeria Cristiane [UNESP]; Colombini, Angelo Cesar; Shen, H.; Sang, Y.; Tian, H.; 2018-11-28T13:17:40Z; 2018-11-28T13:17:40Z; 2016-01-01; info:eu-repo/semantics/publishedVersion; info:eu-repo/semantics/conferenceObject; 204-209; http://dx.doi.org/10.1109/PDCAT.2016.52; 2016 17th International Conference On Parallel And Distributed Computing, Applications And Technologies (pdcat). New York: IEEE, p. 204-209, 2016; http://hdl.handle.net/11449/165634; 10.1109/PDCAT.2016.52; WOS:000403774200043; 4644812253875832; 2139053814879312; 0000-0002-9325-3159; Web of Science; reponame:Repositório Institucional da UNESP; instname:Universidade Estadual Paulista (UNESP); instacron:UNESP; eng; 2016 17th International Conference On Parallel And Distributed Computing, Applications And Technologies (pdcat); info:eu-repo/semantics/openAccess; 2021-10-23T21:47:03Z; oai:repositorio.unesp.br:11449/165634; Repositório Institucional; PUB; http://repositorio.unesp.br/oai/request; opendoar:2946; 2024-08-05T22:46:17.251780; Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP); false |
dc.title.none.fl_str_mv |
A Configurable Strategy for Extraction, Transformation and Load to Support Data Propagation on Active Data Warehouses |
author |
Valencio, Carlos Roberto [UNESP] |
author_facet |
Valencio, Carlos Roberto [UNESP]; Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Donega Zafalon, Geraldo Francisco [UNESP]; Gratao de Souza, Rogeria Cristiane [UNESP]; Colombini, Angelo Cesar; Shen, H.; Sang, Y.; Tian, H. |
author_role |
author |
author2 |
Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Donega Zafalon, Geraldo Francisco [UNESP]; Gratao de Souza, Rogeria Cristiane [UNESP]; Colombini, Angelo Cesar; Shen, H.; Sang, Y.; Tian, H. |
author2_role |
author author author author author author author author |
dc.contributor.none.fl_str_mv |
Universidade Estadual Paulista (Unesp); Universidade Federal de São Carlos (UFSCar) |
dc.contributor.author.fl_str_mv |
Valencio, Carlos Roberto [UNESP]; Neto, Paulo Scarpelini [UNESP]; Neves, Leandro Alves [UNESP]; Donega Zafalon, Geraldo Francisco [UNESP]; Gratao de Souza, Rogeria Cristiane [UNESP]; Colombini, Angelo Cesar; Shen, H.; Sang, Y.; Tian, H. |
dc.subject.por.fl_str_mv |
data warehouse; ETL; active data warehouse; near real-time data warehouse |
publishDate |
2016 |
dc.date.none.fl_str_mv |
2016-01-01 2018-11-28T13:17:40Z 2018-11-28T13:17:40Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/conferenceObject |
format |
conferenceObject |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://dx.doi.org/10.1109/PDCAT.2016.52; 2016 17th International Conference On Parallel And Distributed Computing, Applications And Technologies (pdcat). New York: IEEE, p. 204-209, 2016; http://hdl.handle.net/11449/165634; 10.1109/PDCAT.2016.52; WOS:000403774200043; 4644812253875832; 2139053814879312; 0000-0002-9325-3159 |
url |
http://dx.doi.org/10.1109/PDCAT.2016.52 http://hdl.handle.net/11449/165634 |
identifier_str_mv |
2016 17th International Conference On Parallel And Distributed Computing, Applications And Technologies (pdcat). New York: IEEE, p. 204-209, 2016; 10.1109/PDCAT.2016.52; WOS:000403774200043; 4644812253875832; 2139053814879312; 0000-0002-9325-3159 |
dc.language.iso.fl_str_mv |
eng |
dc.relation.none.fl_str_mv |
2016 17th International Conference On Parallel And Distributed Computing, Applications And Technologies (pdcat) |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
204-209 |
dc.publisher.none.fl_str_mv |
IEEE |
dc.source.none.fl_str_mv |
Web of Science; reponame:Repositório Institucional da UNESP; instname:Universidade Estadual Paulista (UNESP); instacron:UNESP |
instname_str |
Universidade Estadual Paulista (UNESP) |
instacron_str |
UNESP |
institution |
UNESP |
reponame_str |
Repositório Institucional da UNESP |
collection |
Repositório Institucional da UNESP |
repository.name.fl_str_mv |
Repositório Institucional da UNESP - Universidade Estadual Paulista (UNESP) |