Intuitive: modelo conceitual para workflows de ETL
Autor(a) principal: | |
---|---|
Data de Publicação: | 2020 |
Tipo de documento: | Dissertação |
Idioma: | por |
Título da fonte: | Repositório Institucional da UFSCAR |
Texto Completo: | https://repositorio.ufscar.br/handle/ufscar/13968 |
Resumo: | The information domain is seen as a competitive differential in the most varied business areas, such as health, agribusiness, telecommunications, logistics, and government agencies. The correct and updated information is a valuable subsidy for corporative strategic decisions. Additionally, nowadays, huge volumes of data are generated at high speed and in various formats. In this context, research has been made to propose new models, architectures, processes, and algorithms that can contribute to transforming data into useful information for strategic decision making. In this scenario, a data warehousing environment plays a key role. The environment contains the data warehouse (DW), a huge repository with data that serves as a basis for responding to OLAP (Online Analytical Processing) queries. In a data warehousing environment, the ETL process is used to extract raw data from different data sources and to transform, clean, and integrate that data, loading to the DW. The ETL process is used for first data loading and, also for refreshing the data in the DW. This master's research investigated the best practices in conceptual modeling for ETL workflows and, as a result, proposes a new model, called “Intuitive”. The Intuitive Model adds simplicity, agility, clarity, and consistency to the modeling stage and can contribute to the improvement of construction and maintenance of ETL workflows. Theoretical analysis activities and practical experiments were performed with the users’ participation in order to validate the Intuitive Model. Such steps allowed us to evaluate that the elements of the Intuitive Model are sufficient to represent clearly several regular ETL scenarios showing advantages in comparison with the main related work in the state of the art. |
id |
SCAR_6cda1a0b1464334910651cc7a253a701 |
---|---|
oai_identifier_str |
oai:repositorio.ufscar.br:ufscar/13968 |
network_acronym_str |
SCAR |
network_name_str |
Repositório Institucional da UFSCAR |
repository_id_str |
4322 |
spelling |
Portes, Ana Célia Ribeiro BizigatoCiferri, Ricardo Rodrigueshttp://lattes.cnpq.br/8382221522817502http://lattes.cnpq.br/9091259735091455b98eb193-dd80-4f58-afab-75ef5cf94bfb2021-03-12T17:25:17Z2021-03-12T17:25:17Z2020-09-09PORTES, Ana Célia Ribeiro Bizigato. Intuitive: modelo conceitual para workflows de ETL. 2020. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/13968.https://repositorio.ufscar.br/handle/ufscar/13968The information domain is seen as a competitive differential in the most varied business areas, such as health, agribusiness, telecommunications, logistics, and government agencies. The correct and updated information is a valuable subsidy for corporative strategic decisions. Additionally, nowadays, huge volumes of data are generated at high speed and in various formats. In this context, research has been made to propose new models, architectures, processes, and algorithms that can contribute to transforming data into useful information for strategic decision making. In this scenario, a data warehousing environment plays a key role. The environment contains the data warehouse (DW), a huge repository with data that serves as a basis for responding to OLAP (Online Analytical Processing) queries. In a data warehousing environment, the ETL process is used to extract raw data from different data sources and to transform, clean, and integrate that data, loading to the DW. The ETL process is used for first data loading and, also for refreshing the data in the DW. This master's research investigated the best practices in conceptual modeling for ETL workflows and, as a result, proposes a new model, called “Intuitive”. The Intuitive Model adds simplicity, agility, clarity, and consistency to the modeling stage and can contribute to the improvement of construction and maintenance of ETL workflows. Theoretical analysis activities and practical experiments were performed with the users’ participation in order to validate the Intuitive Model. Such steps allowed us to evaluate that the elements of the Intuitive Model are sufficient to represent clearly several regular ETL scenarios showing advantages in comparison with the main related work in the state of the art.O domínio da informação é visto como um diferencial competitivo nas mais variadas áreas de negócio, tais como na saúde, agronegócio, telecomunicações, logística e em órgãos governamentais. A informação correta e atualizada é um valioso subsídio para decisões estratégicas nas corporações. Soma-se a isso o fato de que, atualmente, imensos volumes de dados são gerados em alta velocidade e em diversos formatos. Nesse contexto, pesquisas têm sido realizadas com o objetivo de propor novos modelos, arquiteturas, processos e algoritmos que possam contribuir para a transformação dos dados em informações úteis para a tomada de decisão estratégica. Nesse cenário, um ambiente de data warehousing exerce um papel fundamental. Esse ambiente contém o data warehouse (DW), que é o grande repositório que armazena dados extraídos de diversas fontes e que foram devidamente tratados e acurados. Os dados contidos no DW são usados para responder a consultas OLAP (Online Analytical Processing). Em um ambiente de data warehousing, o processo de ETL é usado para a extração dos dados brutos das diversas fontes de dados, seguido das etapas de transformação, limpeza e integração desses dados, para no final prover o armazenamento dos dados acurados no DW. Além da carga inicial dos dados, o pesquisa de ETL é usado para a constante atualização dos dados no DW. Esta pesquisa de Mestrado investigou as melhores práticas utilizadas na modelagem conceitual de workflows de ETL e, como resultado, propõe um novo modelo, denominado “Intuitive”, que adiciona simplicidade, agilidade, clareza e consistência à etapa de modelagem, podendo contribuir para melhorar a construção e a manutenção de workflows de ETL. Para a validação do modelo Intuitive forma realizadas atividades de análise teórica e, também, experimentos práticos com a participação de usuários. Tais atividades permitiram avaliar o modelo Intuitive, cujos elementos se mostraram suficientes para representar com clareza diversos cenários típicos de ETL demonstrando vantagens quando comparado ao principal trabalho relacionado no estado da arte.Não recebi financiamentoporUniversidade Federal de São CarlosCâmpus São CarlosPrograma de Pós-Graduação em Ciência da Computação - PPGCCUFSCarAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessModelagemModelagem conceitualData warehouseETLModelingConceptual modelingWorkflowCIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAOIntuitive: modelo conceitual para workflows de ETLIntuitive: conceptual model for ETL workflowsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesis6006003b1d5172-8bf0-4d0b-8777-ab82599bbf09reponame:Repositório Institucional da UFSCARinstname:Universidade Federal de São Carlos (UFSCAR)instacron:UFSCARORIGINALDissertação-AnaCélia.pdfDissertação-AnaCélia.pdfINTUITIVE - Modelo Conceitual para Workflows de ETLapplication/pdf3485710https://repositorio.ufscar.br/bitstream/ufscar/13968/8/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdfe83e6d6480f248f086d75e9e32f1041cMD58Autorização para entrega .pdfAutorização para entrega .pdfAutorização do orientador para entrega da dissertaçãoapplication/pdf375111https://repositorio.ufscar.br/bitstream/ufscar/13968/3/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf8689a687f1c9e26a4376fbee4f55fc98MD53CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufscar.br/bitstream/ufscar/13968/9/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD59TEXTDissertação-AnaCélia.pdf.txtDissertação-AnaCélia.pdf.txtExtracted texttext/plain193036https://repositorio.ufscar.br/bitstream/ufscar/13968/10/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf.txte496663797d965f0772d34b96347eb4eMD510Autorização para entrega .pdf.txtAutorização para entrega .pdf.txtExtracted texttext/plain1https://repositorio.ufscar.br/bitstream/ufscar/13968/12/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf.txt68b329da9893e34099c7d8ad5cb9c940MD512THUMBNAILDissertação-AnaCélia.pdf.jpgDissertação-AnaCélia.pdf.jpgIM Thumbnailimage/jpeg7446https://repositorio.ufscar.br/bitstream/ufscar/13968/11/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf.jpgd62fac5d7d763d290a07986159c02b8eMD511Autorização para entrega .pdf.jpgAutorização para entrega .pdf.jpgIM Thumbnailimage/jpeg12722https://repositorio.ufscar.br/bitstream/ufscar/13968/13/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf.jpg4a9f78fcb98654aa42b2bc593f5df2ccMD513ufscar/139682023-09-18 18:32:07.445oai:repositorio.ufscar.br:ufscar/13968Repositório InstitucionalPUBhttps://repositorio.ufscar.br/oai/requestopendoar:43222023-09-18T18:32:07Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)false |
dc.title.por.fl_str_mv |
Intuitive: modelo conceitual para workflows de ETL |
dc.title.alternative.eng.fl_str_mv |
Intuitive: conceptual model for ETL workflows |
title |
Intuitive: modelo conceitual para workflows de ETL |
spellingShingle |
Intuitive: modelo conceitual para workflows de ETL Portes, Ana Célia Ribeiro Bizigato Modelagem Modelagem conceitual Data warehouse ETL Modeling Conceptual modeling Workflow CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
title_short |
Intuitive: modelo conceitual para workflows de ETL |
title_full |
Intuitive: modelo conceitual para workflows de ETL |
title_fullStr |
Intuitive: modelo conceitual para workflows de ETL |
title_full_unstemmed |
Intuitive: modelo conceitual para workflows de ETL |
title_sort |
Intuitive: modelo conceitual para workflows de ETL |
author |
Portes, Ana Célia Ribeiro Bizigato |
author_facet |
Portes, Ana Célia Ribeiro Bizigato |
author_role |
author |
dc.contributor.authorlattes.por.fl_str_mv |
http://lattes.cnpq.br/9091259735091455 |
dc.contributor.author.fl_str_mv |
Portes, Ana Célia Ribeiro Bizigato |
dc.contributor.advisor1.fl_str_mv |
Ciferri, Ricardo Rodrigues |
dc.contributor.advisor1Lattes.fl_str_mv |
http://lattes.cnpq.br/8382221522817502 |
dc.contributor.authorID.fl_str_mv |
b98eb193-dd80-4f58-afab-75ef5cf94bfb |
contributor_str_mv |
Ciferri, Ricardo Rodrigues |
dc.subject.por.fl_str_mv |
Modelagem Modelagem conceitual |
topic |
Modelagem Modelagem conceitual Data warehouse ETL Modeling Conceptual modeling Workflow CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
dc.subject.eng.fl_str_mv |
Data warehouse ETL Modeling Conceptual modeling Workflow |
dc.subject.cnpq.fl_str_mv |
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO |
description |
The information domain is seen as a competitive differential in the most varied business areas, such as health, agribusiness, telecommunications, logistics, and government agencies. The correct and updated information is a valuable subsidy for corporative strategic decisions. Additionally, nowadays, huge volumes of data are generated at high speed and in various formats. In this context, research has been made to propose new models, architectures, processes, and algorithms that can contribute to transforming data into useful information for strategic decision making. In this scenario, a data warehousing environment plays a key role. The environment contains the data warehouse (DW), a huge repository with data that serves as a basis for responding to OLAP (Online Analytical Processing) queries. In a data warehousing environment, the ETL process is used to extract raw data from different data sources and to transform, clean, and integrate that data, loading to the DW. The ETL process is used for first data loading and, also for refreshing the data in the DW. This master's research investigated the best practices in conceptual modeling for ETL workflows and, as a result, proposes a new model, called “Intuitive”. The Intuitive Model adds simplicity, agility, clarity, and consistency to the modeling stage and can contribute to the improvement of construction and maintenance of ETL workflows. Theoretical analysis activities and practical experiments were performed with the users’ participation in order to validate the Intuitive Model. Such steps allowed us to evaluate that the elements of the Intuitive Model are sufficient to represent clearly several regular ETL scenarios showing advantages in comparison with the main related work in the state of the art. |
publishDate |
2020 |
dc.date.issued.fl_str_mv |
2020-09-09 |
dc.date.accessioned.fl_str_mv |
2021-03-12T17:25:17Z |
dc.date.available.fl_str_mv |
2021-03-12T17:25:17Z |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/masterThesis |
format |
masterThesis |
status_str |
publishedVersion |
dc.identifier.citation.fl_str_mv |
PORTES, Ana Célia Ribeiro Bizigato. Intuitive: modelo conceitual para workflows de ETL. 2020. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/13968. |
dc.identifier.uri.fl_str_mv |
https://repositorio.ufscar.br/handle/ufscar/13968 |
identifier_str_mv |
PORTES, Ana Célia Ribeiro Bizigato. Intuitive: modelo conceitual para workflows de ETL. 2020. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/13968. |
url |
https://repositorio.ufscar.br/handle/ufscar/13968 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.confidence.fl_str_mv |
600 600 |
dc.relation.authority.fl_str_mv |
3b1d5172-8bf0-4d0b-8777-ab82599bbf09 |
dc.rights.driver.fl_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos |
dc.publisher.program.fl_str_mv |
Programa de Pós-Graduação em Ciência da Computação - PPGCC |
dc.publisher.initials.fl_str_mv |
UFSCar |
publisher.none.fl_str_mv |
Universidade Federal de São Carlos Câmpus São Carlos |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFSCAR instname:Universidade Federal de São Carlos (UFSCAR) instacron:UFSCAR |
instname_str |
Universidade Federal de São Carlos (UFSCAR) |
instacron_str |
UFSCAR |
institution |
UFSCAR |
reponame_str |
Repositório Institucional da UFSCAR |
collection |
Repositório Institucional da UFSCAR |
bitstream.url.fl_str_mv |
https://repositorio.ufscar.br/bitstream/ufscar/13968/8/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf https://repositorio.ufscar.br/bitstream/ufscar/13968/3/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf https://repositorio.ufscar.br/bitstream/ufscar/13968/9/license_rdf https://repositorio.ufscar.br/bitstream/ufscar/13968/10/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf.txt https://repositorio.ufscar.br/bitstream/ufscar/13968/12/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf.txt https://repositorio.ufscar.br/bitstream/ufscar/13968/11/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf.jpg https://repositorio.ufscar.br/bitstream/ufscar/13968/13/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf.jpg |
bitstream.checksum.fl_str_mv |
e83e6d6480f248f086d75e9e32f1041c 8689a687f1c9e26a4376fbee4f55fc98 e39d27027a6cc9cb039ad269a5db8e34 e496663797d965f0772d34b96347eb4e 68b329da9893e34099c7d8ad5cb9c940 d62fac5d7d763d290a07986159c02b8e 4a9f78fcb98654aa42b2bc593f5df2cc |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR) |
repository.mail.fl_str_mv |
|
_version_ |
1813715624664236032 |