Intuitive: modelo conceitual para workflows de ETL

Bibliographic details
Main author: Portes, Ana Célia Ribeiro Bizigato
Publication date: 2020
Document type: Master's thesis
Language: por
Source title: Repositório Institucional da UFSCAR
Full text: https://repositorio.ufscar.br/handle/ufscar/13968
Abstract: Command of information is seen as a competitive advantage in a wide range of business areas, such as health, agribusiness, telecommunications, logistics, and government agencies. Correct and up-to-date information is valuable input for corporate strategic decisions. In addition, huge volumes of data are now generated at high speed and in a variety of formats. In this context, research has been conducted to propose new models, architectures, processes, and algorithms that can help transform data into useful information for strategic decision making. In this scenario, a data warehousing environment plays a key role. The environment contains the data warehouse (DW), a large repository whose data serves as the basis for answering OLAP (Online Analytical Processing) queries. In a data warehousing environment, the ETL process extracts raw data from different data sources and transforms, cleans, and integrates that data before loading it into the DW. The ETL process is used both for the initial data load and for refreshing the data in the DW. This master's research investigated best practices in conceptual modeling for ETL workflows and, as a result, proposes a new model called “Intuitive”. The Intuitive Model adds simplicity, agility, clarity, and consistency to the modeling stage and can help improve the construction and maintenance of ETL workflows. To validate the Intuitive Model, theoretical analyses and practical experiments with user participation were carried out. These activities showed that the elements of the Intuitive Model are sufficient to clearly represent several typical ETL scenarios, with advantages over the main related work in the state of the art.
id SCAR_6cda1a0b1464334910651cc7a253a701
oai_identifier_str oai:repositorio.ufscar.br:ufscar/13968
network_acronym_str SCAR
network_name_str Repositório Institucional da UFSCAR
repository_id_str 4322
Funding: no funding received
Files (bundle, file name, description, MIME type, size):
ORIGINAL, Dissertação-AnaCélia.pdf, INTUITIVE - Modelo Conceitual para Workflows de ETL, application/pdf, 3485710
ORIGINAL, Autorização para entrega .pdf, Autorização do orientador para entrega da dissertação, application/pdf, 375111
CC-LICENSE, license_rdf, application/rdf+xml; charset=utf-8, 811
TEXT, Dissertação-AnaCélia.pdf.txt, Extracted text, text/plain, 193036
TEXT, Autorização para entrega .pdf.txt, Extracted text, text/plain, 1
THUMBNAIL, Dissertação-AnaCélia.pdf.jpg, IM Thumbnail, image/jpeg, 7446
THUMBNAIL, Autorização para entrega .pdf.jpg, IM Thumbnail, image/jpeg, 12722
OAI endpoint: https://repositorio.ufscar.br/oai/request (opendoar:4322)
Last modified: 2023-09-18T18:32:07
dc.title.por.fl_str_mv Intuitive: modelo conceitual para workflows de ETL
dc.title.alternative.eng.fl_str_mv Intuitive: conceptual model for ETL workflows
title Intuitive: modelo conceitual para workflows de ETL
spellingShingle Intuitive: modelo conceitual para workflows de ETL
Portes, Ana Célia Ribeiro Bizigato
Modelagem
Modelagem conceitual
Data warehouse
ETL
Modeling
Conceptual modeling
Workflow
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
title_short Intuitive: modelo conceitual para workflows de ETL
title_full Intuitive: modelo conceitual para workflows de ETL
title_fullStr Intuitive: modelo conceitual para workflows de ETL
title_full_unstemmed Intuitive: modelo conceitual para workflows de ETL
title_sort Intuitive: modelo conceitual para workflows de ETL
author Portes, Ana Célia Ribeiro Bizigato
author_facet Portes, Ana Célia Ribeiro Bizigato
author_role author
dc.contributor.authorlattes.por.fl_str_mv http://lattes.cnpq.br/9091259735091455
dc.contributor.author.fl_str_mv Portes, Ana Célia Ribeiro Bizigato
dc.contributor.advisor1.fl_str_mv Ciferri, Ricardo Rodrigues
dc.contributor.advisor1Lattes.fl_str_mv http://lattes.cnpq.br/8382221522817502
dc.contributor.authorID.fl_str_mv b98eb193-dd80-4f58-afab-75ef5cf94bfb
contributor_str_mv Ciferri, Ricardo Rodrigues
dc.subject.por.fl_str_mv Modelagem
Modelagem conceitual
topic Modelagem
Modelagem conceitual
Data warehouse
ETL
Modeling
Conceptual modeling
Workflow
CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
dc.subject.eng.fl_str_mv Data warehouse
ETL
Modeling
Conceptual modeling
Workflow
dc.subject.cnpq.fl_str_mv CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
publishDate 2020
dc.date.issued.fl_str_mv 2020-09-09
dc.date.accessioned.fl_str_mv 2021-03-12T17:25:17Z
dc.date.available.fl_str_mv 2021-03-12T17:25:17Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.citation.fl_str_mv PORTES, Ana Célia Ribeiro Bizigato. Intuitive: modelo conceitual para workflows de ETL. 2020. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/13968.
dc.identifier.uri.fl_str_mv https://repositorio.ufscar.br/handle/ufscar/13968
identifier_str_mv PORTES, Ana Célia Ribeiro Bizigato. Intuitive: modelo conceitual para workflows de ETL. 2020. Dissertação (Mestrado em Ciência da Computação) – Universidade Federal de São Carlos, São Carlos, 2020. Disponível em: https://repositorio.ufscar.br/handle/ufscar/13968.
url https://repositorio.ufscar.br/handle/ufscar/13968
dc.language.iso.fl_str_mv por
language por
dc.relation.confidence.fl_str_mv 600
600
dc.relation.authority.fl_str_mv 3b1d5172-8bf0-4d0b-8777-ab82599bbf09
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de São Carlos
Câmpus São Carlos
dc.publisher.program.fl_str_mv Programa de Pós-Graduação em Ciência da Computação - PPGCC
dc.publisher.initials.fl_str_mv UFSCar
publisher.none.fl_str_mv Universidade Federal de São Carlos
Câmpus São Carlos
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFSCAR
instname:Universidade Federal de São Carlos (UFSCAR)
instacron:UFSCAR
instname_str Universidade Federal de São Carlos (UFSCAR)
instacron_str UFSCAR
institution UFSCAR
reponame_str Repositório Institucional da UFSCAR
collection Repositório Institucional da UFSCAR
bitstream.url.fl_str_mv https://repositorio.ufscar.br/bitstream/ufscar/13968/8/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf
https://repositorio.ufscar.br/bitstream/ufscar/13968/3/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf
https://repositorio.ufscar.br/bitstream/ufscar/13968/9/license_rdf
https://repositorio.ufscar.br/bitstream/ufscar/13968/10/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf.txt
https://repositorio.ufscar.br/bitstream/ufscar/13968/12/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf.txt
https://repositorio.ufscar.br/bitstream/ufscar/13968/11/Disserta%c3%a7%c3%a3o-AnaC%c3%a9lia.pdf.jpg
https://repositorio.ufscar.br/bitstream/ufscar/13968/13/Autoriza%c3%a7%c3%a3o%20para%20entrega%20.pdf.jpg
bitstream.checksum.fl_str_mv e83e6d6480f248f086d75e9e32f1041c
8689a687f1c9e26a4376fbee4f55fc98
e39d27027a6cc9cb039ad269a5db8e34
e496663797d965f0772d34b96347eb4e
68b329da9893e34099c7d8ad5cb9c940
d62fac5d7d763d290a07986159c02b8e
4a9f78fcb98654aa42b2bc593f5df2cc
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFSCAR - Universidade Federal de São Carlos (UFSCAR)
repository.mail.fl_str_mv
_version_ 1813715624664236032