A metadata curation framework for data ecosystems

Detalhes bibliográficos
Autor(a) principal: OLIVEIRA, Marcelo Iury de Sousa
Data de Publicação: 2019
Tipo de documento: Tese
Idioma: eng
Título da fonte: Repositório Institucional da UFPE
dARK ID: ark:/64986/001300000f5kj
Texto Completo: https://repositorio.ufpe.br/handle/123456789/33910
Resumo: A Data Ecosystem can be defined as a complex socio-technical network that enables collaboration between autonomous actors in order to explore data. Such ecosystems provide an environment for creating, managing and sustaining data sharing initiatives. While Data Ecosystems are thus arguably gaining importance, several ecosystems are not sustainable and consequently the effort spent by their actors end up not being properly used or forgotten. A comprehensive and meaningful description of all Data Ecosystem resources is needed. The increasing recognition of metadata as an essential asset had motivated an increased demand for metadata curation solutions. Metadata curation covers the creation and harvesting of metadata, appraisal and selection of metadata, quality assurance, preservation of metadata and other lifecycle stages, and also involves a number of IT systems. While important, in general, the current initiatives on metadata curation are a confusing mixture of activities, standards, terms and vocabularies, methods and tools. Referential guidelines would provide a basis to choose standard terms and definitions, processes and practices, roles and deliverables for metadata curation practitioners. In this context, this thesis aims to propose a framework, called Louvre, which offers a wide range of processes for aiding to curate metadata in Data Ecosystems. Each process describes a coherent set of engineering and management activities related to metadata curation. The Louvre structure is flexible and may be adapted to the needs of the actors interested in curating Data Ecosystem metadata. In this sense, processes are organized in functional dimensions, enabling modularization of the framework. Louvre also provides a set of best practices aligned with principles of agile and open collaboration for managing curation work through the collaborative effort of self-organizing actors. Finally, the framework is based on state-of-the-art in the area. This research also contributes to the Data Ecosystem area, by mapping the state-of-the-art of Data Ecosystems. In addition, it contributes also to the understanding of several issues related to the Data Ecosystems creation and maintenance. Also noteworthy is the definition, formalization and modelling of essential constructs related to Data Ecosystems.
id UFPE_b3ff0302facb21d8849bcb3f48fbc131
oai_identifier_str oai:repositorio.ufpe.br:123456789/33910
network_acronym_str UFPE
network_name_str Repositório Institucional da UFPE
repository_id_str 2221
spelling OLIVEIRA, Marcelo Iury de Sousahttp://lattes.cnpq.br/2328386382232459http://lattes.cnpq.br/2512064355660153LÓSCIO, Bernadette Farias2019-09-27T20:48:13Z2019-09-27T20:48:13Z2019-02-22https://repositorio.ufpe.br/handle/123456789/33910ark:/64986/001300000f5kjA Data Ecosystem can be defined as a complex socio-technical network that enables collaboration between autonomous actors in order to explore data. Such ecosystems provide an environment for creating, managing and sustaining data sharing initiatives. While Data Ecosystems are thus arguably gaining importance, several ecosystems are not sustainable and consequently the effort spent by their actors end up not being properly used or forgotten. A comprehensive and meaningful description of all Data Ecosystem resources is needed. The increasing recognition of metadata as an essential asset had motivated an increased demand for metadata curation solutions. Metadata curation covers the creation and harvesting of metadata, appraisal and selection of metadata, quality assurance, preservation of metadata and other lifecycle stages, and also involves a number of IT systems. While important, in general, the current initiatives on metadata curation are a confusing mixture of activities, standards, terms and vocabularies, methods and tools. Referential guidelines would provide a basis to choose standard terms and definitions, processes and practices, roles and deliverables for metadata curation practitioners. In this context, this thesis aims to propose a framework, called Louvre, which offers a wide range of processes for aiding to curate metadata in Data Ecosystems. Each process describes a coherent set of engineering and management activities related to metadata curation. The Louvre structure is flexible and may be adapted to the needs of the actors interested in curating Data Ecosystem metadata. In this sense, processes are organized in functional dimensions, enabling modularization of the framework. Louvre also provides a set of best practices aligned with principles of agile and open collaboration for managing curation work through the collaborative effort of self-organizing actors. Finally, the framework is based on state-of-the-art in the area. This research also contributes to the Data Ecosystem area, by mapping the state-of-the-art of Data Ecosystems. In addition, it contributes also to the understanding of several issues related to the Data Ecosystems creation and maintenance. Also noteworthy is the definition, formalization and modelling of essential constructs related to Data Ecosystems.CAPESCNPqUm Ecossistema de Dados pode ser definido como uma rede sociotécnica complexa que permite a colaboração entre atores autônomos para explorar dados. Esses ecossistemas fornecem um ambiente para criar, gerenciar e sustentar iniciativas de compartilhamento de dados. Embora os Ecossistemas de Dados estejam ganhando importância, vários ecossistemas não são sustentáveis e, consequentemente, o esforço despendido por seus atores acaba não sendo adequadamente usado ou esquecido. É necessária uma descrição abrangente e significativa dos recursos do Ecossistema de Dados. O crescente reconhecimento dos metadados como um ativo essencial motivou uma demanda crescente por soluções de curadoria de metadados. A curadoria de metadados abrange a criação e a coleta de metadados, avaliação e seleção de metadados, garantia de qualidade, preservação de metadados e outras etapas do ciclo de vida. Assim como, a curadoria de metadados também envolve o uso de vários sistemas e ferramentas de gestão e preservação de metadados. Embora importantes, as atuais iniciativas de curadoria de metadados são uma mistura confusa de atividades, padrões, termos e vocabulários, métodos e ferramentas. As guidelines e modelos de referencia poderiam forcener uma base para escolher termos e definições padrão, processos e práticas, papéis e resultados para os profissionais de curadoria de metadados. Neste contexto, esta tese tem como objetivo propor um framework, denominado Louvre, que oferece uma ampla gama de processos para auxiliar na organização de metadados em Ecossistemas de Dados. Cada processo descreve um conjunto coerente de atividades de engenharia e gerenciamento relacionadas à curadoria de metadados. A estrutura do Louvre é flexível e pode ser adaptada às necessidades dos atores interessados em realizar a curadoria de metadados. Nesse sentido, os processos são organizados em dimensões funcionais, possibilitando a modularização do framework. O Louvre também fornece um conjunto de práticas recomendadas alinhadas com princípios de desenvolmento ágil e colaboração aberta para gerenciar o trabalho de curadoria através do esforço colaborativo de atores auto-organizados. Esta pesquisa também contribui para a área de Ecossistemas de Dados, mapeando o estado da arte da área. Além disso, este trabalho ainda contribui para o entendimento de várias questões relacionadas à criação e manutenção de Ecossistemas de Dados. Destaca-se também a definição, formalização e modelagem de constructos essenciais relacionados a Ecossistemas de Dados.engUniversidade Federal de PernambucoPrograma de Pos Graduacao em Ciencia da ComputacaoUFPEBrasilAttribution-NonCommercial-NoDerivs 3.0 Brazilhttp://creativecommons.org/licenses/by-nc-nd/3.0/br/info:eu-repo/semantics/openAccessBanco de dadosMetamodelosGestão de metadadosA metadata curation framework for data ecosystemsinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/doctoralThesisdoutoradoreponame:Repositório Institucional da UFPEinstname:Universidade Federal de Pernambuco (UFPE)instacron:UFPETHUMBNAILTESE Marcelo Iury de Souza Oliveira.pdf.jpgTESE Marcelo Iury de Souza Oliveira.pdf.jpgGenerated Thumbnailimage/jpeg1246https://repositorio.ufpe.br/bitstream/123456789/33910/5/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.jpg99f9ccf1e3c32e26c82df7ce9e829d07MD55ORIGINALTESE Marcelo Iury de Souza Oliveira.pdfTESE Marcelo Iury de Souza Oliveira.pdfapplication/pdf6291603https://repositorio.ufpe.br/bitstream/123456789/33910/1/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdfc5c83cd46fabb8117e10456432c24d0dMD51CC-LICENSElicense_rdflicense_rdfapplication/rdf+xml; charset=utf-8811https://repositorio.ufpe.br/bitstream/123456789/33910/2/license_rdfe39d27027a6cc9cb039ad269a5db8e34MD52LICENSElicense.txtlicense.txttext/plain; charset=utf-82310https://repositorio.ufpe.br/bitstream/123456789/33910/3/license.txtbd573a5ca8288eb7272482765f819534MD53TEXTTESE Marcelo Iury de Souza Oliveira.pdf.txtTESE Marcelo Iury de Souza Oliveira.pdf.txtExtracted texttext/plain649783https://repositorio.ufpe.br/bitstream/123456789/33910/4/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.txt29abdafcb2d1a5e098f64cd178684652MD54123456789/339102021-07-15 19:41:09.689oai:repositorio.ufpe.br:123456789/33910TGljZW7Dp2EgZGUgRGlzdHJpYnVpw6fDo28gTsOjbyBFeGNsdXNpdmEKClRvZG8gZGVwb3NpdGFudGUgZGUgbWF0ZXJpYWwgbm8gUmVwb3NpdMOzcmlvIEluc3RpdHVjaW9uYWwgKFJJKSBkZXZlIGNvbmNlZGVyLCDDoCBVbml2ZXJzaWRhZGUgRmVkZXJhbCBkZSBQZXJuYW1idWNvIChVRlBFKSwgdW1hIExpY2Vuw6dhIGRlIERpc3RyaWJ1acOnw6NvIE7Do28gRXhjbHVzaXZhIHBhcmEgbWFudGVyIGUgdG9ybmFyIGFjZXNzw612ZWlzIG9zIHNldXMgZG9jdW1lbnRvcywgZW0gZm9ybWF0byBkaWdpdGFsLCBuZXN0ZSByZXBvc2l0w7NyaW8uCgpDb20gYSBjb25jZXNzw6NvIGRlc3RhIGxpY2Vuw6dhIG7Do28gZXhjbHVzaXZhLCBvIGRlcG9zaXRhbnRlIG1hbnTDqW0gdG9kb3Mgb3MgZGlyZWl0b3MgZGUgYXV0b3IuCl9fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fXwoKTGljZW7Dp2EgZGUgRGlzdHJpYnVpw6fDo28gTsOjbyBFeGNsdXNpdmEKCkFvIGNvbmNvcmRhciBjb20gZXN0YSBsaWNlbsOnYSBlIGFjZWl0w6EtbGEsIHZvY8OqIChhdXRvciBvdSBkZXRlbnRvciBkb3MgZGlyZWl0b3MgYXV0b3JhaXMpOgoKYSkgRGVjbGFyYSBxdWUgY29uaGVjZSBhIHBvbMOtdGljYSBkZSBjb3B5cmlnaHQgZGEgZWRpdG9yYSBkbyBzZXUgZG9jdW1lbnRvOwpiKSBEZWNsYXJhIHF1ZSBjb25oZWNlIGUgYWNlaXRhIGFzIERpcmV0cml6ZXMgcGFyYSBvIFJlcG9zaXTDs3JpbyBJbnN0aXR1Y2lvbmFsIGRhIFVGUEU7CmMpIENvbmNlZGUgw6AgVUZQRSBvIGRpcmVpdG8gbsOjbyBleGNsdXNpdm8gZGUgYXJxdWl2YXIsIHJlcHJvZHV6aXIsIGNvbnZlcnRlciAoY29tbyBkZWZpbmlkbyBhIHNlZ3VpciksIGNvbXVuaWNhciBlL291IGRpc3RyaWJ1aXIsIG5vIFJJLCBvIGRvY3VtZW50byBlbnRyZWd1ZSAoaW5jbHVpbmRvIG8gcmVzdW1vL2Fic3RyYWN0KSBlbSBmb3JtYXRvIGRpZ2l0YWwgb3UgcG9yIG91dHJvIG1laW87CmQpIERlY2xhcmEgcXVlIGF1dG9yaXphIGEgVUZQRSBhIGFycXVpdmFyIG1haXMgZGUgdW1hIGPDs3BpYSBkZXN0ZSBkb2N1bWVudG8gZSBjb252ZXJ0w6otbG8sIHNlbSBhbHRlcmFyIG8gc2V1IGNvbnRlw7pkbywgcGFyYSBxdWFscXVlciBmb3JtYXRvIGRlIGZpY2hlaXJvLCBtZWlvIG91IHN1cG9ydGUsIHBhcmEgZWZlaXRvcyBkZSBzZWd1cmFuw6dhLCBwcmVzZXJ2YcOnw6NvIChiYWNrdXApIGUgYWNlc3NvOwplKSBEZWNsYXJhIHF1ZSBvIGRvY3VtZW50byBzdWJtZXRpZG8gw6kgbyBzZXUgdHJhYmFsaG8gb3JpZ2luYWwgZSBxdWUgZGV0w6ltIG8gZGlyZWl0byBkZSBjb25jZWRlciBhIHRlcmNlaXJvcyBvcyBkaXJlaXRvcyBjb250aWRvcyBuZXN0YSBsaWNlbsOnYS4gRGVjbGFyYSB0YW1iw6ltIHF1ZSBhIGVudHJlZ2EgZG8gZG9jdW1lbnRvIG7Do28gaW5mcmluZ2Ugb3MgZGlyZWl0b3MgZGUgb3V0cmEgcGVzc29hIG91IGVudGlkYWRlOwpmKSBEZWNsYXJhIHF1ZSwgbm8gY2FzbyBkbyBkb2N1bWVudG8gc3VibWV0aWRvIGNvbnRlciBtYXRlcmlhbCBkbyBxdWFsIG7Do28gZGV0w6ltIG9zIGRpcmVpdG9zIGRlCmF1dG9yLCBvYnRldmUgYSBhdXRvcml6YcOnw6NvIGlycmVzdHJpdGEgZG8gcmVzcGVjdGl2byBkZXRlbnRvciBkZXNzZXMgZGlyZWl0b3MgcGFyYSBjZWRlciDDoApVRlBFIG9zIGRpcmVpdG9zIHJlcXVlcmlkb3MgcG9yIGVzdGEgTGljZW7Dp2EgZSBhdXRvcml6YXIgYSB1bml2ZXJzaWRhZGUgYSB1dGlsaXrDoS1sb3MgbGVnYWxtZW50ZS4gRGVjbGFyYSB0YW1iw6ltIHF1ZSBlc3NlIG1hdGVyaWFsIGN1am9zIGRpcmVpdG9zIHPDo28gZGUgdGVyY2Vpcm9zIGVzdMOhIGNsYXJhbWVudGUgaWRlbnRpZmljYWRvIGUgcmVjb25oZWNpZG8gbm8gdGV4dG8gb3UgY29udGXDumRvIGRvIGRvY3VtZW50byBlbnRyZWd1ZTsKZykgU2UgbyBkb2N1bWVudG8gZW50cmVndWUgw6kgYmFzZWFkbyBlbSB0cmFiYWxobyBmaW5hbmNpYWRvIG91IGFwb2lhZG8gcG9yIG91dHJhIGluc3RpdHVpw6fDo28gcXVlIG7Do28gYSBVRlBFLCBkZWNsYXJhIHF1ZSBjdW1wcml1IHF1YWlzcXVlciBvYnJpZ2HDp8O1ZXMgZXhpZ2lkYXMgcGVsbyByZXNwZWN0aXZvIGNvbnRyYXRvIG91IGFjb3Jkby4KCkEgVUZQRSBpZGVudGlmaWNhcsOhIGNsYXJhbWVudGUgbyhzKSBub21lKHMpIGRvKHMpIGF1dG9yIChlcykgZG9zIGRpcmVpdG9zIGRvIGRvY3VtZW50byBlbnRyZWd1ZSBlIG7Do28gZmFyw6EgcXVhbHF1ZXIgYWx0ZXJhw6fDo28sIHBhcmEgYWzDqW0gZG8gcHJldmlzdG8gbmEgYWzDrW5lYSBjKS4KRepositório InstitucionalPUBhttps://repositorio.ufpe.br/oai/requestattena@ufpe.bropendoar:22212021-07-15T22:41:09Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)false
dc.title.pt_BR.fl_str_mv A metadata curation framework for data ecosystems
title A metadata curation framework for data ecosystems
spellingShingle A metadata curation framework for data ecosystems
OLIVEIRA, Marcelo Iury de Sousa
Banco de dados
Metamodelos
Gestão de metadados
title_short A metadata curation framework for data ecosystems
title_full A metadata curation framework for data ecosystems
title_fullStr A metadata curation framework for data ecosystems
title_full_unstemmed A metadata curation framework for data ecosystems
title_sort A metadata curation framework for data ecosystems
author OLIVEIRA, Marcelo Iury de Sousa
author_facet OLIVEIRA, Marcelo Iury de Sousa
author_role author
dc.contributor.authorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/2328386382232459
dc.contributor.advisorLattes.pt_BR.fl_str_mv http://lattes.cnpq.br/2512064355660153
dc.contributor.author.fl_str_mv OLIVEIRA, Marcelo Iury de Sousa
dc.contributor.advisor1.fl_str_mv LÓSCIO, Bernadette Farias
contributor_str_mv LÓSCIO, Bernadette Farias
dc.subject.por.fl_str_mv Banco de dados
Metamodelos
Gestão de metadados
topic Banco de dados
Metamodelos
Gestão de metadados
description A Data Ecosystem can be defined as a complex socio-technical network that enables collaboration between autonomous actors in order to explore data. Such ecosystems provide an environment for creating, managing and sustaining data sharing initiatives. While Data Ecosystems are thus arguably gaining importance, several ecosystems are not sustainable and consequently the effort spent by their actors end up not being properly used or forgotten. A comprehensive and meaningful description of all Data Ecosystem resources is needed. The increasing recognition of metadata as an essential asset had motivated an increased demand for metadata curation solutions. Metadata curation covers the creation and harvesting of metadata, appraisal and selection of metadata, quality assurance, preservation of metadata and other lifecycle stages, and also involves a number of IT systems. While important, in general, the current initiatives on metadata curation are a confusing mixture of activities, standards, terms and vocabularies, methods and tools. Referential guidelines would provide a basis to choose standard terms and definitions, processes and practices, roles and deliverables for metadata curation practitioners. In this context, this thesis aims to propose a framework, called Louvre, which offers a wide range of processes for aiding to curate metadata in Data Ecosystems. Each process describes a coherent set of engineering and management activities related to metadata curation. The Louvre structure is flexible and may be adapted to the needs of the actors interested in curating Data Ecosystem metadata. In this sense, processes are organized in functional dimensions, enabling modularization of the framework. Louvre also provides a set of best practices aligned with principles of agile and open collaboration for managing curation work through the collaborative effort of self-organizing actors. Finally, the framework is based on state-of-the-art in the area. This research also contributes to the Data Ecosystem area, by mapping the state-of-the-art of Data Ecosystems. In addition, it contributes also to the understanding of several issues related to the Data Ecosystems creation and maintenance. Also noteworthy is the definition, formalization and modelling of essential constructs related to Data Ecosystems.
publishDate 2019
dc.date.accessioned.fl_str_mv 2019-09-27T20:48:13Z
dc.date.available.fl_str_mv 2019-09-27T20:48:13Z
dc.date.issued.fl_str_mv 2019-02-22
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/doctoralThesis
format doctoralThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://repositorio.ufpe.br/handle/123456789/33910
dc.identifier.dark.fl_str_mv ark:/64986/001300000f5kj
url https://repositorio.ufpe.br/handle/123456789/33910
identifier_str_mv ark:/64986/001300000f5kj
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Attribution-NonCommercial-NoDerivs 3.0 Brazil
http://creativecommons.org/licenses/by-nc-nd/3.0/br/
eu_rights_str_mv openAccess
dc.publisher.none.fl_str_mv Universidade Federal de Pernambuco
dc.publisher.program.fl_str_mv Programa de Pos Graduacao em Ciencia da Computacao
dc.publisher.initials.fl_str_mv UFPE
dc.publisher.country.fl_str_mv Brasil
publisher.none.fl_str_mv Universidade Federal de Pernambuco
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFPE
instname:Universidade Federal de Pernambuco (UFPE)
instacron:UFPE
instname_str Universidade Federal de Pernambuco (UFPE)
instacron_str UFPE
institution UFPE
reponame_str Repositório Institucional da UFPE
collection Repositório Institucional da UFPE
bitstream.url.fl_str_mv https://repositorio.ufpe.br/bitstream/123456789/33910/5/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.jpg
https://repositorio.ufpe.br/bitstream/123456789/33910/1/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf
https://repositorio.ufpe.br/bitstream/123456789/33910/2/license_rdf
https://repositorio.ufpe.br/bitstream/123456789/33910/3/license.txt
https://repositorio.ufpe.br/bitstream/123456789/33910/4/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.txt
bitstream.checksum.fl_str_mv 99f9ccf1e3c32e26c82df7ce9e829d07
c5c83cd46fabb8117e10456432c24d0d
e39d27027a6cc9cb039ad269a5db8e34
bd573a5ca8288eb7272482765f819534
29abdafcb2d1a5e098f64cd178684652
bitstream.checksumAlgorithm.fl_str_mv MD5
MD5
MD5
MD5
MD5
repository.name.fl_str_mv Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE)
repository.mail.fl_str_mv attena@ufpe.br
_version_ 1814448253025910784