A metadata curation framework for data ecosystems
Main author: | OLIVEIRA, Marcelo Iury de Sousa |
---|---|
Publication date: | 2019 |
Document type: | Thesis |
Language: | eng |
Source title: | Repositório Institucional da UFPE |
dARK ID: | ark:/64986/001300000f5kj |
Full text: | https://repositorio.ufpe.br/handle/123456789/33910 |
Abstract: | A Data Ecosystem can be defined as a complex socio-technical network that enables collaboration between autonomous actors in order to explore data. Such ecosystems provide an environment for creating, managing and sustaining data sharing initiatives. Although Data Ecosystems are arguably gaining importance, several of them are not sustainable, and consequently the effort spent by their actors ends up being improperly used or forgotten. A comprehensive and meaningful description of all Data Ecosystem resources is needed. The increasing recognition of metadata as an essential asset has motivated a growing demand for metadata curation solutions. Metadata curation covers the creation and harvesting of metadata, appraisal and selection of metadata, quality assurance, preservation of metadata and other lifecycle stages, and also involves a number of IT systems. While important, current metadata curation initiatives are generally a confusing mixture of activities, standards, terms and vocabularies, methods and tools. Reference guidelines would provide a basis for choosing standard terms and definitions, processes and practices, roles and deliverables for metadata curation practitioners. In this context, this thesis proposes a framework, called Louvre, which offers a wide range of processes to support metadata curation in Data Ecosystems. Each process describes a coherent set of engineering and management activities related to metadata curation. The Louvre structure is flexible and may be adapted to the needs of the actors interested in curating Data Ecosystem metadata. To this end, processes are organized into functional dimensions, enabling modularization of the framework. Louvre also provides a set of best practices aligned with principles of agile and open collaboration for managing curation work through the collaborative effort of self-organizing actors. Finally, the framework is based on the state of the art in the area. This research also contributes to the Data Ecosystem area by mapping its state of the art. In addition, it contributes to the understanding of several issues related to the creation and maintenance of Data Ecosystems. Also noteworthy is the definition, formalization and modelling of essential constructs related to Data Ecosystems. |
id |
UFPE_b3ff0302facb21d8849bcb3f48fbc131 |
---|---|
oai_identifier_str |
oai:repositorio.ufpe.br:123456789/33910 |
network_acronym_str |
UFPE |
network_name_str |
Repositório Institucional da UFPE |
repository_id_str |
2221 |
spelling |
OLIVEIRA, Marcelo Iury de Sousa; http://lattes.cnpq.br/2328386382232459; http://lattes.cnpq.br/2512064355660153; LÓSCIO, Bernadette Farias; 2019-09-27T20:48:13Z; 2019-09-27T20:48:13Z; 2019-02-22; https://repositorio.ufpe.br/handle/123456789/33910; ark:/64986/001300000f5kj; CAPES; CNPq; eng; Universidade Federal de Pernambuco; Programa de Pos Graduacao em Ciencia da Computacao; UFPE; Brasil; Attribution-NonCommercial-NoDerivs 3.0 Brazil; http://creativecommons.org/licenses/by-nc-nd/3.0/br/; info:eu-repo/semantics/openAccess; Banco de dados; Metamodelos; Gestão de metadados; A metadata curation framework for data ecosystems; info:eu-repo/semantics/publishedVersion; info:eu-repo/semantics/doctoralThesis; doctorate; reponame:Repositório Institucional da UFPE; instname:Universidade Federal de Pernambuco (UFPE); instacron:UFPE; THUMBNAIL: TESE Marcelo Iury de Souza Oliveira.pdf.jpg, Generated Thumbnail, image/jpeg, 1246, https://repositorio.ufpe.br/bitstream/123456789/33910/5/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.jpg, 99f9ccf1e3c32e26c82df7ce9e829d07, MD5, 5; ORIGINAL: TESE Marcelo Iury de Souza Oliveira.pdf, application/pdf, 6291603, https://repositorio.ufpe.br/bitstream/123456789/33910/1/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf, c5c83cd46fabb8117e10456432c24d0d, MD5, 1; CC-LICENSE: license_rdf, application/rdf+xml; charset=utf-8, 811, https://repositorio.ufpe.br/bitstream/123456789/33910/2/license_rdf, e39d27027a6cc9cb039ad269a5db8e34, MD5, 2; LICENSE: license.txt, text/plain; charset=utf-8, 2310, https://repositorio.ufpe.br/bitstream/123456789/33910/3/license.txt, bd573a5ca8288eb7272482765f819534, MD5, 3; TEXT: TESE Marcelo Iury de Souza Oliveira.pdf.txt, Extracted text, text/plain, 649783, https://repositorio.ufpe.br/bitstream/123456789/33910/4/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.txt, 29abdafcb2d1a5e098f64cd178684652, MD5, 4; 123456789/33910; 2021-07-15 19:41:09.689; oai:repositorio.ufpe.br:123456789/33910; [Base64-encoded deposit license text: "Licença de Distribuição Não Exclusiva" (Non-Exclusive Distribution License)]; Repositório Institucional; PUB; https://repositorio.ufpe.br/oai/request; attena@ufpe.br; opendoar:2221; 2021-07-15T22:41:09; Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE); false |
dc.title.pt_BR.fl_str_mv |
A metadata curation framework for data ecosystems |
title |
A metadata curation framework for data ecosystems |
spellingShingle |
A metadata curation framework for data ecosystems OLIVEIRA, Marcelo Iury de Sousa Banco de dados Metamodelos Gestão de metadados |
title_short |
A metadata curation framework for data ecosystems |
title_full |
A metadata curation framework for data ecosystems |
title_fullStr |
A metadata curation framework for data ecosystems |
title_full_unstemmed |
A metadata curation framework for data ecosystems |
title_sort |
A metadata curation framework for data ecosystems |
author |
OLIVEIRA, Marcelo Iury de Sousa |
author_facet |
OLIVEIRA, Marcelo Iury de Sousa |
author_role |
author |
dc.contributor.authorLattes.pt_BR.fl_str_mv |
http://lattes.cnpq.br/2328386382232459 |
dc.contributor.advisorLattes.pt_BR.fl_str_mv |
http://lattes.cnpq.br/2512064355660153 |
dc.contributor.author.fl_str_mv |
OLIVEIRA, Marcelo Iury de Sousa |
dc.contributor.advisor1.fl_str_mv |
LÓSCIO, Bernadette Farias |
contributor_str_mv |
LÓSCIO, Bernadette Farias |
dc.subject.por.fl_str_mv |
Banco de dados (Databases); Metamodelos (Metamodels); Gestão de metadados (Metadata management) |
topic |
Banco de dados (Databases); Metamodelos (Metamodels); Gestão de metadados (Metadata management) |
description |
A Data Ecosystem can be defined as a complex socio-technical network that enables collaboration between autonomous actors in order to explore data. Such ecosystems provide an environment for creating, managing and sustaining data sharing initiatives. Although Data Ecosystems are arguably gaining importance, several of them are not sustainable, and consequently the effort spent by their actors ends up being improperly used or forgotten. A comprehensive and meaningful description of all Data Ecosystem resources is needed. The increasing recognition of metadata as an essential asset has motivated a growing demand for metadata curation solutions. Metadata curation covers the creation and harvesting of metadata, appraisal and selection of metadata, quality assurance, preservation of metadata and other lifecycle stages, and also involves a number of IT systems. While important, current metadata curation initiatives are generally a confusing mixture of activities, standards, terms and vocabularies, methods and tools. Reference guidelines would provide a basis for choosing standard terms and definitions, processes and practices, roles and deliverables for metadata curation practitioners. In this context, this thesis proposes a framework, called Louvre, which offers a wide range of processes to support metadata curation in Data Ecosystems. Each process describes a coherent set of engineering and management activities related to metadata curation. The Louvre structure is flexible and may be adapted to the needs of the actors interested in curating Data Ecosystem metadata. To this end, processes are organized into functional dimensions, enabling modularization of the framework. Louvre also provides a set of best practices aligned with principles of agile and open collaboration for managing curation work through the collaborative effort of self-organizing actors. Finally, the framework is based on the state of the art in the area. This research also contributes to the Data Ecosystem area by mapping its state of the art. In addition, it contributes to the understanding of several issues related to the creation and maintenance of Data Ecosystems. Also noteworthy is the definition, formalization and modelling of essential constructs related to Data Ecosystems. |
publishDate |
2019 |
dc.date.accessioned.fl_str_mv |
2019-09-27T20:48:13Z |
dc.date.available.fl_str_mv |
2019-09-27T20:48:13Z |
dc.date.issued.fl_str_mv |
2019-02-22 |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/doctoralThesis |
format |
doctoralThesis |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://repositorio.ufpe.br/handle/123456789/33910 |
dc.identifier.dark.fl_str_mv |
ark:/64986/001300000f5kj |
url |
https://repositorio.ufpe.br/handle/123456789/33910 |
identifier_str_mv |
ark:/64986/001300000f5kj |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Attribution-NonCommercial-NoDerivs 3.0 Brazil http://creativecommons.org/licenses/by-nc-nd/3.0/br/ |
eu_rights_str_mv |
openAccess |
dc.publisher.none.fl_str_mv |
Universidade Federal de Pernambuco |
dc.publisher.program.fl_str_mv |
Programa de Pos Graduacao em Ciencia da Computacao |
dc.publisher.initials.fl_str_mv |
UFPE |
dc.publisher.country.fl_str_mv |
Brasil |
publisher.none.fl_str_mv |
Universidade Federal de Pernambuco |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da UFPE instname:Universidade Federal de Pernambuco (UFPE) instacron:UFPE |
instname_str |
Universidade Federal de Pernambuco (UFPE) |
instacron_str |
UFPE |
institution |
UFPE |
reponame_str |
Repositório Institucional da UFPE |
collection |
Repositório Institucional da UFPE |
bitstream.url.fl_str_mv |
https://repositorio.ufpe.br/bitstream/123456789/33910/5/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.jpg https://repositorio.ufpe.br/bitstream/123456789/33910/1/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf https://repositorio.ufpe.br/bitstream/123456789/33910/2/license_rdf https://repositorio.ufpe.br/bitstream/123456789/33910/3/license.txt https://repositorio.ufpe.br/bitstream/123456789/33910/4/TESE%20Marcelo%20Iury%20de%20Souza%20Oliveira.pdf.txt |
bitstream.checksum.fl_str_mv |
99f9ccf1e3c32e26c82df7ce9e829d07 c5c83cd46fabb8117e10456432c24d0d e39d27027a6cc9cb039ad269a5db8e34 bd573a5ca8288eb7272482765f819534 29abdafcb2d1a5e098f64cd178684652 |
bitstream.checksumAlgorithm.fl_str_mv |
MD5 MD5 MD5 MD5 MD5 |
repository.name.fl_str_mv |
Repositório Institucional da UFPE - Universidade Federal de Pernambuco (UFPE) |
repository.mail.fl_str_mv |
attena@ufpe.br |
_version_ |
1814448253025910784 |
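
The `bitstream.url.fl_str_mv`, `bitstream.checksum.fl_str_mv` and `bitstream.checksumAlgorithm.fl_str_mv` fields above each hold five space-separated values in the same order, so a downloaded file can be checked against its recorded MD5 digest. The following is a minimal Python sketch of that check, not part of the repository's own tooling. The function names `pair_bitstreams` and `checksum_matches` are hypothetical, and the sketch assumes the caller has already obtained the file content as bytes.

```python
import hashlib


def pair_bitstreams(urls_field: str, checksums_field: str, algorithms_field: str):
    """Split the three space-separated field values and pair them by position.

    Hypothetical helper: assumes the fields list bitstreams in the same order,
    as they do in this record.
    """
    urls = urls_field.split()
    checksums = checksums_field.split()
    algorithms = algorithms_field.split()
    if not (len(urls) == len(checksums) == len(algorithms)):
        raise ValueError("field values list different numbers of bitstreams")
    return list(zip(urls, checksums, algorithms))


def checksum_matches(data: bytes, expected_hex: str, algorithm: str = "MD5") -> bool:
    """Return True when the digest of `data` equals the recorded checksum."""
    digest = hashlib.new(algorithm.lower(), data).hexdigest()
    return digest == expected_hex.lower()


if __name__ == "__main__":
    # Two of the five bitstreams from the record, used only to show the pairing.
    pairs = pair_bitstreams(
        "https://repositorio.ufpe.br/bitstream/123456789/33910/2/license_rdf "
        "https://repositorio.ufpe.br/bitstream/123456789/33910/3/license.txt",
        "e39d27027a6cc9cb039ad269a5db8e34 bd573a5ca8288eb7272482765f819534",
        "MD5 MD5",
    )
    for url, checksum, algorithm in pairs:
        print(algorithm, checksum, url)
```

After fetching a file by any means, calling `checksum_matches(content, checksum, algorithm)` with the paired values reports whether the local copy agrees with the record. MD5 is adequate for detecting accidental corruption but should not be relied on as protection against deliberate tampering.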