The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)

Detalhes bibliográficos
Autor(a) principal: Jesus, Ananda Fernanda de
Data de Publicação: 2023
Outros Autores: Santarem Segundo, José Eduardo
Tipo de documento: Artigo
Idioma: por
Título da fonte: Em Questão (Online)
Texto Completo: https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415
Resumo: The quality assessment process plays an important role in the reuse of data made available on the Web. To ensure the use and reuse of these data, it is necessary to formally describe them in a way that computational agents can understand. One of the possibilities to make this description viable is the Data Quality Vocabulary, elaborated by the World Wide Web Consortium. The objective was to verify the impact of the Data Quality Vocabulary in the process of formal description of the quality of data published on the Web, analyzing the objectives, characteristics, and structure of the vocabulary. The research has an exploratory and descriptive character, adopting as a method a study of the official documentation published by the consortium. As a result, an overview of the scenario that led to the development of the vocabulary was obtained, its structure was presented and its potential application was discussed. It is concluded that the Data Quality Vocabulary provides a general and customizable descriptive structure for providing the results of the data quality assessment process, which allows these results to be shared by its providers. It also allows the community to participate in the evaluation process and formally share the results obtained, thus reducing rework. It is also concluded that the vocabulary contributes to the reuse of data in the context of the Web by facilitating the use of automatic and semi-automatic tools in the evaluation and selection of data sources for the application. 
id UFRGS-8_6f037439cfa8e0b4aad335ee649e3941
oai_identifier_str oai:seer.ufrgs.br:article/129415
network_acronym_str UFRGS-8
network_name_str Em Questão (Online)
repository_id_str
spelling The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)A descrição formal da qualidade de dados publicados na Web: análise do Data Quality Vocabulary (DQV)qualidade de dados avaliação de qualidadeDQVdata qualityquality assessment DQVThe quality assessment process plays an important role in the reuse of data made available on the Web. To ensure the use and reuse of these data, it is necessary to formally describe them in a way that computational agents can understand. One of the possibilities to make this description viable is the Data Quality Vocabulary, elaborated by the World Wide Web Consortium. The objective was to verify the impact of the Data Quality Vocabulary in the process of formal description of the quality of data published on the Web, analyzing the objectives, characteristics, and structure of the vocabulary. The research has an exploratory and descriptive character, adopting as a method a study of the official documentation published by the consortium. As a result, an overview of the scenario that led to the development of the vocabulary was obtained, its structure was presented and its potential application was discussed. It is concluded that the Data Quality Vocabulary provides a general and customizable descriptive structure for providing the results of the data quality assessment process, which allows these results to be shared by its providers. It also allows the community to participate in the evaluation process and formally share the results obtained, thus reducing rework. It is also concluded that the vocabulary contributes to the reuse of data in the context of the Web by facilitating the use of automatic and semi-automatic tools in the evaluation and selection of data sources for the application. O processo de avaliação de qualidade desempenha um papel importante na reutilização dos dados disponibilizados na Web. Para garantir o uso e reuso desses dados faz-se necessária à sua descrição formal, de maneira compreensível à agentes computacionais. Uma das possibilidades para viabilizar essa descrição é o Data Quality Vocabulary, elaborado pelo Word Wide Web Consortium.  Objetivou-se verificar o impacto do Data Quality Vocabulary no processo de descrição formal da qualidade de dados publicados na Web, analisando os objetivos, características e a estrutura do vocabulário. A pesquisa possuí um caráter exploratório e descritivo, adotando como método um estudo da documentação oficial publicada pelo consórcio. Como resultados obteve-se um panorama do cenário que levou ao desenvolvimento do vocabulário, foi apresentada sua estrutura e discutido o seu potencial de aplicação. Conclui-se que o Data Quality Vocabulary disponibiliza uma estrutura descritiva geral e customizável para o fornecimento de resultados do processo de avaliação de qualidade de dados, o que permite que esses resultados sejam compartilhados pelos seus fornecedores. Permite ainda que a comunidade participe do processo de avaliação e compartilhe os resultados obtidos de maneira formal, diminuindo assim o retrabalho. Conclui-se ainda que o vocabulário contribui para o reuso de dados no contexto da Web ao facilitar o uso de ferramentas automáticas e semiautomáticas no processo de avaliação e seleção de fontes de dados para a aplicaçãoUniversidade Federal do Rio Grande do Sul, Faculdade de Biblioteconomia e Comunicação, Programa de Pós-Graduação em Ciência da Informação (Porto Alegre/RS)2023-10-05info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionAvaliado por Paresapplication/pdfapplication/pdfapplication/pdfhttps://seer.ufrgs.br/index.php/EmQuestao/article/view/12941510.1590/1808-5245.29.129415Em Questão; Vol. 29 (2023)Em Questão; Vol. 29 (2023)Em Questão; v. 29 (2023)1808-52451807-8893reponame:Em Questão (Online)instname:Universidade Federal do Rio Grande do Sul (UFRGS)instacron:UFRGSporhttps://seer.ufrgs.br/index.php/EmQuestao/article/view/129415/89762https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415/89765https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415/89766Copyright (c) 2022 Ananda Fernanda de Jesus, José Eduardo Santarem Segundohttps://creativecommons.org/licenses/by/4.0info:eu-repo/semantics/openAccessJesus, Ananda Fernanda deSantarem Segundo, José Eduardo 2023-12-07T14:02:23Zoai:seer.ufrgs.br:article/129415Revistahttps://seer.ufrgs.br/emquestao/PUBhttps://seer.ufrgs.br/EmQuestao/oaiemquestao@ufrgs.br||emquestao@ufrgs.br1808-52451807-8893opendoar:2023-12-07T14:02:23Em Questão (Online) - Universidade Federal do Rio Grande do Sul (UFRGS)false
dc.title.none.fl_str_mv The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
A descrição formal da qualidade de dados publicados na Web: análise do Data Quality Vocabulary (DQV)
title The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
spellingShingle The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
Jesus, Ananda Fernanda de
qualidade de dados
avaliação de qualidade
DQV
data quality
quality assessment
DQV
title_short The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
title_full The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
title_fullStr The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
title_full_unstemmed The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
title_sort The formal description of the quality data published on the Web: analysis of the Data Quality Vocabulary (DQV)
author Jesus, Ananda Fernanda de
author_facet Jesus, Ananda Fernanda de
Santarem Segundo, José Eduardo
author_role author
author2 Santarem Segundo, José Eduardo
author2_role author
dc.contributor.author.fl_str_mv Jesus, Ananda Fernanda de
Santarem Segundo, José Eduardo
dc.subject.por.fl_str_mv qualidade de dados
avaliação de qualidade
DQV
data quality
quality assessment
DQV
topic qualidade de dados
avaliação de qualidade
DQV
data quality
quality assessment
DQV
description The quality assessment process plays an important role in the reuse of data made available on the Web. To ensure the use and reuse of these data, it is necessary to formally describe them in a way that computational agents can understand. One of the possibilities to make this description viable is the Data Quality Vocabulary, elaborated by the World Wide Web Consortium. The objective was to verify the impact of the Data Quality Vocabulary in the process of formal description of the quality of data published on the Web, analyzing the objectives, characteristics, and structure of the vocabulary. The research has an exploratory and descriptive character, adopting as a method a study of the official documentation published by the consortium. As a result, an overview of the scenario that led to the development of the vocabulary was obtained, its structure was presented and its potential application was discussed. It is concluded that the Data Quality Vocabulary provides a general and customizable descriptive structure for providing the results of the data quality assessment process, which allows these results to be shared by its providers. It also allows the community to participate in the evaluation process and formally share the results obtained, thus reducing rework. It is also concluded that the vocabulary contributes to the reuse of data in the context of the Web by facilitating the use of automatic and semi-automatic tools in the evaluation and selection of data sources for the application. 
publishDate 2023
dc.date.none.fl_str_mv 2023-10-05
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
Avaliado por Pares
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415
10.1590/1808-5245.29.129415
url https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415
identifier_str_mv 10.1590/1808-5245.29.129415
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415/89762
https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415/89765
https://seer.ufrgs.br/index.php/EmQuestao/article/view/129415/89766
dc.rights.driver.fl_str_mv Copyright (c) 2022 Ananda Fernanda de Jesus, José Eduardo Santarem Segundo
https://creativecommons.org/licenses/by/4.0
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2022 Ananda Fernanda de Jesus, José Eduardo Santarem Segundo
https://creativecommons.org/licenses/by/4.0
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
application/pdf
application/pdf
dc.publisher.none.fl_str_mv Universidade Federal do Rio Grande do Sul, Faculdade de Biblioteconomia e Comunicação, Programa de Pós-Graduação em Ciência da Informação (Porto Alegre/RS)
publisher.none.fl_str_mv Universidade Federal do Rio Grande do Sul, Faculdade de Biblioteconomia e Comunicação, Programa de Pós-Graduação em Ciência da Informação (Porto Alegre/RS)
dc.source.none.fl_str_mv Em Questão; Vol. 29 (2023)
Em Questão; Vol. 29 (2023)
Em Questão; v. 29 (2023)
1808-5245
1807-8893
reponame:Em Questão (Online)
instname:Universidade Federal do Rio Grande do Sul (UFRGS)
instacron:UFRGS
instname_str Universidade Federal do Rio Grande do Sul (UFRGS)
instacron_str UFRGS
institution UFRGS
reponame_str Em Questão (Online)
collection Em Questão (Online)
repository.name.fl_str_mv Em Questão (Online) - Universidade Federal do Rio Grande do Sul (UFRGS)
repository.mail.fl_str_mv emquestao@ufrgs.br||emquestao@ufrgs.br
_version_ 1789438637056720896