Analysis of datasets recovery process in government repositories

Bibliographic details
Main author: Rodrigues, Fernando de Assis
Publication date: 2015
Other authors: Sant'Ana, Ricardo César Gonçalves; Ferneda, Edberto
Document type: Article
Language: por
Source title: InCID
Full text: https://www.revistas.usp.br/incid/article/view/73496
Abstract: This study aims to identify, for the recovery stage, the attributes available when a user searches for datasets in government repositories, based on the Data Life Cycle model for Information Science (CVD-CI) proposed by Sant'Ana (2013). The research was limited to searches for datasets through the search engine of the Brazilian Open Data Portal, using the terms 'education' and 'health'. The term 'health' retrieved 14 datasets and the term 'education' retrieved 23, for a total of 37 datasets. The analysis of these datasets was divided into two stages. The first stage identified which attributes were available on the search results page for each term. The second stage identified the attributes available on the page of each dataset retrieved by the search. As a result, two tables were built: the first lists the attributes available on the search results pages generated by the site's search engine; the second lists the attributes available for each dataset retrieved by the search. The results showed that, in the first stage, the attributes available in the search results did not differ between the two terms. In the second stage, however, there were discrepancies among the attributes identified for each dataset.
id USP-15_743a3d0b30adf2583fbaeef45958bdcd
oai_identifier_str oai:revistas.usp.br:article/73496
network_acronym_str USP-15
network_name_str InCID
repository_id_str
dc.title.none.fl_str_mv Analysis of datasets recovery process in government repositories
Análise do processo de recuperação de conjuntos de dados em repositórios governamentais
title Analysis of datasets recovery process in government repositories
spellingShingle Analysis of datasets recovery process in government repositories
Rodrigues, Fernando de Assis
Ciclo de Vida dos Dados
Coleta de Dados
Dados Abertos Governamentais
Repositório Governamental
Data Life Cycle
Data Gathering
Open Government Data
Governmental Repository
title_short Analysis of datasets recovery process in government repositories
title_full Analysis of datasets recovery process in government repositories
title_fullStr Analysis of datasets recovery process in government repositories
title_full_unstemmed Analysis of datasets recovery process in government repositories
title_sort Analysis of datasets recovery process in government repositories
author Rodrigues, Fernando de Assis
author_facet Rodrigues, Fernando de Assis
Sant'Ana, Ricardo César Gonçalves
Ferneda, Edberto
author_role author
author2 Sant'Ana, Ricardo César Gonçalves
Ferneda, Edberto
author2_role author
author
dc.contributor.author.fl_str_mv Rodrigues, Fernando de Assis
Sant'Ana, Ricardo César Gonçalves
Ferneda, Edberto
dc.subject.por.fl_str_mv Ciclo de Vida dos Dados
Coleta de Dados
Dados Abertos Governamentais
Repositório Governamental
Data Life Cycle
Data Gathering
Open Government Data
Governmental Repository
topic Ciclo de Vida dos Dados
Coleta de Dados
Dados Abertos Governamentais
Repositório Governamental
Data Life Cycle
Data Gathering
Open Government Data
Governmental Repository
description This study aims to identify, for the recovery stage, the attributes available when a user searches for datasets in government repositories, based on the Data Life Cycle model for Information Science (CVD-CI) proposed by Sant'Ana (2013). The research was limited to searches for datasets through the search engine of the Brazilian Open Data Portal, using the terms 'education' and 'health'. The term 'health' retrieved 14 datasets and the term 'education' retrieved 23, for a total of 37 datasets. The analysis of these datasets was divided into two stages. The first stage identified which attributes were available on the search results page for each term. The second stage identified the attributes available on the page of each dataset retrieved by the search. As a result, two tables were built: the first lists the attributes available on the search results pages generated by the site's search engine; the second lists the attributes available for each dataset retrieved by the search. The results showed that, in the first stage, the attributes available in the search results did not differ between the two terms. In the second stage, however, there were discrepancies among the attributes identified for each dataset.
publishDate 2015
dc.date.none.fl_str_mv 2015-04-10
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.revistas.usp.br/incid/article/view/73496
10.11606/issn.2178-2075.v6i1p38-56
url https://www.revistas.usp.br/incid/article/view/73496
identifier_str_mv 10.11606/issn.2178-2075.v6i1p38-56
dc.language.iso.fl_str_mv por
language por
dc.relation.none.fl_str_mv https://www.revistas.usp.br/incid/article/view/73496/96247
dc.rights.driver.fl_str_mv Copyright (c) 2015 InCID: Revista de Ciência da Informação e Documentação
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Copyright (c) 2015 InCID: Revista de Ciência da Informação e Documentação
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.publisher.none.fl_str_mv Universidade de São Paulo. Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto
publisher.none.fl_str_mv Universidade de São Paulo. Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto
dc.source.none.fl_str_mv InCID: Revista de Ciência da Informação e Documentação; v. 6 n. 1 (2015); 38-56
2178-2075
reponame:InCID
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str InCID
collection InCID
repository.name.fl_str_mv
repository.mail.fl_str_mv
_version_ 1787713838696628224