Analysis of datasets recovery process in government repositories
Autor(a) principal: | |
---|---|
Data de Publicação: | 2015 |
Outros Autores: | , |
Tipo de documento: | Artigo |
Idioma: | por |
Título da fonte: | InCID |
Texto Completo: | https://www.revistas.usp.br/incid/article/view/73496 |
Resumo: | The present study aims to identify, in the recovery stage, attributes available in moments when a user conducts datasets researches in government repositories, based on the Life Cycle Data Model for Information Science (CVD-CI) proposed by Sant'Ana (2013). The research was bounded out conducting searches for data sets offered through the search engine available on the site Brazilian Open Data Portal, using the terms 'education' and 'Health'. The use of the term 'health' resulted in the recovery of 14 datasets and the term 'education' recovered 23, totaling 37 datasets. Analysis of these datasets was divided into two stages: the first were identified which attributes were available on page containing the results of searches from terms used. The second step was to identify the attributes available on the pages for each datasets retrieved in the search. As a result, it was built two tables: the first identifies the attributes that are available on search results pages that were generated by site search engine. The second identifies the attributes available in each dataset retrieved by the search. The results showed that in the first stage, there is no difference in the attributes available in the search results by both terms. However, in the second stage there were discrepancies in the attributes identified in each dataset. |
id |
USP-15_743a3d0b30adf2583fbaeef45958bdcd |
---|---|
oai_identifier_str |
oai:revistas.usp.br:article/73496 |
network_acronym_str |
USP-15 |
network_name_str |
InCID |
repository_id_str |
|
spelling |
Analysis of datasets recovery process in government repositoriesAnálise do processo de recuperação de conjuntos de dados em repositórios governamentaisCiclo de Vida dos DadosColeta de DadosDados Abertos GovernamentaisRepositório GovernamentalData Life CycleData GatheringOpen Government DataGovernmental RepositoryThe present study aims to identify, in the recovery stage, attributes available in moments when a user conducts datasets researches in government repositories, based on the Life Cycle Data Model for Information Science (CVD-CI) proposed by Sant'Ana (2013). The research was bounded out conducting searches for data sets offered through the search engine available on the site Brazilian Open Data Portal, using the terms 'education' and 'Health'. The use of the term 'health' resulted in the recovery of 14 datasets and the term 'education' recovered 23, totaling 37 datasets. Analysis of these datasets was divided into two stages: the first were identified which attributes were available on page containing the results of searches from terms used. The second step was to identify the attributes available on the pages for each datasets retrieved in the search. As a result, it was built two tables: the first identifies the attributes that are available on search results pages that were generated by site search engine. The second identifies the attributes available in each dataset retrieved by the search. The results showed that in the first stage, there is no difference in the attributes available in the search results by both terms. However, in the second stage there were discrepancies in the attributes identified in each dataset.O presente trabalho tem como objetivo identificar, na fase de recuperação, atributos disponíveis nos momentos em que se realiza pesquisas por conjuntos de dados em repositórios governamentais, a partir do modelo de Ciclo de Vida de Dados para a Ciência da Informação (CVD-CI) proposto por Sant'Ana (2013). A pesquisa fora delimitada a realização de buscas por conjuntos de dados através do mecanismo oferecido pelo sítio Portal Brasileiro de Dados Abertos, utilizando os termos 'Educação' e 'Saúde'. O uso do termo 'Saúde' resultou na recuperação de 14 conjunto de dados e o termo 'Educação' recuperou 23, totalizando 37 conjuntos de dados. A análise destes conjuntos de dados dividiu-se em duas etapas: na primeira foram identificados quais atributos estavam disponíveis na página contendo o resultado das buscas a partir termos utilizados. A segunda etapa consistiu em identificar os atributos disponíveis nas páginas referentes a cada um dos conjuntos de dados recuperados na busca. Como resultado, fora construído dois quadros: o primeiro identifica os atributos que estão disponíveis nas páginas com resultados da pesquisa pelo mecanismo de busca do site; o segundo, identifica os atributos disponíveis em cada conjunto de dados recuperado pela pesquisa. Os resultados demonstraram que na primeira etapa, não há diferença nos atributos disponíveis nos resultados de busca por ambos os termos. Entretanto, na segunda etapa houve discrepâncias nos atributos identificados em cada conjunto de dados.Universidade de São Paulo. Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto2015-04-10info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersionapplication/pdfhttps://www.revistas.usp.br/incid/article/view/7349610.11606/issn.2178-2075.v6i1p38-56InCID: Revista de Ciência da Informação e Documentação; v. 6 n. 1 (2015); 38-562178-2075reponame:InCIDinstname:Universidade de São Paulo (USP)instacron:USPporhttps://www.revistas.usp.br/incid/article/view/73496/96247Copyright (c) 2015 InCID: Revista de Ciência da Informação e Documentaçãoinfo:eu-repo/semantics/openAccessRodrigues, Fernando de AssisSant'Ana, Ricardo César GonçalvesFerneda, Edberto2015-04-14T11:41:05ZRevistahttp://revistas.ffclrp.usp.br/incidPUB |
dc.title.none.fl_str_mv |
Analysis of datasets recovery process in government repositories Análise do processo de recuperação de conjuntos de dados em repositórios governamentais |
title |
Analysis of datasets recovery process in government repositories |
spellingShingle |
Analysis of datasets recovery process in government repositories Rodrigues, Fernando de Assis Ciclo de Vida dos Dados Coleta de Dados Dados Abertos Governamentais Repositório Governamental Data Life Cycle Data Gathering Open Government Data Governmental Repository |
title_short |
Analysis of datasets recovery process in government repositories |
title_full |
Analysis of datasets recovery process in government repositories |
title_fullStr |
Analysis of datasets recovery process in government repositories |
title_full_unstemmed |
Analysis of datasets recovery process in government repositories |
title_sort |
Analysis of datasets recovery process in government repositories |
author |
Rodrigues, Fernando de Assis |
author_facet |
Rodrigues, Fernando de Assis Sant'Ana, Ricardo César Gonçalves Ferneda, Edberto |
author_role |
author |
author2 |
Sant'Ana, Ricardo César Gonçalves Ferneda, Edberto |
author2_role |
author author |
dc.contributor.author.fl_str_mv |
Rodrigues, Fernando de Assis Sant'Ana, Ricardo César Gonçalves Ferneda, Edberto |
dc.subject.por.fl_str_mv |
Ciclo de Vida dos Dados Coleta de Dados Dados Abertos Governamentais Repositório Governamental Data Life Cycle Data Gathering Open Government Data Governmental Repository |
topic |
Ciclo de Vida dos Dados Coleta de Dados Dados Abertos Governamentais Repositório Governamental Data Life Cycle Data Gathering Open Government Data Governmental Repository |
description |
The present study aims to identify, in the recovery stage, attributes available in moments when a user conducts datasets researches in government repositories, based on the Life Cycle Data Model for Information Science (CVD-CI) proposed by Sant'Ana (2013). The research was bounded out conducting searches for data sets offered through the search engine available on the site Brazilian Open Data Portal, using the terms 'education' and 'Health'. The use of the term 'health' resulted in the recovery of 14 datasets and the term 'education' recovered 23, totaling 37 datasets. Analysis of these datasets was divided into two stages: the first were identified which attributes were available on page containing the results of searches from terms used. The second step was to identify the attributes available on the pages for each datasets retrieved in the search. As a result, it was built two tables: the first identifies the attributes that are available on search results pages that were generated by site search engine. The second identifies the attributes available in each dataset retrieved by the search. The results showed that in the first stage, there is no difference in the attributes available in the search results by both terms. However, in the second stage there were discrepancies in the attributes identified in each dataset. |
publishDate |
2015 |
dc.date.none.fl_str_mv |
2015-04-10 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
https://www.revistas.usp.br/incid/article/view/73496 10.11606/issn.2178-2075.v6i1p38-56 |
url |
https://www.revistas.usp.br/incid/article/view/73496 |
identifier_str_mv |
10.11606/issn.2178-2075.v6i1p38-56 |
dc.language.iso.fl_str_mv |
por |
language |
por |
dc.relation.none.fl_str_mv |
https://www.revistas.usp.br/incid/article/view/73496/96247 |
dc.rights.driver.fl_str_mv |
Copyright (c) 2015 InCID: Revista de Ciência da Informação e Documentação info:eu-repo/semantics/openAccess |
rights_invalid_str_mv |
Copyright (c) 2015 InCID: Revista de Ciência da Informação e Documentação |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
application/pdf |
dc.publisher.none.fl_str_mv |
Universidade de São Paulo. Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto |
publisher.none.fl_str_mv |
Universidade de São Paulo. Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto |
dc.source.none.fl_str_mv |
InCID: Revista de Ciência da Informação e Documentação; v. 6 n. 1 (2015); 38-56 2178-2075 reponame:InCID instname:Universidade de São Paulo (USP) instacron:USP |
instname_str |
Universidade de São Paulo (USP) |
instacron_str |
USP |
institution |
USP |
reponame_str |
InCID |
collection |
InCID |
repository.name.fl_str_mv |
|
repository.mail.fl_str_mv |
|
_version_ |
1787713838696628224 |