Web data mining: validity of data from google earth for food retail evaluation

Detalhes bibliográficos
Autor(a) principal: Mariana Carvalhode Menezes
Data de Publicação: 2020
Outros Autores: Letícia de Oliveira Cardoso, Vanderlei Pascoal de Matos, Maria de Fátima de Pina, Bruna Vieira de Lima Costa, Larissa Loures Mendes, Milene Cristine Pessoa, Paulo Roberto Borges de Souza-Junior, Amélia Augusta de Lima Friche, Waleska Teixeira Caiaffa
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da UFMG
Texto Completo: http://hdl.handle.net/1843/66672
Resumo: To overcome the challenge of obtaining accurate data on community food retail, we developed an innovative tool to automatically capture food retail data from Google Earth (GE). The proposed method is relevant to non-commercial use or scholarly purposes. We aimed to test the validity of web sources data for the assessment of community food retail environment by comparison to ground-truth observations (gold standard). A secondary aim was to test whether validity differs by type of food outlet and socioeconomic status (SES). The study area included a sample of 300 census tracts stratified by SES in two of the largest cities in Brazil, Rio de Janeiro and Belo Horizonte. The GE web service was used to develop a tool for automatic acquisition of food retail data through the generation of a regular grid of points. To test its validity, this data was compared with the ground-truth data. Compared to the 856 outlets identified in 285 census tracts by the ground-truth method, the GE interface identified 731 outlets. In both cities, the GE interface scored moderate to excellent compared to the ground-truth data across all of the validity measures: sensitivity, specificity, positive predictive value, negative predictive value and accuracy (ranging from 66.3 to 100%). The validity did not differ by SES strata. Supermarkets, convenience stores and restaurants yielded better results than other store types. To our knowledge, this research is the first to investigate using GE as a tool to capture community food retail data. Our results suggest that the GE interface could be used to measure the community food environment. Validity was satisfactory for different SES areas and types of outlets.
id UFMG_4b576cc096c5a06ef41377b6c50389e2
oai_identifier_str oai:repositorio.ufmg.br:1843/66672
network_acronym_str UFMG
network_name_str Repositório Institucional da UFMG
repository_id_str
spelling Web data mining: validity of data from google earth for food retail evaluationCuradoria de DadosAlimentação no Contexto UrbanoFatores SocioeconômicosTo overcome the challenge of obtaining accurate data on community food retail, we developed an innovative tool to automatically capture food retail data from Google Earth (GE). The proposed method is relevant to non-commercial use or scholarly purposes. We aimed to test the validity of web sources data for the assessment of community food retail environment by comparison to ground-truth observations (gold standard). A secondary aim was to test whether validity differs by type of food outlet and socioeconomic status (SES). The study area included a sample of 300 census tracts stratified by SES in two of the largest cities in Brazil, Rio de Janeiro and Belo Horizonte. The GE web service was used to develop a tool for automatic acquisition of food retail data through the generation of a regular grid of points. To test its validity, this data was compared with the ground-truth data. Compared to the 856 outlets identified in 285 census tracts by the ground-truth method, the GE interface identified 731 outlets. In both cities, the GE interface scored moderate to excellent compared to the ground-truth data across all of the validity measures: sensitivity, specificity, positive predictive value, negative predictive value and accuracy (ranging from 66.3 to 100%). The validity did not differ by SES strata. Supermarkets, convenience stores and restaurants yielded better results than other store types. To our knowledge, this research is the first to investigate using GE as a tool to capture community food retail data. Our results suggest that the GE interface could be used to measure the community food environment. Validity was satisfactory for different SES areas and types of outlets.CNPq - Conselho Nacional de Desenvolvimento Científico e TecnológicoOutra AgênciaUniversidade Federal de Minas GeraisBrasilENF - DEPARTAMENTO DE NUTRIÇÃOUFMG2024-03-27T19:57:21Z2024-03-27T19:57:21Z2020-11-23info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlepdfapplication/pdf10.1007/s11524-020-00495-x1468-2869http://hdl.handle.net/1843/66672engJournal of Urban HealthMariana Carvalhode MenezesLetícia de Oliveira CardosoVanderlei Pascoal de MatosMaria de Fátima de PinaBruna Vieira de Lima CostaLarissa Loures MendesMilene Cristine PessoaPaulo Roberto Borges de Souza-JuniorAmélia Augusta de Lima FricheWaleska Teixeira Caiaffainfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da UFMGinstname:Universidade Federal de Minas Gerais (UFMG)instacron:UFMG2024-03-27T19:57:21Zoai:repositorio.ufmg.br:1843/66672Repositório InstitucionalPUBhttps://repositorio.ufmg.br/oairepositorio@ufmg.bropendoar:2024-03-27T19:57:21Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)false
dc.title.none.fl_str_mv Web data mining: validity of data from google earth for food retail evaluation
title Web data mining: validity of data from google earth for food retail evaluation
spellingShingle Web data mining: validity of data from google earth for food retail evaluation
Mariana Carvalhode Menezes
Curadoria de Dados
Alimentação no Contexto Urbano
Fatores Socioeconômicos
title_short Web data mining: validity of data from google earth for food retail evaluation
title_full Web data mining: validity of data from google earth for food retail evaluation
title_fullStr Web data mining: validity of data from google earth for food retail evaluation
title_full_unstemmed Web data mining: validity of data from google earth for food retail evaluation
title_sort Web data mining: validity of data from google earth for food retail evaluation
author Mariana Carvalhode Menezes
author_facet Mariana Carvalhode Menezes
Letícia de Oliveira Cardoso
Vanderlei Pascoal de Matos
Maria de Fátima de Pina
Bruna Vieira de Lima Costa
Larissa Loures Mendes
Milene Cristine Pessoa
Paulo Roberto Borges de Souza-Junior
Amélia Augusta de Lima Friche
Waleska Teixeira Caiaffa
author_role author
author2 Letícia de Oliveira Cardoso
Vanderlei Pascoal de Matos
Maria de Fátima de Pina
Bruna Vieira de Lima Costa
Larissa Loures Mendes
Milene Cristine Pessoa
Paulo Roberto Borges de Souza-Junior
Amélia Augusta de Lima Friche
Waleska Teixeira Caiaffa
author2_role author
author
author
author
author
author
author
author
author
dc.contributor.author.fl_str_mv Mariana Carvalhode Menezes
Letícia de Oliveira Cardoso
Vanderlei Pascoal de Matos
Maria de Fátima de Pina
Bruna Vieira de Lima Costa
Larissa Loures Mendes
Milene Cristine Pessoa
Paulo Roberto Borges de Souza-Junior
Amélia Augusta de Lima Friche
Waleska Teixeira Caiaffa
dc.subject.por.fl_str_mv Curadoria de Dados
Alimentação no Contexto Urbano
Fatores Socioeconômicos
topic Curadoria de Dados
Alimentação no Contexto Urbano
Fatores Socioeconômicos
description To overcome the challenge of obtaining accurate data on community food retail, we developed an innovative tool to automatically capture food retail data from Google Earth (GE). The proposed method is relevant to non-commercial use or scholarly purposes. We aimed to test the validity of web sources data for the assessment of community food retail environment by comparison to ground-truth observations (gold standard). A secondary aim was to test whether validity differs by type of food outlet and socioeconomic status (SES). The study area included a sample of 300 census tracts stratified by SES in two of the largest cities in Brazil, Rio de Janeiro and Belo Horizonte. The GE web service was used to develop a tool for automatic acquisition of food retail data through the generation of a regular grid of points. To test its validity, this data was compared with the ground-truth data. Compared to the 856 outlets identified in 285 census tracts by the ground-truth method, the GE interface identified 731 outlets. In both cities, the GE interface scored moderate to excellent compared to the ground-truth data across all of the validity measures: sensitivity, specificity, positive predictive value, negative predictive value and accuracy (ranging from 66.3 to 100%). The validity did not differ by SES strata. Supermarkets, convenience stores and restaurants yielded better results than other store types. To our knowledge, this research is the first to investigate using GE as a tool to capture community food retail data. Our results suggest that the GE interface could be used to measure the community food environment. Validity was satisfactory for different SES areas and types of outlets.
publishDate 2020
dc.date.none.fl_str_mv 2020-11-23
2024-03-27T19:57:21Z
2024-03-27T19:57:21Z
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv 10.1007/s11524-020-00495-x
1468-2869
http://hdl.handle.net/1843/66672
identifier_str_mv 10.1007/s11524-020-00495-x
1468-2869
url http://hdl.handle.net/1843/66672
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv Journal of Urban Health
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv pdf
application/pdf
dc.publisher.none.fl_str_mv Universidade Federal de Minas Gerais
Brasil
ENF - DEPARTAMENTO DE NUTRIÇÃO
UFMG
publisher.none.fl_str_mv Universidade Federal de Minas Gerais
Brasil
ENF - DEPARTAMENTO DE NUTRIÇÃO
UFMG
dc.source.none.fl_str_mv reponame:Repositório Institucional da UFMG
instname:Universidade Federal de Minas Gerais (UFMG)
instacron:UFMG
instname_str Universidade Federal de Minas Gerais (UFMG)
instacron_str UFMG
institution UFMG
reponame_str Repositório Institucional da UFMG
collection Repositório Institucional da UFMG
repository.name.fl_str_mv Repositório Institucional da UFMG - Universidade Federal de Minas Gerais (UFMG)
repository.mail.fl_str_mv repositorio@ufmg.br
_version_ 1823248354933473280