An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil

Detalhes bibliográficos
Autor(a) principal: Caten,Alexandre ten
Data de Publicação: 2013
Outros Autores: Dalmolin,Ricardo Simão Diniz, Pedron,Fabrício de Araújo, Ruiz,Luis Fernando Chimelo, Silva,Carlos Antônio da
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Revista Brasileira de Ciência do Solo (Online)
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007
Resumo: Digital information generates the possibility of a high degree of redundancy in the data available for fitting predictive models used for Digital Soil Mapping (DSM). Among these models, the Decision Tree (DT) technique has been increasingly applied due to its capacity of dealing with large datasets. The purpose of this study was to evaluate the impact of the data volume used to generate the DT models on the quality of soil maps. An area of 889.33 km² was chosen in the Northern region of the State of Rio Grande do Sul. The soil-landscape relationship was obtained from reambulation of the studied area and the alignment of the units in the 1:50,000 scale topographic mapping. Six predictive covariates linked to the factors soil formation, relief and organisms, together with data sets of 1, 3, 5, 10, 15, 20 and 25 % of the total data volume, were used to generate the predictive DT models in the data mining program Waikato Environment for Knowledge Analysis (WEKA). In this study, sample densities below 5 % resulted in models with lower power of capturing the complexity of the spatial distribution of the soil in the study area. The relation between the data volume to be handled and the predictive capacity of the models was best for samples between 5 and 15 %. For the models based on these sample densities, the collected field data indicated an accuracy of predictive mapping close to 70 %.
id SBCS-1_32cc4482f87639f0cd17f990fe191eaf
oai_identifier_str oai:scielo:S0100-06832013000200007
network_acronym_str SBCS-1
network_name_str Revista Brasileira de Ciência do Solo (Online)
repository_id_str
spelling An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazildecision treepedometrysoil surveymapping unitDigital information generates the possibility of a high degree of redundancy in the data available for fitting predictive models used for Digital Soil Mapping (DSM). Among these models, the Decision Tree (DT) technique has been increasingly applied due to its capacity of dealing with large datasets. The purpose of this study was to evaluate the impact of the data volume used to generate the DT models on the quality of soil maps. An area of 889.33 km² was chosen in the Northern region of the State of Rio Grande do Sul. The soil-landscape relationship was obtained from reambulation of the studied area and the alignment of the units in the 1:50,000 scale topographic mapping. Six predictive covariates linked to the factors soil formation, relief and organisms, together with data sets of 1, 3, 5, 10, 15, 20 and 25 % of the total data volume, were used to generate the predictive DT models in the data mining program Waikato Environment for Knowledge Analysis (WEKA). In this study, sample densities below 5 % resulted in models with lower power of capturing the complexity of the spatial distribution of the soil in the study area. The relation between the data volume to be handled and the predictive capacity of the models was best for samples between 5 and 15 %. For the models based on these sample densities, the collected field data indicated an accuracy of predictive mapping close to 70 %.Sociedade Brasileira de Ciência do Solo2013-04-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007Revista Brasileira de Ciência do Solo v.37 n.2 2013reponame:Revista Brasileira de Ciência do Solo (Online)instname:Sociedade Brasileira de Ciência do Solo (SBCS)instacron:SBCS10.1590/S0100-06832013000200007info:eu-repo/semantics/openAccessCaten,Alexandre tenDalmolin,Ricardo Simão DinizPedron,Fabrício de AraújoRuiz,Luis Fernando ChimeloSilva,Carlos Antônio daeng2013-06-03T00:00:00Zoai:scielo:S0100-06832013000200007Revistahttp://www.scielo.br/scielo.php?script=sci_serial&pid=0100-0683&lng=es&nrm=isohttps://old.scielo.br/oai/scielo-oai.php||sbcs@ufv.br1806-96570100-0683opendoar:2013-06-03T00:00Revista Brasileira de Ciência do Solo (Online) - Sociedade Brasileira de Ciência do Solo (SBCS)false
dc.title.none.fl_str_mv An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
title An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
spellingShingle An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
Caten,Alexandre ten
decision tree
pedometry
soil survey
mapping unit
title_short An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
title_full An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
title_fullStr An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
title_full_unstemmed An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
title_sort An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
author Caten,Alexandre ten
author_facet Caten,Alexandre ten
Dalmolin,Ricardo Simão Diniz
Pedron,Fabrício de Araújo
Ruiz,Luis Fernando Chimelo
Silva,Carlos Antônio da
author_role author
author2 Dalmolin,Ricardo Simão Diniz
Pedron,Fabrício de Araújo
Ruiz,Luis Fernando Chimelo
Silva,Carlos Antônio da
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Caten,Alexandre ten
Dalmolin,Ricardo Simão Diniz
Pedron,Fabrício de Araújo
Ruiz,Luis Fernando Chimelo
Silva,Carlos Antônio da
dc.subject.por.fl_str_mv decision tree
pedometry
soil survey
mapping unit
topic decision tree
pedometry
soil survey
mapping unit
description Digital information generates the possibility of a high degree of redundancy in the data available for fitting predictive models used for Digital Soil Mapping (DSM). Among these models, the Decision Tree (DT) technique has been increasingly applied due to its capacity of dealing with large datasets. The purpose of this study was to evaluate the impact of the data volume used to generate the DT models on the quality of soil maps. An area of 889.33 km² was chosen in the Northern region of the State of Rio Grande do Sul. The soil-landscape relationship was obtained from reambulation of the studied area and the alignment of the units in the 1:50,000 scale topographic mapping. Six predictive covariates linked to the factors soil formation, relief and organisms, together with data sets of 1, 3, 5, 10, 15, 20 and 25 % of the total data volume, were used to generate the predictive DT models in the data mining program Waikato Environment for Knowledge Analysis (WEKA). In this study, sample densities below 5 % resulted in models with lower power of capturing the complexity of the spatial distribution of the soil in the study area. The relation between the data volume to be handled and the predictive capacity of the models was best for samples between 5 and 15 %. For the models based on these sample densities, the collected field data indicated an accuracy of predictive mapping close to 70 %.
publishDate 2013
dc.date.none.fl_str_mv 2013-04-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/S0100-06832013000200007
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Brasileira de Ciência do Solo
publisher.none.fl_str_mv Sociedade Brasileira de Ciência do Solo
dc.source.none.fl_str_mv Revista Brasileira de Ciência do Solo v.37 n.2 2013
reponame:Revista Brasileira de Ciência do Solo (Online)
instname:Sociedade Brasileira de Ciência do Solo (SBCS)
instacron:SBCS
instname_str Sociedade Brasileira de Ciência do Solo (SBCS)
instacron_str SBCS
institution SBCS
reponame_str Revista Brasileira de Ciência do Solo (Online)
collection Revista Brasileira de Ciência do Solo (Online)
repository.name.fl_str_mv Revista Brasileira de Ciência do Solo (Online) - Sociedade Brasileira de Ciência do Solo (SBCS)
repository.mail.fl_str_mv ||sbcs@ufv.br
_version_ 1752126518534864896