An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil
Autor(a) principal: | |
---|---|
Data de Publicação: | 2013 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Revista Brasileira de Ciência do Solo (Online) |
Texto Completo: | http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007 |
Resumo: | Digital information generates the possibility of a high degree of redundancy in the data available for fitting predictive models used for Digital Soil Mapping (DSM). Among these models, the Decision Tree (DT) technique has been increasingly applied due to its capacity of dealing with large datasets. The purpose of this study was to evaluate the impact of the data volume used to generate the DT models on the quality of soil maps. An area of 889.33 km² was chosen in the Northern region of the State of Rio Grande do Sul. The soil-landscape relationship was obtained from reambulation of the studied area and the alignment of the units in the 1:50,000 scale topographic mapping. Six predictive covariates linked to the factors soil formation, relief and organisms, together with data sets of 1, 3, 5, 10, 15, 20 and 25 % of the total data volume, were used to generate the predictive DT models in the data mining program Waikato Environment for Knowledge Analysis (WEKA). In this study, sample densities below 5 % resulted in models with lower power of capturing the complexity of the spatial distribution of the soil in the study area. The relation between the data volume to be handled and the predictive capacity of the models was best for samples between 5 and 15 %. For the models based on these sample densities, the collected field data indicated an accuracy of predictive mapping close to 70 %. |
id |
SBCS-1_32cc4482f87639f0cd17f990fe191eaf |
---|---|
oai_identifier_str |
oai:scielo:S0100-06832013000200007 |
network_acronym_str |
SBCS-1 |
network_name_str |
Revista Brasileira de Ciência do Solo (Online) |
repository_id_str |
|
spelling |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazildecision treepedometrysoil surveymapping unitDigital information generates the possibility of a high degree of redundancy in the data available for fitting predictive models used for Digital Soil Mapping (DSM). Among these models, the Decision Tree (DT) technique has been increasingly applied due to its capacity of dealing with large datasets. The purpose of this study was to evaluate the impact of the data volume used to generate the DT models on the quality of soil maps. An area of 889.33 km² was chosen in the Northern region of the State of Rio Grande do Sul. The soil-landscape relationship was obtained from reambulation of the studied area and the alignment of the units in the 1:50,000 scale topographic mapping. Six predictive covariates linked to the factors soil formation, relief and organisms, together with data sets of 1, 3, 5, 10, 15, 20 and 25 % of the total data volume, were used to generate the predictive DT models in the data mining program Waikato Environment for Knowledge Analysis (WEKA). In this study, sample densities below 5 % resulted in models with lower power of capturing the complexity of the spatial distribution of the soil in the study area. The relation between the data volume to be handled and the predictive capacity of the models was best for samples between 5 and 15 %. For the models based on these sample densities, the collected field data indicated an accuracy of predictive mapping close to 70 %.Sociedade Brasileira de Ciência do Solo2013-04-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007Revista Brasileira de Ciência do Solo v.37 n.2 2013reponame:Revista Brasileira de Ciência do Solo (Online)instname:Sociedade Brasileira de Ciência do Solo (SBCS)instacron:SBCS10.1590/S0100-06832013000200007info:eu-repo/semantics/openAccessCaten,Alexandre tenDalmolin,Ricardo Simão DinizPedron,Fabrício de AraújoRuiz,Luis Fernando ChimeloSilva,Carlos Antônio daeng2013-06-03T00:00:00Zoai:scielo:S0100-06832013000200007Revistahttp://www.scielo.br/scielo.php?script=sci_serial&pid=0100-0683&lng=es&nrm=isohttps://old.scielo.br/oai/scielo-oai.php||sbcs@ufv.br1806-96570100-0683opendoar:2013-06-03T00:00Revista Brasileira de Ciência do Solo (Online) - Sociedade Brasileira de Ciência do Solo (SBCS)false |
dc.title.none.fl_str_mv |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
title |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
spellingShingle |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil Caten,Alexandre ten decision tree pedometry soil survey mapping unit |
title_short |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
title_full |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
title_fullStr |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
title_full_unstemmed |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
title_sort |
An appropriate data set size for digital soil mapping in Erechim, Rio Grande do Sul, Brazil |
author |
Caten,Alexandre ten |
author_facet |
Caten,Alexandre ten Dalmolin,Ricardo Simão Diniz Pedron,Fabrício de Araújo Ruiz,Luis Fernando Chimelo Silva,Carlos Antônio da |
author_role |
author |
author2 |
Dalmolin,Ricardo Simão Diniz Pedron,Fabrício de Araújo Ruiz,Luis Fernando Chimelo Silva,Carlos Antônio da |
author2_role |
author author author author |
dc.contributor.author.fl_str_mv |
Caten,Alexandre ten Dalmolin,Ricardo Simão Diniz Pedron,Fabrício de Araújo Ruiz,Luis Fernando Chimelo Silva,Carlos Antônio da |
dc.subject.por.fl_str_mv |
decision tree pedometry soil survey mapping unit |
topic |
decision tree pedometry soil survey mapping unit |
description |
Digital information generates the possibility of a high degree of redundancy in the data available for fitting predictive models used for Digital Soil Mapping (DSM). Among these models, the Decision Tree (DT) technique has been increasingly applied due to its capacity of dealing with large datasets. The purpose of this study was to evaluate the impact of the data volume used to generate the DT models on the quality of soil maps. An area of 889.33 km² was chosen in the Northern region of the State of Rio Grande do Sul. The soil-landscape relationship was obtained from reambulation of the studied area and the alignment of the units in the 1:50,000 scale topographic mapping. Six predictive covariates linked to the factors soil formation, relief and organisms, together with data sets of 1, 3, 5, 10, 15, 20 and 25 % of the total data volume, were used to generate the predictive DT models in the data mining program Waikato Environment for Knowledge Analysis (WEKA). In this study, sample densities below 5 % resulted in models with lower power of capturing the complexity of the spatial distribution of the soil in the study area. The relation between the data volume to be handled and the predictive capacity of the models was best for samples between 5 and 15 %. For the models based on these sample densities, the collected field data indicated an accuracy of predictive mapping close to 70 %. |
publishDate |
2013 |
dc.date.none.fl_str_mv |
2013-04-01 |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/article |
dc.type.status.fl_str_mv |
info:eu-repo/semantics/publishedVersion |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007 |
url |
http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832013000200007 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.relation.none.fl_str_mv |
10.1590/S0100-06832013000200007 |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.format.none.fl_str_mv |
text/html |
dc.publisher.none.fl_str_mv |
Sociedade Brasileira de Ciência do Solo |
publisher.none.fl_str_mv |
Sociedade Brasileira de Ciência do Solo |
dc.source.none.fl_str_mv |
Revista Brasileira de Ciência do Solo v.37 n.2 2013 reponame:Revista Brasileira de Ciência do Solo (Online) instname:Sociedade Brasileira de Ciência do Solo (SBCS) instacron:SBCS |
instname_str |
Sociedade Brasileira de Ciência do Solo (SBCS) |
instacron_str |
SBCS |
institution |
SBCS |
reponame_str |
Revista Brasileira de Ciência do Solo (Online) |
collection |
Revista Brasileira de Ciência do Solo (Online) |
repository.name.fl_str_mv |
Revista Brasileira de Ciência do Solo (Online) - Sociedade Brasileira de Ciência do Solo (SBCS) |
repository.mail.fl_str_mv |
||sbcs@ufv.br |
_version_ |
1752126518534864896 |