Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions

Detalhes bibliográficos
Autor(a) principal: Pinheiro,Helena Saraiva Koenow
Data de Publicação: 2018
Outros Autores: Carvalho Junior,Waldir de, Chagas,César da Silva, Anjos,Lúcia Helena Cunha dos, Owens,Phillip Ray
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Revista Brasileira de Ciência do Solo (Online)
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832018000100304
Resumo: ABSTRACT: Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km2) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil. Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R2 values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.
id SBCS-1_3136e12548826bc3805d9f55705e1a01
oai_identifier_str oai:scielo:S0100-06832018000100304
network_acronym_str SBCS-1
network_name_str Revista Brasileira de Ciência do Solo (Online)
repository_id_str
spelling Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressionsterrain attributessoil depth functionsdigital soil mappingregression modelsABSTRACT: Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km2) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil. Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R2 values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.Sociedade Brasileira de Ciência do Solo2018-01-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832018000100304Revista Brasileira de Ciência do Solo v.42 2018reponame:Revista Brasileira de Ciência do Solo (Online)instname:Sociedade Brasileira de Ciência do Solo (SBCS)instacron:SBCS10.1590/18069657rbcs20170167info:eu-repo/semantics/openAccessPinheiro,Helena Saraiva KoenowCarvalho Junior,Waldir deChagas,César da SilvaAnjos,Lúcia Helena Cunha dosOwens,Phillip Rayeng2018-04-10T00:00:00Zoai:scielo:S0100-06832018000100304Revistahttp://www.scielo.br/scielo.php?script=sci_serial&pid=0100-0683&lng=es&nrm=isohttps://old.scielo.br/oai/scielo-oai.php||sbcs@ufv.br1806-96570100-0683opendoar:2018-04-10T00:00Revista Brasileira de Ciência do Solo (Online) - Sociedade Brasileira de Ciência do Solo (SBCS)false
dc.title.none.fl_str_mv Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
title Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
spellingShingle Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
Pinheiro,Helena Saraiva Koenow
terrain attributes
soil depth functions
digital soil mapping
regression models
title_short Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
title_full Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
title_fullStr Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
title_full_unstemmed Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
title_sort Prediction of Topsoil Texture Through Regression Trees and Multiple Linear Regressions
author Pinheiro,Helena Saraiva Koenow
author_facet Pinheiro,Helena Saraiva Koenow
Carvalho Junior,Waldir de
Chagas,César da Silva
Anjos,Lúcia Helena Cunha dos
Owens,Phillip Ray
author_role author
author2 Carvalho Junior,Waldir de
Chagas,César da Silva
Anjos,Lúcia Helena Cunha dos
Owens,Phillip Ray
author2_role author
author
author
author
dc.contributor.author.fl_str_mv Pinheiro,Helena Saraiva Koenow
Carvalho Junior,Waldir de
Chagas,César da Silva
Anjos,Lúcia Helena Cunha dos
Owens,Phillip Ray
dc.subject.por.fl_str_mv terrain attributes
soil depth functions
digital soil mapping
regression models
topic terrain attributes
soil depth functions
digital soil mapping
regression models
description ABSTRACT: Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km2) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil. Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R2 values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.
publishDate 2018
dc.date.none.fl_str_mv 2018-01-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832018000100304
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0100-06832018000100304
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/18069657rbcs20170167
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Sociedade Brasileira de Ciência do Solo
publisher.none.fl_str_mv Sociedade Brasileira de Ciência do Solo
dc.source.none.fl_str_mv Revista Brasileira de Ciência do Solo v.42 2018
reponame:Revista Brasileira de Ciência do Solo (Online)
instname:Sociedade Brasileira de Ciência do Solo (SBCS)
instacron:SBCS
instname_str Sociedade Brasileira de Ciência do Solo (SBCS)
instacron_str SBCS
institution SBCS
reponame_str Revista Brasileira de Ciência do Solo (Online)
collection Revista Brasileira de Ciência do Solo (Online)
repository.name.fl_str_mv Revista Brasileira de Ciência do Solo (Online) - Sociedade Brasileira de Ciência do Solo (SBCS)
repository.mail.fl_str_mv ||sbcs@ufv.br
_version_ 1752126521733021696