Prediction of topsoil texture through regression trees and multiple linear regressions.

Detalhes bibliográficos
Autor(a) principal: PINHEIRO, H. S. K.
Data de Publicação: 2018
Outros Autores: CARVALHO JUNIOR, W. de, CHAGAS, C. da S., ANJOS, L. H. C. dos, OWENS, P. R.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994
https://doi.org/10.1590/18069657rbcs20170167
Resumo: Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.
id EMBR_edecd0e4b0b5ec4bca5cf2d22c2ac705
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/1089994
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Prediction of topsoil texture through regression trees and multiple linear regressions.Atributos do terrenoFunções de profundidade do soloMapeamento digital do soloModelos de regressãoUsers of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.HELENA SARAIVA KOENOW PINHEIRO, UFRRJ; WALDIR DE CARVALHO JUNIOR, CNPS; CESAR DA SILVA CHAGAS, CNPS; LÚCIA HELENA CUNHA DOS ANJOS, UFRRJ; PHILLIP RAY OWENS, USDA.PINHEIRO, H. S. K.CARVALHO JUNIOR, W. deCHAGAS, C. da S.ANJOS, L. H. C. dosOWENS, P. R.2018-04-03T00:35:35Z2018-04-03T00:35:35Z2018-04-0220182019-04-16T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleRevista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994https://doi.org/10.1590/18069657rbcs20170167enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2018-04-03T00:35:43Zoai:www.alice.cnptia.embrapa.br:doc/1089994Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542018-04-03T00:35:43falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542018-04-03T00:35:43Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Prediction of topsoil texture through regression trees and multiple linear regressions.
title Prediction of topsoil texture through regression trees and multiple linear regressions.
spellingShingle Prediction of topsoil texture through regression trees and multiple linear regressions.
PINHEIRO, H. S. K.
Atributos do terreno
Funções de profundidade do solo
Mapeamento digital do solo
Modelos de regressão
title_short Prediction of topsoil texture through regression trees and multiple linear regressions.
title_full Prediction of topsoil texture through regression trees and multiple linear regressions.
title_fullStr Prediction of topsoil texture through regression trees and multiple linear regressions.
title_full_unstemmed Prediction of topsoil texture through regression trees and multiple linear regressions.
title_sort Prediction of topsoil texture through regression trees and multiple linear regressions.
author PINHEIRO, H. S. K.
author_facet PINHEIRO, H. S. K.
CARVALHO JUNIOR, W. de
CHAGAS, C. da S.
ANJOS, L. H. C. dos
OWENS, P. R.
author_role author
author2 CARVALHO JUNIOR, W. de
CHAGAS, C. da S.
ANJOS, L. H. C. dos
OWENS, P. R.
author2_role author
author
author
author
dc.contributor.none.fl_str_mv HELENA SARAIVA KOENOW PINHEIRO, UFRRJ; WALDIR DE CARVALHO JUNIOR, CNPS; CESAR DA SILVA CHAGAS, CNPS; LÚCIA HELENA CUNHA DOS ANJOS, UFRRJ; PHILLIP RAY OWENS, USDA.
dc.contributor.author.fl_str_mv PINHEIRO, H. S. K.
CARVALHO JUNIOR, W. de
CHAGAS, C. da S.
ANJOS, L. H. C. dos
OWENS, P. R.
dc.subject.por.fl_str_mv Atributos do terreno
Funções de profundidade do solo
Mapeamento digital do solo
Modelos de regressão
topic Atributos do terreno
Funções de profundidade do solo
Mapeamento digital do solo
Modelos de regressão
description Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.
publishDate 2018
dc.date.none.fl_str_mv 2018-04-03T00:35:35Z
2018-04-03T00:35:35Z
2018-04-02
2018
2019-04-16T11:11:11Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Revista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994
https://doi.org/10.1590/18069657rbcs20170167
identifier_str_mv Revista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018.
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994
https://doi.org/10.1590/18069657rbcs20170167
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503451859746816