Prediction of topsoil texture through regression trees and multiple linear regressions.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
Texto Completo: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994 https://doi.org/10.1590/18069657rbcs20170167 |
Resumo: | Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions. |
id |
EMBR_edecd0e4b0b5ec4bca5cf2d22c2ac705 |
---|---|
oai_identifier_str |
oai:www.alice.cnptia.embrapa.br:doc/1089994 |
network_acronym_str |
EMBR |
network_name_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository_id_str |
2154 |
spelling |
Prediction of topsoil texture through regression trees and multiple linear regressions.Atributos do terrenoFunções de profundidade do soloMapeamento digital do soloModelos de regressãoUsers of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.HELENA SARAIVA KOENOW PINHEIRO, UFRRJ; WALDIR DE CARVALHO JUNIOR, CNPS; CESAR DA SILVA CHAGAS, CNPS; LÚCIA HELENA CUNHA DOS ANJOS, UFRRJ; PHILLIP RAY OWENS, USDA.PINHEIRO, H. S. K.CARVALHO JUNIOR, W. deCHAGAS, C. da S.ANJOS, L. H. C. dosOWENS, P. R.2018-04-03T00:35:35Z2018-04-03T00:35:35Z2018-04-0220182019-04-16T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleRevista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994https://doi.org/10.1590/18069657rbcs20170167enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2018-04-03T00:35:43Zoai:www.alice.cnptia.embrapa.br:doc/1089994Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542018-04-03T00:35:43falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542018-04-03T00:35:43Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false |
dc.title.none.fl_str_mv |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
title |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
spellingShingle |
Prediction of topsoil texture through regression trees and multiple linear regressions. PINHEIRO, H. S. K. Atributos do terreno Funções de profundidade do solo Mapeamento digital do solo Modelos de regressão |
title_short |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
title_full |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
title_fullStr |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
title_full_unstemmed |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
title_sort |
Prediction of topsoil texture through regression trees and multiple linear regressions. |
author |
PINHEIRO, H. S. K. |
author_facet |
PINHEIRO, H. S. K. CARVALHO JUNIOR, W. de CHAGAS, C. da S. ANJOS, L. H. C. dos OWENS, P. R. |
author_role |
author |
author2 |
CARVALHO JUNIOR, W. de CHAGAS, C. da S. ANJOS, L. H. C. dos OWENS, P. R. |
author2_role |
author author author author |
dc.contributor.none.fl_str_mv |
HELENA SARAIVA KOENOW PINHEIRO, UFRRJ; WALDIR DE CARVALHO JUNIOR, CNPS; CESAR DA SILVA CHAGAS, CNPS; LÚCIA HELENA CUNHA DOS ANJOS, UFRRJ; PHILLIP RAY OWENS, USDA. |
dc.contributor.author.fl_str_mv |
PINHEIRO, H. S. K. CARVALHO JUNIOR, W. de CHAGAS, C. da S. ANJOS, L. H. C. dos OWENS, P. R. |
dc.subject.por.fl_str_mv |
Atributos do terreno Funções de profundidade do solo Mapeamento digital do solo Modelos de regressão |
topic |
Atributos do terreno Funções de profundidade do solo Mapeamento digital do solo Modelos de regressão |
description |
Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions. |
publishDate |
2018 |
dc.date.none.fl_str_mv |
2018-04-03T00:35:35Z 2018-04-03T00:35:35Z 2018-04-02 2018 2019-04-16T11:11:11Z |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
Revista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018. http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994 https://doi.org/10.1590/18069657rbcs20170167 |
identifier_str_mv |
Revista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018. |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994 https://doi.org/10.1590/18069657rbcs20170167 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
collection |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository.name.fl_str_mv |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
cg-riaa@embrapa.br |
_version_ |
1794503451859746816 |