Prediction of topsoil texture through regression trees and multiple linear regressions.

PINHEIRO, H. S. K.; CARVALHO JUNIOR, W. de; CHAGAS, C. da S.; ANJOS, L. H. C. dos; OWENS, P. R.

Prediction of topsoil texture through regression trees and multiple linear regressions.

Detalhes bibliográficos
Autor(a) principal:	PINHEIRO, H. S. K.
Data de Publicação:	2018
Outros Autores:	CARVALHO JUNIOR, W. de, CHAGAS, C. da S., ANJOS, L. H. C. dos, OWENS, P. R.
Tipo de documento:	Artigo
Idioma:	eng
Título da fonte:	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo:	http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994 https://doi.org/10.1590/18069657rbcs20170167
Resumo:	Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.

Metadados do item

id	EMBR_edecd0e4b0b5ec4bca5cf2d22c2ac705
oai_identifier_str	oai:www.alice.cnptia.embrapa.br:doc/1089994
network_acronym_str	EMBR
network_name_str	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str	2154
spelling	Prediction of topsoil texture through regression trees and multiple linear regressions.Atributos do terrenoFunções de profundidade do soloMapeamento digital do soloModelos de regressãoUsers of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.HELENA SARAIVA KOENOW PINHEIRO, UFRRJ; WALDIR DE CARVALHO JUNIOR, CNPS; CESAR DA SILVA CHAGAS, CNPS; LÚCIA HELENA CUNHA DOS ANJOS, UFRRJ; PHILLIP RAY OWENS, USDA.PINHEIRO, H. S. K.CARVALHO JUNIOR, W. deCHAGAS, C. da S.ANJOS, L. H. C. dosOWENS, P. R.2018-04-03T00:35:35Z2018-04-03T00:35:35Z2018-04-0220182019-04-16T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articleRevista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994https://doi.org/10.1590/18069657rbcs20170167enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2018-04-03T00:35:43Zoai:www.alice.cnptia.embrapa.br:doc/1089994Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542018-04-03T00:35:43falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542018-04-03T00:35:43Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv	Prediction of topsoil texture through regression trees and multiple linear regressions.
title	Prediction of topsoil texture through regression trees and multiple linear regressions.
spellingShingle	Prediction of topsoil texture through regression trees and multiple linear regressions. PINHEIRO, H. S. K. Atributos do terreno Funções de profundidade do solo Mapeamento digital do solo Modelos de regressão
title_short	Prediction of topsoil texture through regression trees and multiple linear regressions.
title_full	Prediction of topsoil texture through regression trees and multiple linear regressions.
title_fullStr	Prediction of topsoil texture through regression trees and multiple linear regressions.
title_full_unstemmed	Prediction of topsoil texture through regression trees and multiple linear regressions.
title_sort	Prediction of topsoil texture through regression trees and multiple linear regressions.
author	PINHEIRO, H. S. K.
author_facet	PINHEIRO, H. S. K. CARVALHO JUNIOR, W. de CHAGAS, C. da S. ANJOS, L. H. C. dos OWENS, P. R.
author_role	author
author2	CARVALHO JUNIOR, W. de CHAGAS, C. da S. ANJOS, L. H. C. dos OWENS, P. R.
author2_role	author author author author
dc.contributor.none.fl_str_mv	HELENA SARAIVA KOENOW PINHEIRO, UFRRJ; WALDIR DE CARVALHO JUNIOR, CNPS; CESAR DA SILVA CHAGAS, CNPS; LÚCIA HELENA CUNHA DOS ANJOS, UFRRJ; PHILLIP RAY OWENS, USDA.
dc.contributor.author.fl_str_mv	PINHEIRO, H. S. K. CARVALHO JUNIOR, W. de CHAGAS, C. da S. ANJOS, L. H. C. dos OWENS, P. R.
dc.subject.por.fl_str_mv	Atributos do terreno Funções de profundidade do solo Mapeamento digital do solo Modelos de regressão
topic	Atributos do terreno Funções de profundidade do solo Mapeamento digital do solo Modelos de regressão
description	Users of soil survey products are mostly interested in understanding how soil properties vary in space and time. The aim of digital soil mapping (DSM) is to represent the spatial variability of soil properties quantitatively to support decision-making. The goal of this study is to evaluate DSM techniques (Regression Trees - RT and Multiple Linear Regressions - MLR) and the ability of these tools to predict mineral fraction content under a wide variability of landscapes. The study site was the entire Guapi-Macacu watershed (1,250.78 km²) in the state of Rio de Janeiro in the Southeast region of Brazil. Terrain attributes and remote sensing data (with 30 m of spatial resolution) were used to represent landscape co-variables selected as an input in predictive models in order to develop the explanatory variables. The selection of sampling sites was based on the Latin Hypercube algorithm. A representative set of one hundred points with feasible field access was chosen. Different input databases were tested for prediction of mineral fraction content (harmonized and original data). The Spline algorithm was used to harmonize data according to the GlobalSoil.Net consortium standards. The results showed better performance from the RT models, using input from an average of six covariates; the simplest MLR model used twice as many input variables, creating more complex models without gaining precision. Furthermore, better R² values were obtained using RT models, irrespective of harmonization of soil data. The harmonized dataset from the 0.00-0.05 and 0.05-0.15 m layers, in general, presented better results for the clay and silt, with R2 values of 0.52 (0.00-0.05 m) and 0.69 (0.05-0.15 m), respectively. Prediction of sand content showed better results when the original depth data was used as an input, although all regression tree models had R2 values greater than 0.52. The RT models provided a better statistical index than MLR for all predicted properties; however, the variance between models suggests similarity of performance. Regarding harmonization of soil data, both input databases (harmonized or not) can be used to predict soil properties, since the variance of model performance was low and generalization of the soil maps showed similar trends. The products obtained from the digital soil mapping approach make it possible to integrate the factor of uncertainties, providing easier interpretation for soil management and land use decisions.
publishDate	2018
dc.date.none.fl_str_mv	2018-04-03T00:35:35Z 2018-04-03T00:35:35Z 2018-04-02 2018 2019-04-16T11:11:11Z
dc.type.driver.fl_str_mv	info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article
format	article
status_str	publishedVersion
dc.identifier.uri.fl_str_mv	Revista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018. http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994 https://doi.org/10.1590/18069657rbcs20170167
identifier_str_mv	Revista Brasileira de Ciência do Solo, v. 42, article e0170167, 2018.
url	http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089994 https://doi.org/10.1590/18069657rbcs20170167
dc.language.iso.fl_str_mv	eng
language	eng
dc.rights.driver.fl_str_mv	info:eu-repo/semantics/openAccess
eu_rights_str_mv	openAccess
dc.source.none.fl_str_mv	reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA
instname_str	Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str	EMBRAPA
institution	EMBRAPA
reponame_str	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv	Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv	cg-riaa@embrapa.br
_version_	1794503451859746816

Prediction of topsoil texture through regression trees and multiple linear regressions.

Registros relacionados