Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal

Detalhes bibliográficos
Autor(a) principal: Araujo, Ralf Vieira de
Data de Publicação: 2021
Tipo de documento: Dissertação
Idioma: eng
Título da fonte: Biblioteca Digital de Teses e Dissertações da USP
Texto Completo: https://www.teses.usp.br/teses/disponiveis/11/11140/tde-11102021-103725/
Resumo: Brazil, as other countries, has established energy and climate policies that foster the use of biofuels as sugarcane ethanol, in which a growing practice is to use harvesting residues, the straw, for cogeneration of electricity or to produce second-generation ethanol. In this study, it was aimed to create machine learning (ML) models capable of predict short-term changes in the soil organic carbon stocks according to the mass of sugarcane straw leftover the soil during harvest. Considerations were also made on the current state of the art regarding the application of ML in soil science, with an emphasis on tropical soils and soil carbon stocks. The initial data was generated between 2015 and 2018 in five experimental sites under commercial cultivation of sugarcane in Brazilian south-central region and the available variables were related to climate, soil physical and chemical attributes, organic matter and crop variety. The variable to be predicted (y) was the rate of carbon stock change per area per year (Mg C ha-1 yr-1) in relation to the total dry mass of straw. The initial dataset was divided into training (80%) and test (20%) and eight ML models were trained using the algorithms Random Forest (RF) and Support Vector Machine (SVM) associated to four feature selection methods. Results were evaluated using 10-fold cross-validation of the root mean squared error (RMSE) in the training set and prediction RMSE in the test set. The trained models were statistically compared among them and to the use of mean y stratified by straw mass deposited and soil layer. All the ML models surpassed the simple generalization of previously known mean values of y. The model SVM associated with RF feature selection performed better with a considerable reduction in the number of attributes, which could reduce the costs and effort of data acquisition and processing in future applications. The achievements indicate that ML models are good tools to predict short-term changes in carbon stocks due to total or partial straw remotion from the field. The obtained results and applied methodology have the potential to help producers and decision-makers interested in identifying cause-effect relationships between in situ crop conditions, straw management and expected soil carbon variations.
id USP_f69c16b90a987f4247fd1cf8d1b3cbdc
oai_identifier_str oai:teses.usp.br:tde-11102021-103725
network_acronym_str USP
network_name_str Biblioteca Digital de Teses e Dissertações da USP
repository_id_str 2721
spelling Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removalAprendizado de máquina para predição de alterações nos estoques de carbono do solo em cultivo de cana-de-açúcar devido à remoção da palhaAtributos do soloCarbono orgânico do soloMachine learningMachine learningManejo de resíduosSoil attributesSoil organic carbonWaste managementBrazil, as other countries, has established energy and climate policies that foster the use of biofuels as sugarcane ethanol, in which a growing practice is to use harvesting residues, the straw, for cogeneration of electricity or to produce second-generation ethanol. In this study, it was aimed to create machine learning (ML) models capable of predict short-term changes in the soil organic carbon stocks according to the mass of sugarcane straw leftover the soil during harvest. Considerations were also made on the current state of the art regarding the application of ML in soil science, with an emphasis on tropical soils and soil carbon stocks. The initial data was generated between 2015 and 2018 in five experimental sites under commercial cultivation of sugarcane in Brazilian south-central region and the available variables were related to climate, soil physical and chemical attributes, organic matter and crop variety. The variable to be predicted (y) was the rate of carbon stock change per area per year (Mg C ha-1 yr-1) in relation to the total dry mass of straw. The initial dataset was divided into training (80%) and test (20%) and eight ML models were trained using the algorithms Random Forest (RF) and Support Vector Machine (SVM) associated to four feature selection methods. Results were evaluated using 10-fold cross-validation of the root mean squared error (RMSE) in the training set and prediction RMSE in the test set. The trained models were statistically compared among them and to the use of mean y stratified by straw mass deposited and soil layer. All the ML models surpassed the simple generalization of previously known mean values of y. The model SVM associated with RF feature selection performed better with a considerable reduction in the number of attributes, which could reduce the costs and effort of data acquisition and processing in future applications. The achievements indicate that ML models are good tools to predict short-term changes in carbon stocks due to total or partial straw remotion from the field. The obtained results and applied methodology have the potential to help producers and decision-makers interested in identifying cause-effect relationships between in situ crop conditions, straw management and expected soil carbon variations.O Brasil estabeleceu políticas energéticas e climáticas que fomentam o uso de biocombustíveis como o etanol da cana-de-açúcar. Uma prática crescente é usar os resíduos da colheita, a palha de cana-de-açúcar, para cogeração de energia elétrica ou para produzir etanol de segunda geração. Neste estudo, objetivou-se prever mudanças de curto prazo nos estoques de carbono orgânico do solo de acordo com a massa de resíduos depositada na colheita utilizando técnicas de aprendizado de máquina (AM). Foram feitas também considerações sobre o atual estado da arte relativo à aplicação de AM nas ciências do solo, com ênfase em solos tropicais e estoques de carbono do solo. Os dados iniciais foram gerados entre 2015 e 2018 em cinco áreas de cultivo comercial de cana-de-açúcar na região centro-sul do Brasil e as variáveis disponíveis relacionam-se ao clima, atributos físicos e químicos do solo, matéria orgânica e variedade cultural. A variável predita (y) foi a taxa de variação do estoque de carbono por área por ano (Mg C ha-1 ano-1) em relação à massa seca total da palha. O conjunto de dados inicial foi dividido em treino (80%) e teste (20%) e oito modelos baseados em algoritmos de AM foram desenvolvidos utilizando Random Forest (RF) e Support Vector Machine (SVM) associados a quatro métodos de seleção de atributos. Os resultados foram avaliados pela raiz do erro quadrático médio (RMSE) com validação cruzada no conjunto treino e RMSE da predição no conjunto de teste. Os modelos treinados foram comparados com a adoção de valores médios de y estratificados por massa de palha depositada e camada de solo e entre eles (p < 0,05). Todos os modelos AM superaram a generalização de valores médios de y previamente conhecidos. O modelo SVM aplicado ao conjunto de atributos selecionado por RF apresentou melhor desempenho com redução considerável no número de atributos, o que poderia reduzir os custos e esforço de aquisição e processamento de dados em aplicações futuras. Conclui-se que modelos de AM são boas ferramentas para prever mudanças de curto prazo nos estoques de carbono devido à remoção total ou parcial da palha do campo. Os resultados obtidos e metodologia aplicada tem potencial de auxiliar produtores e gestores a identificar ralações de causa-efeito entre as condições locais de cultivo, o manejo da palhada adotado e as variações esperadas no carbono orgânico do solo.Biblioteca Digitais de Teses e Dissertações da USPCerri, Carlos Eduardo PellegrinoAraujo, Ralf Vieira de2021-08-31info:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/masterThesisapplication/pdfhttps://www.teses.usp.br/teses/disponiveis/11/11140/tde-11102021-103725/reponame:Biblioteca Digital de Teses e Dissertações da USPinstname:Universidade de São Paulo (USP)instacron:USPLiberar o conteúdo para acesso público.info:eu-repo/semantics/openAccesseng2021-10-13T17:58:02Zoai:teses.usp.br:tde-11102021-103725Biblioteca Digital de Teses e Dissertaçõeshttp://www.teses.usp.br/PUBhttp://www.teses.usp.br/cgi-bin/mtd2br.plvirginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.bropendoar:27212021-10-13T17:58:02Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
Aprendizado de máquina para predição de alterações nos estoques de carbono do solo em cultivo de cana-de-açúcar devido à remoção da palha
title Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
spellingShingle Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
Araujo, Ralf Vieira de
Atributos do solo
Carbono orgânico do solo
Machine learning
Machine learning
Manejo de resíduos
Soil attributes
Soil organic carbon
Waste management
title_short Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
title_full Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
title_fullStr Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
title_full_unstemmed Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
title_sort Machine learning for prediction of soil carbon stock changes in sugarcane crop due to straw removal
author Araujo, Ralf Vieira de
author_facet Araujo, Ralf Vieira de
author_role author
dc.contributor.none.fl_str_mv Cerri, Carlos Eduardo Pellegrino
dc.contributor.author.fl_str_mv Araujo, Ralf Vieira de
dc.subject.por.fl_str_mv Atributos do solo
Carbono orgânico do solo
Machine learning
Machine learning
Manejo de resíduos
Soil attributes
Soil organic carbon
Waste management
topic Atributos do solo
Carbono orgânico do solo
Machine learning
Machine learning
Manejo de resíduos
Soil attributes
Soil organic carbon
Waste management
description Brazil, as other countries, has established energy and climate policies that foster the use of biofuels as sugarcane ethanol, in which a growing practice is to use harvesting residues, the straw, for cogeneration of electricity or to produce second-generation ethanol. In this study, it was aimed to create machine learning (ML) models capable of predict short-term changes in the soil organic carbon stocks according to the mass of sugarcane straw leftover the soil during harvest. Considerations were also made on the current state of the art regarding the application of ML in soil science, with an emphasis on tropical soils and soil carbon stocks. The initial data was generated between 2015 and 2018 in five experimental sites under commercial cultivation of sugarcane in Brazilian south-central region and the available variables were related to climate, soil physical and chemical attributes, organic matter and crop variety. The variable to be predicted (y) was the rate of carbon stock change per area per year (Mg C ha-1 yr-1) in relation to the total dry mass of straw. The initial dataset was divided into training (80%) and test (20%) and eight ML models were trained using the algorithms Random Forest (RF) and Support Vector Machine (SVM) associated to four feature selection methods. Results were evaluated using 10-fold cross-validation of the root mean squared error (RMSE) in the training set and prediction RMSE in the test set. The trained models were statistically compared among them and to the use of mean y stratified by straw mass deposited and soil layer. All the ML models surpassed the simple generalization of previously known mean values of y. The model SVM associated with RF feature selection performed better with a considerable reduction in the number of attributes, which could reduce the costs and effort of data acquisition and processing in future applications. The achievements indicate that ML models are good tools to predict short-term changes in carbon stocks due to total or partial straw remotion from the field. The obtained results and applied methodology have the potential to help producers and decision-makers interested in identifying cause-effect relationships between in situ crop conditions, straw management and expected soil carbon variations.
publishDate 2021
dc.date.none.fl_str_mv 2021-08-31
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
dc.type.driver.fl_str_mv info:eu-repo/semantics/masterThesis
format masterThesis
status_str publishedVersion
dc.identifier.uri.fl_str_mv https://www.teses.usp.br/teses/disponiveis/11/11140/tde-11102021-103725/
url https://www.teses.usp.br/teses/disponiveis/11/11140/tde-11102021-103725/
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv
dc.rights.driver.fl_str_mv Liberar o conteúdo para acesso público.
info:eu-repo/semantics/openAccess
rights_invalid_str_mv Liberar o conteúdo para acesso público.
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv application/pdf
dc.coverage.none.fl_str_mv
dc.publisher.none.fl_str_mv Biblioteca Digitais de Teses e Dissertações da USP
publisher.none.fl_str_mv Biblioteca Digitais de Teses e Dissertações da USP
dc.source.none.fl_str_mv
reponame:Biblioteca Digital de Teses e Dissertações da USP
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Biblioteca Digital de Teses e Dissertações da USP
collection Biblioteca Digital de Teses e Dissertações da USP
repository.name.fl_str_mv Biblioteca Digital de Teses e Dissertações da USP - Universidade de São Paulo (USP)
repository.mail.fl_str_mv virginia@if.usp.br|| atendimento@aguia.usp.br||virginia@if.usp.br
_version_ 1809090629253201920