Classification of soil respiration in areas of sugarcane renewal using decision tree

Detalhes bibliográficos
Autor(a) principal: Farhate,Camila Viana Vieira
Data de Publicação: 2018
Outros Autores: Souza,Zigomar Menezes de, Oliveira,Stanley Robson de Medeiros, Carvalho,João Luís Nunes, Scala Júnior,Newton La, Santos,Ana Paula Guimarães
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Scientia Agrícola (Online)
Texto Completo: http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162018000300216
Resumo: ABSTRACT: The use of data mining is a promising alternative to predict soil respiration from correlated variables. Our objective was to build a model using variable selection and decision tree induction to predict different levels of soil respiration, taking into account physical, chemical and microbiological variables of soil as well as precipitation in renewal of sugarcane areas. The original dataset was composed of 19 variables (18 independent variables and one dependent (or response) variable). The variable-target refers to soil respiration as the target classification. Due to a large number of variables, a procedure for variable selection was conducted to remove those with low correlation with the variable-target. For that purpose, four approaches of variable selection were evaluated: no variable selection, correlation-based feature selection (CFS), chisquare method (χ2) and Wrapper. To classify soil respiration, we used the decision tree induction technique available in the Weka software package. Our results showed that data mining techniques allow the development of a model for soil respiration classification with accuracy of 81 %, resulting in a knowledge base composed of 27 rules for prediction of soil respiration. In particular, the wrapper method for variable selection identified a subset of only five variables out of 18 available in the original dataset, and they had the following order of influence in determining soil respiration: soil temperature > precipitation > macroporosity > soil moisture > potential acidity.
id USP-18_3f5f66ad813c4244f6dfb50a2a2647f6
oai_identifier_str oai:scielo:S0103-90162018000300216
network_acronym_str USP-18
network_name_str Scientia Agrícola (Online)
repository_id_str
spelling Classification of soil respiration in areas of sugarcane renewal using decision treesoil CO2 emissiondata miningvariable selectionsoil temperaturesoil organic matterABSTRACT: The use of data mining is a promising alternative to predict soil respiration from correlated variables. Our objective was to build a model using variable selection and decision tree induction to predict different levels of soil respiration, taking into account physical, chemical and microbiological variables of soil as well as precipitation in renewal of sugarcane areas. The original dataset was composed of 19 variables (18 independent variables and one dependent (or response) variable). The variable-target refers to soil respiration as the target classification. Due to a large number of variables, a procedure for variable selection was conducted to remove those with low correlation with the variable-target. For that purpose, four approaches of variable selection were evaluated: no variable selection, correlation-based feature selection (CFS), chisquare method (χ2) and Wrapper. To classify soil respiration, we used the decision tree induction technique available in the Weka software package. Our results showed that data mining techniques allow the development of a model for soil respiration classification with accuracy of 81 %, resulting in a knowledge base composed of 27 rules for prediction of soil respiration. In particular, the wrapper method for variable selection identified a subset of only five variables out of 18 available in the original dataset, and they had the following order of influence in determining soil respiration: soil temperature > precipitation > macroporosity > soil moisture > potential acidity.Escola Superior de Agricultura "Luiz de Queiroz"2018-05-01info:eu-repo/semantics/articleinfo:eu-repo/semantics/publishedVersiontext/htmlhttp://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162018000300216Scientia Agricola v.75 n.3 2018reponame:Scientia Agrícola (Online)instname:Universidade de São Paulo (USP)instacron:USP10.1590/1678-992x-2016-0473info:eu-repo/semantics/openAccessFarhate,Camila Viana VieiraSouza,Zigomar Menezes deOliveira,Stanley Robson de MedeirosCarvalho,João Luís NunesScala Júnior,Newton LaSantos,Ana Paula Guimarãeseng2018-01-29T00:00:00Zoai:scielo:S0103-90162018000300216Revistahttp://revistas.usp.br/sa/indexPUBhttps://old.scielo.br/oai/scielo-oai.phpscientia@usp.br||alleoni@usp.br1678-992X0103-9016opendoar:2018-01-29T00:00Scientia Agrícola (Online) - Universidade de São Paulo (USP)false
dc.title.none.fl_str_mv Classification of soil respiration in areas of sugarcane renewal using decision tree
title Classification of soil respiration in areas of sugarcane renewal using decision tree
spellingShingle Classification of soil respiration in areas of sugarcane renewal using decision tree
Farhate,Camila Viana Vieira
soil CO2 emission
data mining
variable selection
soil temperature
soil organic matter
title_short Classification of soil respiration in areas of sugarcane renewal using decision tree
title_full Classification of soil respiration in areas of sugarcane renewal using decision tree
title_fullStr Classification of soil respiration in areas of sugarcane renewal using decision tree
title_full_unstemmed Classification of soil respiration in areas of sugarcane renewal using decision tree
title_sort Classification of soil respiration in areas of sugarcane renewal using decision tree
author Farhate,Camila Viana Vieira
author_facet Farhate,Camila Viana Vieira
Souza,Zigomar Menezes de
Oliveira,Stanley Robson de Medeiros
Carvalho,João Luís Nunes
Scala Júnior,Newton La
Santos,Ana Paula Guimarães
author_role author
author2 Souza,Zigomar Menezes de
Oliveira,Stanley Robson de Medeiros
Carvalho,João Luís Nunes
Scala Júnior,Newton La
Santos,Ana Paula Guimarães
author2_role author
author
author
author
author
dc.contributor.author.fl_str_mv Farhate,Camila Viana Vieira
Souza,Zigomar Menezes de
Oliveira,Stanley Robson de Medeiros
Carvalho,João Luís Nunes
Scala Júnior,Newton La
Santos,Ana Paula Guimarães
dc.subject.por.fl_str_mv soil CO2 emission
data mining
variable selection
soil temperature
soil organic matter
topic soil CO2 emission
data mining
variable selection
soil temperature
soil organic matter
description ABSTRACT: The use of data mining is a promising alternative to predict soil respiration from correlated variables. Our objective was to build a model using variable selection and decision tree induction to predict different levels of soil respiration, taking into account physical, chemical and microbiological variables of soil as well as precipitation in renewal of sugarcane areas. The original dataset was composed of 19 variables (18 independent variables and one dependent (or response) variable). The variable-target refers to soil respiration as the target classification. Due to a large number of variables, a procedure for variable selection was conducted to remove those with low correlation with the variable-target. For that purpose, four approaches of variable selection were evaluated: no variable selection, correlation-based feature selection (CFS), chisquare method (χ2) and Wrapper. To classify soil respiration, we used the decision tree induction technique available in the Weka software package. Our results showed that data mining techniques allow the development of a model for soil respiration classification with accuracy of 81 %, resulting in a knowledge base composed of 27 rules for prediction of soil respiration. In particular, the wrapper method for variable selection identified a subset of only five variables out of 18 available in the original dataset, and they had the following order of influence in determining soil respiration: soil temperature > precipitation > macroporosity > soil moisture > potential acidity.
publishDate 2018
dc.date.none.fl_str_mv 2018-05-01
dc.type.driver.fl_str_mv info:eu-repo/semantics/article
dc.type.status.fl_str_mv info:eu-repo/semantics/publishedVersion
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162018000300216
url http://old.scielo.br/scielo.php?script=sci_arttext&pid=S0103-90162018000300216
dc.language.iso.fl_str_mv eng
language eng
dc.relation.none.fl_str_mv 10.1590/1678-992x-2016-0473
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.format.none.fl_str_mv text/html
dc.publisher.none.fl_str_mv Escola Superior de Agricultura "Luiz de Queiroz"
publisher.none.fl_str_mv Escola Superior de Agricultura "Luiz de Queiroz"
dc.source.none.fl_str_mv Scientia Agricola v.75 n.3 2018
reponame:Scientia Agrícola (Online)
instname:Universidade de São Paulo (USP)
instacron:USP
instname_str Universidade de São Paulo (USP)
instacron_str USP
institution USP
reponame_str Scientia Agrícola (Online)
collection Scientia Agrícola (Online)
repository.name.fl_str_mv Scientia Agrícola (Online) - Universidade de São Paulo (USP)
repository.mail.fl_str_mv scientia@usp.br||alleoni@usp.br
_version_ 1748936464706568192