Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.

Detalhes bibliográficos
Autor(a) principal: FARHATE, C. V. V.
Data de Publicação: 2018
Outros Autores: SOUZA, Z. M. de, OLIVEIRA, S. R. de M., TAVARES, R. L. M., CARVALHO, J. L. N.
Tipo de documento: Artigo
Idioma: eng
Título da fonte: Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
Texto Completo: http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160
https://doi.org/ 10.1371/journal.pone.0193537
Resumo: Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO2 emissions, in which the average CO2 emissions are 63 kg ha-1 day-1. The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO2 emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO2 emission classification were the following: (I)-the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)-the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO2 emission) and lower computational cost.
id EMBR_acff12318486662b64bc13e5b821835d
oai_identifier_str oai:www.alice.cnptia.embrapa.br:doc/1089160
network_acronym_str EMBR
network_name_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository_id_str 2154
spelling Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.Mineração de dadosEmissão de dióxido de carbonoManejo de cultivosCarbon dioxide emissionData miningCana de açúcarDióxido de carbonoCarbon dioxideSugarcaneCrop managementSoil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO2 emissions, in which the average CO2 emissions are 63 kg ha-1 day-1. The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO2 emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO2 emission classification were the following: (I)-the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)-the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO2 emission) and lower computational cost.Artigo e0193537.CAMILA VIANA VIEIRA FARHATE, Unicamp; ZIGOMAR MENEZES DE SOUZA, Unicamp; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; ROSE LUIZA MORAES TAVARES, Rio Verde University; JOÃO LUÍS NUNES CARVALHO, Brazilian Center for Research in Energy and Materials.FARHATE, C. V. V.SOUZA, Z. M. deOLIVEIRA, S. R. de M.TAVARES, R. L. M.CARVALHO, J. L. N.2018-03-15T00:40:37Z2018-03-15T00:40:37Z2018-03-1420182018-05-08T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlePlos One, v. 13, n. 3, p. 1-18, 2018.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160https://doi.org/ 10.1371/journal.pone.0193537enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2018-03-15T00:40:43Zoai:www.alice.cnptia.embrapa.br:doc/1089160Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542018-03-15T00:40:43falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542018-03-15T00:40:43Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false
dc.title.none.fl_str_mv Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
title Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
spellingShingle Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
FARHATE, C. V. V.
Mineração de dados
Emissão de dióxido de carbono
Manejo de cultivos
Carbon dioxide emission
Data mining
Cana de açúcar
Dióxido de carbono
Carbon dioxide
Sugarcane
Crop management
title_short Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
title_full Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
title_fullStr Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
title_full_unstemmed Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
title_sort Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
author FARHATE, C. V. V.
author_facet FARHATE, C. V. V.
SOUZA, Z. M. de
OLIVEIRA, S. R. de M.
TAVARES, R. L. M.
CARVALHO, J. L. N.
author_role author
author2 SOUZA, Z. M. de
OLIVEIRA, S. R. de M.
TAVARES, R. L. M.
CARVALHO, J. L. N.
author2_role author
author
author
author
dc.contributor.none.fl_str_mv CAMILA VIANA VIEIRA FARHATE, Unicamp; ZIGOMAR MENEZES DE SOUZA, Unicamp; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; ROSE LUIZA MORAES TAVARES, Rio Verde University; JOÃO LUÍS NUNES CARVALHO, Brazilian Center for Research in Energy and Materials.
dc.contributor.author.fl_str_mv FARHATE, C. V. V.
SOUZA, Z. M. de
OLIVEIRA, S. R. de M.
TAVARES, R. L. M.
CARVALHO, J. L. N.
dc.subject.por.fl_str_mv Mineração de dados
Emissão de dióxido de carbono
Manejo de cultivos
Carbon dioxide emission
Data mining
Cana de açúcar
Dióxido de carbono
Carbon dioxide
Sugarcane
Crop management
topic Mineração de dados
Emissão de dióxido de carbono
Manejo de cultivos
Carbon dioxide emission
Data mining
Cana de açúcar
Dióxido de carbono
Carbon dioxide
Sugarcane
Crop management
description Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO2 emissions, in which the average CO2 emissions are 63 kg ha-1 day-1. The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO2 emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO2 emission classification were the following: (I)-the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)-the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO2 emission) and lower computational cost.
publishDate 2018
dc.date.none.fl_str_mv 2018-03-15T00:40:37Z
2018-03-15T00:40:37Z
2018-03-14
2018
2018-05-08T11:11:11Z
dc.type.driver.fl_str_mv info:eu-repo/semantics/publishedVersion
info:eu-repo/semantics/article
format article
status_str publishedVersion
dc.identifier.uri.fl_str_mv Plos One, v. 13, n. 3, p. 1-18, 2018.
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160
https://doi.org/ 10.1371/journal.pone.0193537
identifier_str_mv Plos One, v. 13, n. 3, p. 1-18, 2018.
url http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160
https://doi.org/ 10.1371/journal.pone.0193537
dc.language.iso.fl_str_mv eng
language eng
dc.rights.driver.fl_str_mv info:eu-repo/semantics/openAccess
eu_rights_str_mv openAccess
dc.source.none.fl_str_mv reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron:EMBRAPA
instname_str Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
instacron_str EMBRAPA
institution EMBRAPA
reponame_str Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
collection Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)
repository.name.fl_str_mv Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)
repository.mail.fl_str_mv cg-riaa@embrapa.br
_version_ 1794503450941194240