Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.
Autor(a) principal: | |
---|---|
Data de Publicação: | 2018 |
Outros Autores: | , , , |
Tipo de documento: | Artigo |
Idioma: | eng |
Título da fonte: | Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
Texto Completo: | http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160 https://doi.org/ 10.1371/journal.pone.0193537 |
Resumo: | Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO2 emissions, in which the average CO2 emissions are 63 kg ha-1 day-1. The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO2 emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO2 emission classification were the following: (I)-the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)-the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO2 emission) and lower computational cost. |
id |
EMBR_acff12318486662b64bc13e5b821835d |
---|---|
oai_identifier_str |
oai:www.alice.cnptia.embrapa.br:doc/1089160 |
network_acronym_str |
EMBR |
network_name_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository_id_str |
2154 |
spelling |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field.Mineração de dadosEmissão de dióxido de carbonoManejo de cultivosCarbon dioxide emissionData miningCana de açúcarDióxido de carbonoCarbon dioxideSugarcaneCrop managementSoil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO2 emissions, in which the average CO2 emissions are 63 kg ha-1 day-1. The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO2 emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO2 emission classification were the following: (I)-the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)-the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO2 emission) and lower computational cost.Artigo e0193537.CAMILA VIANA VIEIRA FARHATE, Unicamp; ZIGOMAR MENEZES DE SOUZA, Unicamp; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; ROSE LUIZA MORAES TAVARES, Rio Verde University; JOÃO LUÍS NUNES CARVALHO, Brazilian Center for Research in Energy and Materials.FARHATE, C. V. V.SOUZA, Z. M. deOLIVEIRA, S. R. de M.TAVARES, R. L. M.CARVALHO, J. L. N.2018-03-15T00:40:37Z2018-03-15T00:40:37Z2018-03-1420182018-05-08T11:11:11Zinfo:eu-repo/semantics/publishedVersioninfo:eu-repo/semantics/articlePlos One, v. 13, n. 3, p. 1-18, 2018.http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160https://doi.org/ 10.1371/journal.pone.0193537enginfo:eu-repo/semantics/openAccessreponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice)instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa)instacron:EMBRAPA2018-03-15T00:40:43Zoai:www.alice.cnptia.embrapa.br:doc/1089160Repositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestopendoar:21542018-03-15T00:40:43falseRepositório InstitucionalPUBhttps://www.alice.cnptia.embrapa.br/oai/requestcg-riaa@embrapa.bropendoar:21542018-03-15T00:40:43Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa)false |
dc.title.none.fl_str_mv |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
title |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
spellingShingle |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. FARHATE, C. V. V. Mineração de dados Emissão de dióxido de carbono Manejo de cultivos Carbon dioxide emission Data mining Cana de açúcar Dióxido de carbono Carbon dioxide Sugarcane Crop management |
title_short |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
title_full |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
title_fullStr |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
title_full_unstemmed |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
title_sort |
Use of data mining techniques to classify soil CO2 emission induced by crop management in sugarcane field. |
author |
FARHATE, C. V. V. |
author_facet |
FARHATE, C. V. V. SOUZA, Z. M. de OLIVEIRA, S. R. de M. TAVARES, R. L. M. CARVALHO, J. L. N. |
author_role |
author |
author2 |
SOUZA, Z. M. de OLIVEIRA, S. R. de M. TAVARES, R. L. M. CARVALHO, J. L. N. |
author2_role |
author author author author |
dc.contributor.none.fl_str_mv |
CAMILA VIANA VIEIRA FARHATE, Unicamp; ZIGOMAR MENEZES DE SOUZA, Unicamp; STANLEY ROBSON DE MEDEIROS OLIVEIRA, CNPTIA; ROSE LUIZA MORAES TAVARES, Rio Verde University; JOÃO LUÍS NUNES CARVALHO, Brazilian Center for Research in Energy and Materials. |
dc.contributor.author.fl_str_mv |
FARHATE, C. V. V. SOUZA, Z. M. de OLIVEIRA, S. R. de M. TAVARES, R. L. M. CARVALHO, J. L. N. |
dc.subject.por.fl_str_mv |
Mineração de dados Emissão de dióxido de carbono Manejo de cultivos Carbon dioxide emission Data mining Cana de açúcar Dióxido de carbono Carbon dioxide Sugarcane Crop management |
topic |
Mineração de dados Emissão de dióxido de carbono Manejo de cultivos Carbon dioxide emission Data mining Cana de açúcar Dióxido de carbono Carbon dioxide Sugarcane Crop management |
description |
Soil CO2 emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO2 concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO2 emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO2 emissions, in which the average CO2 emissions are 63 kg ha-1 day-1. The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO2 emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO2 emission classification were the following: (I)-the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)-the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO2 emission) and lower computational cost. |
publishDate |
2018 |
dc.date.none.fl_str_mv |
2018-03-15T00:40:37Z 2018-03-15T00:40:37Z 2018-03-14 2018 2018-05-08T11:11:11Z |
dc.type.driver.fl_str_mv |
info:eu-repo/semantics/publishedVersion info:eu-repo/semantics/article |
format |
article |
status_str |
publishedVersion |
dc.identifier.uri.fl_str_mv |
Plos One, v. 13, n. 3, p. 1-18, 2018. http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160 https://doi.org/ 10.1371/journal.pone.0193537 |
identifier_str_mv |
Plos One, v. 13, n. 3, p. 1-18, 2018. |
url |
http://www.alice.cnptia.embrapa.br/alice/handle/doc/1089160 https://doi.org/ 10.1371/journal.pone.0193537 |
dc.language.iso.fl_str_mv |
eng |
language |
eng |
dc.rights.driver.fl_str_mv |
info:eu-repo/semantics/openAccess |
eu_rights_str_mv |
openAccess |
dc.source.none.fl_str_mv |
reponame:Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) instname:Empresa Brasileira de Pesquisa Agropecuária (Embrapa) instacron:EMBRAPA |
instname_str |
Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
instacron_str |
EMBRAPA |
institution |
EMBRAPA |
reponame_str |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
collection |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) |
repository.name.fl_str_mv |
Repositório Institucional da EMBRAPA (Repository Open Access to Scientific Information from EMBRAPA - Alice) - Empresa Brasileira de Pesquisa Agropecuária (Embrapa) |
repository.mail.fl_str_mv |
cg-riaa@embrapa.br |
_version_ |
1794503450941194240 |