Cargando…
Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field
Soil CO(2) emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO(2) concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the deve...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5841768/ https://www.ncbi.nlm.nih.gov/pubmed/29513765 http://dx.doi.org/10.1371/journal.pone.0193537 |
_version_ | 1783304793144426496 |
---|---|
author | Farhate, Camila Viana Vieira de Souza, Zigomar Menezes Oliveira, Stanley Robson de Medeiros Tavares, Rose Luiza Moraes Carvalho, João Luís Nunes |
author_facet | Farhate, Camila Viana Vieira de Souza, Zigomar Menezes Oliveira, Stanley Robson de Medeiros Tavares, Rose Luiza Moraes Carvalho, João Luís Nunes |
author_sort | Farhate, Camila Viana Vieira |
collection | PubMed |
description | Soil CO(2) emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO(2) concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO(2) emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO(2) emissions, in which the average CO(2) emissions are 63 kg ha(-1) day(-1). The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO(2) emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO(2) emission classification were the following: (I)–the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)–the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO(2) emission) and lower computational cost. |
format | Online Article Text |
id | pubmed-5841768 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-58417682018-03-23 Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field Farhate, Camila Viana Vieira de Souza, Zigomar Menezes Oliveira, Stanley Robson de Medeiros Tavares, Rose Luiza Moraes Carvalho, João Luís Nunes PLoS One Research Article Soil CO(2) emissions are regarded as one of the largest flows of the global carbon cycle and small changes in their magnitude can have a large effect on the CO(2) concentration in the atmosphere. Thus, a better understanding of this attribute would enable the identification of promoters and the development of strategies to mitigate the risks of climate change. Therefore, our study aimed at using data mining techniques to predict the soil CO(2) emission induced by crop management in sugarcane areas in Brazil. To do so, we used different variable selection methods (correlation, chi-square, wrapper) and classification (Decision tree, Bayesian models, neural networks, support vector machine, bagging with logistic regression), and finally we tested the efficiency of different approaches through the Receiver Operating Characteristic (ROC) curve. The original dataset consisted of 19 variables (18 independent variables and one dependent (or response) variable). The association between cover crop and minimum tillage are effective strategies to promote the mitigation of soil CO(2) emissions, in which the average CO(2) emissions are 63 kg ha(-1) day(-1). The variables soil moisture, soil temperature (Ts), rainfall, pH, and organic carbon were most frequently selected for soil CO(2) emission classification using different methods for attribute selection. According to the results of the ROC curve, the best approaches for soil CO(2) emission classification were the following: (I)–the Multilayer Perceptron classifier with attribute selection through the wrapper method, that presented rate of false positive of 13,50%, true positive of 94,20% area under the curve (AUC) of 89,90% (II)–the Bagging classifier with logistic regression with attribute selection through the Chi-square method, that presented rate of false positive of 13,50%, true positive of 94,20% AUC of 89,90%. However, the (I) approach stands out in relation to (II) for its higher positive class accuracy (high CO(2) emission) and lower computational cost. Public Library of Science 2018-03-07 /pmc/articles/PMC5841768/ /pubmed/29513765 http://dx.doi.org/10.1371/journal.pone.0193537 Text en © 2018 Farhate et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Farhate, Camila Viana Vieira de Souza, Zigomar Menezes Oliveira, Stanley Robson de Medeiros Tavares, Rose Luiza Moraes Carvalho, João Luís Nunes Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field |
title | Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field |
title_full | Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field |
title_fullStr | Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field |
title_full_unstemmed | Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field |
title_short | Use of data mining techniques to classify soil CO(2) emission induced by crop management in sugarcane field |
title_sort | use of data mining techniques to classify soil co(2) emission induced by crop management in sugarcane field |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5841768/ https://www.ncbi.nlm.nih.gov/pubmed/29513765 http://dx.doi.org/10.1371/journal.pone.0193537 |
work_keys_str_mv | AT farhatecamilavianavieira useofdataminingtechniquestoclassifysoilco2emissioninducedbycropmanagementinsugarcanefield AT desouzazigomarmenezes useofdataminingtechniquestoclassifysoilco2emissioninducedbycropmanagementinsugarcanefield AT oliveirastanleyrobsondemedeiros useofdataminingtechniquestoclassifysoilco2emissioninducedbycropmanagementinsugarcanefield AT tavaresroseluizamoraes useofdataminingtechniquestoclassifysoilco2emissioninducedbycropmanagementinsugarcanefield AT carvalhojoaoluisnunes useofdataminingtechniquestoclassifysoilco2emissioninducedbycropmanagementinsugarcanefield |