Cargando…
Comparison of data mining and allometric model in estimation of tree biomass
BACKGROUND: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assu...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4528850/ https://www.ncbi.nlm.nih.gov/pubmed/26250142 http://dx.doi.org/10.1186/s12859-015-0662-5 |
_version_ | 1782384715002544128 |
---|---|
author | Sanquetta, Carlos R. Wojciechowski, Jaime Dalla Corte, Ana P. Behling, Alexandre Péllico Netto, Sylvio Rodrigues, Aurélio L. Sanquetta, Mateus N. I. |
author_facet | Sanquetta, Carlos R. Wojciechowski, Jaime Dalla Corte, Ana P. Behling, Alexandre Péllico Netto, Sylvio Rodrigues, Aurélio L. Sanquetta, Mateus N. I. |
author_sort | Sanquetta, Carlos R. |
collection | PubMed |
description | BACKGROUND: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assumptions of regression are not always accomplished, bringing uncertainties to the inferences. This article demonstrates that the Data Mining (DM) technique can be used as an alternative to traditional regression approach to estimate tree biomass in the Atlantic Forest, providing better results than allometry, and demonstrating simplicity, versatility and flexibility to apply to a wide range of conditions. RESULTS: Various DM approaches were examined regarding distance, number of neighbors and weighting, by using 180 trees coming from environmental restoration plantations in the Atlantic Forest biome. The best results were attained using the Chebishev distance, 1/d weighting and 5 neighbors. Increasing number of neighbors did not improve estimates. We also analyze the effect of the size of data set and number of variables in the results. The complete data set and the maximum number of predicting variables provided the best fitting. We compare DM to Schumacher-Hall model and the results showed a gain of up to 16.5 % in reduction of the standard error of estimate. CONCLUSION: It was concluded that Data Mining can provide accurate estimates of tree biomass and can be successfully used for this purpose in environmental restoration plantations in the Atlantic Forest. This technique provides lower standard error of estimate than the Schumacher-Hall model and has the advantage of not requiring some statistical assumptions as do the regression models. Flexibility, versatility and simplicity are attributes of DM that corroborates its great potential for similar applications. |
format | Online Article Text |
id | pubmed-4528850 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-45288502015-08-08 Comparison of data mining and allometric model in estimation of tree biomass Sanquetta, Carlos R. Wojciechowski, Jaime Dalla Corte, Ana P. Behling, Alexandre Péllico Netto, Sylvio Rodrigues, Aurélio L. Sanquetta, Mateus N. I. BMC Bioinformatics Research Article BACKGROUND: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assumptions of regression are not always accomplished, bringing uncertainties to the inferences. This article demonstrates that the Data Mining (DM) technique can be used as an alternative to traditional regression approach to estimate tree biomass in the Atlantic Forest, providing better results than allometry, and demonstrating simplicity, versatility and flexibility to apply to a wide range of conditions. RESULTS: Various DM approaches were examined regarding distance, number of neighbors and weighting, by using 180 trees coming from environmental restoration plantations in the Atlantic Forest biome. The best results were attained using the Chebishev distance, 1/d weighting and 5 neighbors. Increasing number of neighbors did not improve estimates. We also analyze the effect of the size of data set and number of variables in the results. The complete data set and the maximum number of predicting variables provided the best fitting. We compare DM to Schumacher-Hall model and the results showed a gain of up to 16.5 % in reduction of the standard error of estimate. CONCLUSION: It was concluded that Data Mining can provide accurate estimates of tree biomass and can be successfully used for this purpose in environmental restoration plantations in the Atlantic Forest. This technique provides lower standard error of estimate than the Schumacher-Hall model and has the advantage of not requiring some statistical assumptions as do the regression models. Flexibility, versatility and simplicity are attributes of DM that corroborates its great potential for similar applications. BioMed Central 2015-08-07 /pmc/articles/PMC4528850/ /pubmed/26250142 http://dx.doi.org/10.1186/s12859-015-0662-5 Text en © Sanquetta et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Sanquetta, Carlos R. Wojciechowski, Jaime Dalla Corte, Ana P. Behling, Alexandre Péllico Netto, Sylvio Rodrigues, Aurélio L. Sanquetta, Mateus N. I. Comparison of data mining and allometric model in estimation of tree biomass |
title | Comparison of data mining and allometric model in estimation of tree biomass |
title_full | Comparison of data mining and allometric model in estimation of tree biomass |
title_fullStr | Comparison of data mining and allometric model in estimation of tree biomass |
title_full_unstemmed | Comparison of data mining and allometric model in estimation of tree biomass |
title_short | Comparison of data mining and allometric model in estimation of tree biomass |
title_sort | comparison of data mining and allometric model in estimation of tree biomass |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4528850/ https://www.ncbi.nlm.nih.gov/pubmed/26250142 http://dx.doi.org/10.1186/s12859-015-0662-5 |
work_keys_str_mv | AT sanquettacarlosr comparisonofdataminingandallometricmodelinestimationoftreebiomass AT wojciechowskijaime comparisonofdataminingandallometricmodelinestimationoftreebiomass AT dallacorteanap comparisonofdataminingandallometricmodelinestimationoftreebiomass AT behlingalexandre comparisonofdataminingandallometricmodelinestimationoftreebiomass AT pelliconettosylvio comparisonofdataminingandallometricmodelinestimationoftreebiomass AT rodriguesaureliol comparisonofdataminingandallometricmodelinestimationoftreebiomass AT sanquettamateusni comparisonofdataminingandallometricmodelinestimationoftreebiomass |