Cargando…

Comparison of data mining and allometric model in estimation of tree biomass

BACKGROUND: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assu...

Descripción completa

Detalles Bibliográficos
Autores principales: Sanquetta, Carlos R., Wojciechowski, Jaime, Dalla Corte, Ana P., Behling, Alexandre, Péllico Netto, Sylvio, Rodrigues, Aurélio L., Sanquetta, Mateus N. I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4528850/
https://www.ncbi.nlm.nih.gov/pubmed/26250142
http://dx.doi.org/10.1186/s12859-015-0662-5
_version_ 1782384715002544128
author Sanquetta, Carlos R.
Wojciechowski, Jaime
Dalla Corte, Ana P.
Behling, Alexandre
Péllico Netto, Sylvio
Rodrigues, Aurélio L.
Sanquetta, Mateus N. I.
author_facet Sanquetta, Carlos R.
Wojciechowski, Jaime
Dalla Corte, Ana P.
Behling, Alexandre
Péllico Netto, Sylvio
Rodrigues, Aurélio L.
Sanquetta, Mateus N. I.
author_sort Sanquetta, Carlos R.
collection PubMed
description BACKGROUND: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assumptions of regression are not always accomplished, bringing uncertainties to the inferences. This article demonstrates that the Data Mining (DM) technique can be used as an alternative to traditional regression approach to estimate tree biomass in the Atlantic Forest, providing better results than allometry, and demonstrating simplicity, versatility and flexibility to apply to a wide range of conditions. RESULTS: Various DM approaches were examined regarding distance, number of neighbors and weighting, by using 180 trees coming from environmental restoration plantations in the Atlantic Forest biome. The best results were attained using the Chebishev distance, 1/d weighting and 5 neighbors. Increasing number of neighbors did not improve estimates. We also analyze the effect of the size of data set and number of variables in the results. The complete data set and the maximum number of predicting variables provided the best fitting. We compare DM to Schumacher-Hall model and the results showed a gain of up to 16.5 % in reduction of the standard error of estimate. CONCLUSION: It was concluded that Data Mining can provide accurate estimates of tree biomass and can be successfully used for this purpose in environmental restoration plantations in the Atlantic Forest. This technique provides lower standard error of estimate than the Schumacher-Hall model and has the advantage of not requiring some statistical assumptions as do the regression models. Flexibility, versatility and simplicity are attributes of DM that corroborates its great potential for similar applications.
format Online
Article
Text
id pubmed-4528850
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-45288502015-08-08 Comparison of data mining and allometric model in estimation of tree biomass Sanquetta, Carlos R. Wojciechowski, Jaime Dalla Corte, Ana P. Behling, Alexandre Péllico Netto, Sylvio Rodrigues, Aurélio L. Sanquetta, Mateus N. I. BMC Bioinformatics Research Article BACKGROUND: The traditional method used to estimate tree biomass is allometry. In this method, models are tested and equations fitted by regression usually applying ordinary least squares, though other analogous methods are also used for this purpose. Due to the nature of tree biomass data, the assumptions of regression are not always accomplished, bringing uncertainties to the inferences. This article demonstrates that the Data Mining (DM) technique can be used as an alternative to traditional regression approach to estimate tree biomass in the Atlantic Forest, providing better results than allometry, and demonstrating simplicity, versatility and flexibility to apply to a wide range of conditions. RESULTS: Various DM approaches were examined regarding distance, number of neighbors and weighting, by using 180 trees coming from environmental restoration plantations in the Atlantic Forest biome. The best results were attained using the Chebishev distance, 1/d weighting and 5 neighbors. Increasing number of neighbors did not improve estimates. We also analyze the effect of the size of data set and number of variables in the results. The complete data set and the maximum number of predicting variables provided the best fitting. We compare DM to Schumacher-Hall model and the results showed a gain of up to 16.5 % in reduction of the standard error of estimate. CONCLUSION: It was concluded that Data Mining can provide accurate estimates of tree biomass and can be successfully used for this purpose in environmental restoration plantations in the Atlantic Forest. This technique provides lower standard error of estimate than the Schumacher-Hall model and has the advantage of not requiring some statistical assumptions as do the regression models. Flexibility, versatility and simplicity are attributes of DM that corroborates its great potential for similar applications. BioMed Central 2015-08-07 /pmc/articles/PMC4528850/ /pubmed/26250142 http://dx.doi.org/10.1186/s12859-015-0662-5 Text en © Sanquetta et al. 2015 Open Access This article is distributed under the terms of the Creative Commons Attribution License http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Sanquetta, Carlos R.
Wojciechowski, Jaime
Dalla Corte, Ana P.
Behling, Alexandre
Péllico Netto, Sylvio
Rodrigues, Aurélio L.
Sanquetta, Mateus N. I.
Comparison of data mining and allometric model in estimation of tree biomass
title Comparison of data mining and allometric model in estimation of tree biomass
title_full Comparison of data mining and allometric model in estimation of tree biomass
title_fullStr Comparison of data mining and allometric model in estimation of tree biomass
title_full_unstemmed Comparison of data mining and allometric model in estimation of tree biomass
title_short Comparison of data mining and allometric model in estimation of tree biomass
title_sort comparison of data mining and allometric model in estimation of tree biomass
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4528850/
https://www.ncbi.nlm.nih.gov/pubmed/26250142
http://dx.doi.org/10.1186/s12859-015-0662-5
work_keys_str_mv AT sanquettacarlosr comparisonofdataminingandallometricmodelinestimationoftreebiomass
AT wojciechowskijaime comparisonofdataminingandallometricmodelinestimationoftreebiomass
AT dallacorteanap comparisonofdataminingandallometricmodelinestimationoftreebiomass
AT behlingalexandre comparisonofdataminingandallometricmodelinestimationoftreebiomass
AT pelliconettosylvio comparisonofdataminingandallometricmodelinestimationoftreebiomass
AT rodriguesaureliol comparisonofdataminingandallometricmodelinestimationoftreebiomass
AT sanquettamateusni comparisonofdataminingandallometricmodelinestimationoftreebiomass