Cargando…

Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil

BACKGROUND: Biomass models are useful for several purposes, especially for quantifying carbon stocks and dynamics in forests. Selecting appropriate equations from a fitted model is a process which can involves several criteria, some widely used and others used to a lesser extent. This study analyzes...

Descripción completa

Detalles Bibliográficos
Autores principales: Sanquetta, Carlos Roberto, Dalla Corte, Ana Paula, Behling, Alexandre, de Oliveira Piva, Luani Rosa, Péllico Netto, Sylvio, Rodrigues, Aurélio Lourenço, Sanquetta, Mateus Niroh Inoue
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6286299/
https://www.ncbi.nlm.nih.gov/pubmed/30535635
http://dx.doi.org/10.1186/s13021-018-0112-6
_version_ 1783379424498941952
author Sanquetta, Carlos Roberto
Dalla Corte, Ana Paula
Behling, Alexandre
de Oliveira Piva, Luani Rosa
Péllico Netto, Sylvio
Rodrigues, Aurélio Lourenço
Sanquetta, Mateus Niroh Inoue
author_facet Sanquetta, Carlos Roberto
Dalla Corte, Ana Paula
Behling, Alexandre
de Oliveira Piva, Luani Rosa
Péllico Netto, Sylvio
Rodrigues, Aurélio Lourenço
Sanquetta, Mateus Niroh Inoue
author_sort Sanquetta, Carlos Roberto
collection PubMed
description BACKGROUND: Biomass models are useful for several purposes, especially for quantifying carbon stocks and dynamics in forests. Selecting appropriate equations from a fitted model is a process which can involves several criteria, some widely used and others used to a lesser extent. This study analyzes six selection criteria for models fitted to six sets of individual biomass collected from woody indigenous species of the Tropical Atlantic Rain Forest in Brazil. Six models were examined and the respective fitted equations evaluated by the residual sum of squares, adjusted coefficient of determination, absolute and relative estimates of the standard error of estimate, and Akaike and Schwartz (Bayesian) information criteria. The aim of this study was to analyze the numeric behavior of these model selection criteria and discuss the ease of interpretation of them. The importance of residual analysis in model selection is stressed. RESULTS: The adjusted coefficient of determination ([Formula: see text] ) and the standard error of estimate in percentage (Syx%) are relative model selection criteria and are not affected by sample size and scale of the response variable. The sum of squared residuals (SSR), the absolute standard error of estimate (Syx), the Akaike information criterion and the Schwartz information criterion, in turn, depend on these quantities. The best fit model was always the same within a given data set regardless the model selection criteria considered (except for SSR in two cases), indicating they tend to converge to a common result. However, such criteria are not always closely related across different data sets. General model selection criteria are indicative of the average goodness of fit, but do not capture bias and outlier effects. Graphical residual analysis is a useful tool to this detection and must always be used in model selection. CONCLUSIONS: It is concluded that the criteria for model selection tend to lead to a common result, regardless their mathematical formulation and statistical significance. Relative measures of goodness of fitting are easier to interpret than the absolute ones. Careful graphical residual analysis must always be used to confirm the performance of the models.
format Online
Article
Text
id pubmed-6286299
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-62862992018-12-26 Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil Sanquetta, Carlos Roberto Dalla Corte, Ana Paula Behling, Alexandre de Oliveira Piva, Luani Rosa Péllico Netto, Sylvio Rodrigues, Aurélio Lourenço Sanquetta, Mateus Niroh Inoue Carbon Balance Manag Review BACKGROUND: Biomass models are useful for several purposes, especially for quantifying carbon stocks and dynamics in forests. Selecting appropriate equations from a fitted model is a process which can involves several criteria, some widely used and others used to a lesser extent. This study analyzes six selection criteria for models fitted to six sets of individual biomass collected from woody indigenous species of the Tropical Atlantic Rain Forest in Brazil. Six models were examined and the respective fitted equations evaluated by the residual sum of squares, adjusted coefficient of determination, absolute and relative estimates of the standard error of estimate, and Akaike and Schwartz (Bayesian) information criteria. The aim of this study was to analyze the numeric behavior of these model selection criteria and discuss the ease of interpretation of them. The importance of residual analysis in model selection is stressed. RESULTS: The adjusted coefficient of determination ([Formula: see text] ) and the standard error of estimate in percentage (Syx%) are relative model selection criteria and are not affected by sample size and scale of the response variable. The sum of squared residuals (SSR), the absolute standard error of estimate (Syx), the Akaike information criterion and the Schwartz information criterion, in turn, depend on these quantities. The best fit model was always the same within a given data set regardless the model selection criteria considered (except for SSR in two cases), indicating they tend to converge to a common result. However, such criteria are not always closely related across different data sets. General model selection criteria are indicative of the average goodness of fit, but do not capture bias and outlier effects. Graphical residual analysis is a useful tool to this detection and must always be used in model selection. CONCLUSIONS: It is concluded that the criteria for model selection tend to lead to a common result, regardless their mathematical formulation and statistical significance. Relative measures of goodness of fitting are easier to interpret than the absolute ones. Careful graphical residual analysis must always be used to confirm the performance of the models. Springer International Publishing 2018-12-07 /pmc/articles/PMC6286299/ /pubmed/30535635 http://dx.doi.org/10.1186/s13021-018-0112-6 Text en © The Author(s) 2018 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.
spellingShingle Review
Sanquetta, Carlos Roberto
Dalla Corte, Ana Paula
Behling, Alexandre
de Oliveira Piva, Luani Rosa
Péllico Netto, Sylvio
Rodrigues, Aurélio Lourenço
Sanquetta, Mateus Niroh Inoue
Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil
title Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil
title_full Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil
title_fullStr Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil
title_full_unstemmed Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil
title_short Selection criteria for linear regression models to estimate individual tree biomasses in the Atlantic Rain Forest, Brazil
title_sort selection criteria for linear regression models to estimate individual tree biomasses in the atlantic rain forest, brazil
topic Review
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6286299/
https://www.ncbi.nlm.nih.gov/pubmed/30535635
http://dx.doi.org/10.1186/s13021-018-0112-6
work_keys_str_mv AT sanquettacarlosroberto selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil
AT dallacorteanapaula selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil
AT behlingalexandre selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil
AT deoliveirapivaluanirosa selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil
AT pelliconettosylvio selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil
AT rodriguesaureliolourenco selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil
AT sanquettamateusnirohinoue selectioncriteriaforlinearregressionmodelstoestimateindividualtreebiomassesintheatlanticrainforestbrazil