Cargando…
Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution?
BACKGROUND: Community assembly by trait selection (CATS) allows for the detection of environmental filtering and estimation of the relative role of local and regional (meta-community-level) effects on community composition from trait and abundance data without using environmental data. It has been s...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
PeerJ Inc.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8763042/ https://www.ncbi.nlm.nih.gov/pubmed/35174013 http://dx.doi.org/10.7717/peerj.12763 |
_version_ | 1784633872753885184 |
---|---|
author | Botta-Dukát, Zoltán |
author_facet | Botta-Dukát, Zoltán |
author_sort | Botta-Dukát, Zoltán |
collection | PubMed |
description | BACKGROUND: Community assembly by trait selection (CATS) allows for the detection of environmental filtering and estimation of the relative role of local and regional (meta-community-level) effects on community composition from trait and abundance data without using environmental data. It has been shown that Poisson regression of abundances against trait data results in the same parameter estimates. Abundance data do not necessarily follow a Poisson distribution, and in these cases, other generalized linear models should be fitted to obtain unbiased parameter estimates. AIMS: This paper discusses how the original algorithm for calculating the relative role of local and regional effects has to be modified if Poisson model is not appropriate. RESULTS: It can be shown that the use of the logarithm of regional relative abundances as an offset is appropriate only if a log-link function is applied. Otherwise, the link function should be applied to the product of local total abundance and regional relative abundances. Since this product may be outside the domain of the link function, the use of log-link is recommended, even if it is not the canonical link. An algorithm is also suggested for calculating the offset when data are zero-inflated. The relative role of local and regional effects is measured by Kullback-Leibler R(2). The formula for this measure presented by Shipley (2014) is valid only if the abundances follow a Poisson distribution. Otherwise, slightly different formulas have to be applied. Beyond theoretical considerations, the proposed refinements are illustrated by numerical examples. CATS regression could be a useful tool for community ecologists, but it has to be slightly modified when abundance data do not follow a Poisson distribution. This paper gives detailed instructions on the necessary refinement. |
format | Online Article Text |
id | pubmed-8763042 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | PeerJ Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-87630422022-02-15 Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? Botta-Dukát, Zoltán PeerJ Ecology BACKGROUND: Community assembly by trait selection (CATS) allows for the detection of environmental filtering and estimation of the relative role of local and regional (meta-community-level) effects on community composition from trait and abundance data without using environmental data. It has been shown that Poisson regression of abundances against trait data results in the same parameter estimates. Abundance data do not necessarily follow a Poisson distribution, and in these cases, other generalized linear models should be fitted to obtain unbiased parameter estimates. AIMS: This paper discusses how the original algorithm for calculating the relative role of local and regional effects has to be modified if Poisson model is not appropriate. RESULTS: It can be shown that the use of the logarithm of regional relative abundances as an offset is appropriate only if a log-link function is applied. Otherwise, the link function should be applied to the product of local total abundance and regional relative abundances. Since this product may be outside the domain of the link function, the use of log-link is recommended, even if it is not the canonical link. An algorithm is also suggested for calculating the offset when data are zero-inflated. The relative role of local and regional effects is measured by Kullback-Leibler R(2). The formula for this measure presented by Shipley (2014) is valid only if the abundances follow a Poisson distribution. Otherwise, slightly different formulas have to be applied. Beyond theoretical considerations, the proposed refinements are illustrated by numerical examples. CATS regression could be a useful tool for community ecologists, but it has to be slightly modified when abundance data do not follow a Poisson distribution. This paper gives detailed instructions on the necessary refinement. PeerJ Inc. 2022-01-14 /pmc/articles/PMC8763042/ /pubmed/35174013 http://dx.doi.org/10.7717/peerj.12763 Text en ©2022 Botta-Dukát https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited. |
spellingShingle | Ecology Botta-Dukát, Zoltán Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? |
title | Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? |
title_full | Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? |
title_fullStr | Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? |
title_full_unstemmed | Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? |
title_short | Devil in the details: how can we avoid potential pitfalls of CATS regression when our data do not follow a Poisson distribution? |
title_sort | devil in the details: how can we avoid potential pitfalls of cats regression when our data do not follow a poisson distribution? |
topic | Ecology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8763042/ https://www.ncbi.nlm.nih.gov/pubmed/35174013 http://dx.doi.org/10.7717/peerj.12763 |
work_keys_str_mv | AT bottadukatzoltan devilinthedetailshowcanweavoidpotentialpitfallsofcatsregressionwhenourdatadonotfollowapoissondistribution |