Cargando…
A hierarchical Bayesian approach for handling missing classification data
1. Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434567/ https://www.ncbi.nlm.nih.gov/pubmed/30962886 http://dx.doi.org/10.1002/ece3.4927 |
_version_ | 1783406498991308800 |
---|---|
author | Ketz, Alison C. Johnson, Therese L. Hooten, Mevin B. Hobbs, N. Thompson |
author_facet | Ketz, Alison C. Johnson, Therese L. Hooten, Mevin B. Hobbs, N. Thompson |
author_sort | Ketz, Alison C. |
collection | PubMed |
description | 1. Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error. When individuals are observed but not classified, these “partial” observations must be modified to include the missing data mechanism to avoid spurious inference. 2. We developed two hierarchical Bayesian models to overcome the assumption of perfect assignment to mutually exclusive categories in the multinomial distribution of categorical counts, when classifications are missing. These models incorporate auxiliary information to adjust the posterior distributions of the proportions of membership in categories. In one model, we use an empirical Bayes approach, where a subset of data from one year serves as a prior for the missing data the next. In the other approach, we use a small random sample of data within a year to inform the distribution of the missing data. 3. We performed a simulation to show the bias that occurs when partial observations were ignored and demonstrated the altered inference for the estimation of demographic ratios. We applied our models to demographic classifications of elk (Cervus elaphus nelsoni) to demonstrate improved inference for the proportions of sex and stage classes. 4. We developed multiple modeling approaches using a generalizable nested multinomial structure to account for partially observed data that were missing not at random for classification counts. Accounting for classification uncertainty is important to accurately understand the composition of populations and communities in ecological studies. |
format | Online Article Text |
id | pubmed-6434567 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-64345672019-04-08 A hierarchical Bayesian approach for handling missing classification data Ketz, Alison C. Johnson, Therese L. Hooten, Mevin B. Hobbs, N. Thompson Ecol Evol Original Research 1. Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error. When individuals are observed but not classified, these “partial” observations must be modified to include the missing data mechanism to avoid spurious inference. 2. We developed two hierarchical Bayesian models to overcome the assumption of perfect assignment to mutually exclusive categories in the multinomial distribution of categorical counts, when classifications are missing. These models incorporate auxiliary information to adjust the posterior distributions of the proportions of membership in categories. In one model, we use an empirical Bayes approach, where a subset of data from one year serves as a prior for the missing data the next. In the other approach, we use a small random sample of data within a year to inform the distribution of the missing data. 3. We performed a simulation to show the bias that occurs when partial observations were ignored and demonstrated the altered inference for the estimation of demographic ratios. We applied our models to demographic classifications of elk (Cervus elaphus nelsoni) to demonstrate improved inference for the proportions of sex and stage classes. 4. We developed multiple modeling approaches using a generalizable nested multinomial structure to account for partially observed data that were missing not at random for classification counts. Accounting for classification uncertainty is important to accurately understand the composition of populations and communities in ecological studies. John Wiley and Sons Inc. 2019-03-02 /pmc/articles/PMC6434567/ /pubmed/30962886 http://dx.doi.org/10.1002/ece3.4927 Text en © 2019 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Research Ketz, Alison C. Johnson, Therese L. Hooten, Mevin B. Hobbs, N. Thompson A hierarchical Bayesian approach for handling missing classification data |
title | A hierarchical Bayesian approach for handling missing classification data |
title_full | A hierarchical Bayesian approach for handling missing classification data |
title_fullStr | A hierarchical Bayesian approach for handling missing classification data |
title_full_unstemmed | A hierarchical Bayesian approach for handling missing classification data |
title_short | A hierarchical Bayesian approach for handling missing classification data |
title_sort | hierarchical bayesian approach for handling missing classification data |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434567/ https://www.ncbi.nlm.nih.gov/pubmed/30962886 http://dx.doi.org/10.1002/ece3.4927 |
work_keys_str_mv | AT ketzalisonc ahierarchicalbayesianapproachforhandlingmissingclassificationdata AT johnsontheresel ahierarchicalbayesianapproachforhandlingmissingclassificationdata AT hootenmevinb ahierarchicalbayesianapproachforhandlingmissingclassificationdata AT hobbsnthompson ahierarchicalbayesianapproachforhandlingmissingclassificationdata AT ketzalisonc hierarchicalbayesianapproachforhandlingmissingclassificationdata AT johnsontheresel hierarchicalbayesianapproachforhandlingmissingclassificationdata AT hootenmevinb hierarchicalbayesianapproachforhandlingmissingclassificationdata AT hobbsnthompson hierarchicalbayesianapproachforhandlingmissingclassificationdata |