Cargando…

A hierarchical Bayesian approach for handling missing classification data

1. Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error...

Descripción completa

Detalles Bibliográficos
Autores principales: Ketz, Alison C., Johnson, Therese L., Hooten, Mevin B., Hobbs, N. Thompson
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434567/
https://www.ncbi.nlm.nih.gov/pubmed/30962886
http://dx.doi.org/10.1002/ece3.4927
_version_ 1783406498991308800
author Ketz, Alison C.
Johnson, Therese L.
Hooten, Mevin B.
Hobbs, N. Thompson
author_facet Ketz, Alison C.
Johnson, Therese L.
Hooten, Mevin B.
Hobbs, N. Thompson
author_sort Ketz, Alison C.
collection PubMed
description 1. Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error. When individuals are observed but not classified, these “partial” observations must be modified to include the missing data mechanism to avoid spurious inference. 2. We developed two hierarchical Bayesian models to overcome the assumption of perfect assignment to mutually exclusive categories in the multinomial distribution of categorical counts, when classifications are missing. These models incorporate auxiliary information to adjust the posterior distributions of the proportions of membership in categories. In one model, we use an empirical Bayes approach, where a subset of data from one year serves as a prior for the missing data the next. In the other approach, we use a small random sample of data within a year to inform the distribution of the missing data. 3. We performed a simulation to show the bias that occurs when partial observations were ignored and demonstrated the altered inference for the estimation of demographic ratios. We applied our models to demographic classifications of elk (Cervus elaphus nelsoni) to demonstrate improved inference for the proportions of sex and stage classes. 4. We developed multiple modeling approaches using a generalizable nested multinomial structure to account for partially observed data that were missing not at random for classification counts. Accounting for classification uncertainty is important to accurately understand the composition of populations and communities in ecological studies.
format Online
Article
Text
id pubmed-6434567
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-64345672019-04-08 A hierarchical Bayesian approach for handling missing classification data Ketz, Alison C. Johnson, Therese L. Hooten, Mevin B. Hobbs, N. Thompson Ecol Evol Original Research 1. Ecologists use classifications of individuals in categories to understand composition of populations and communities. These categories might be defined by demographics, functional traits, or species. Assignment of categories is often imperfect, but frequently treated as observations without error. When individuals are observed but not classified, these “partial” observations must be modified to include the missing data mechanism to avoid spurious inference. 2. We developed two hierarchical Bayesian models to overcome the assumption of perfect assignment to mutually exclusive categories in the multinomial distribution of categorical counts, when classifications are missing. These models incorporate auxiliary information to adjust the posterior distributions of the proportions of membership in categories. In one model, we use an empirical Bayes approach, where a subset of data from one year serves as a prior for the missing data the next. In the other approach, we use a small random sample of data within a year to inform the distribution of the missing data. 3. We performed a simulation to show the bias that occurs when partial observations were ignored and demonstrated the altered inference for the estimation of demographic ratios. We applied our models to demographic classifications of elk (Cervus elaphus nelsoni) to demonstrate improved inference for the proportions of sex and stage classes. 4. We developed multiple modeling approaches using a generalizable nested multinomial structure to account for partially observed data that were missing not at random for classification counts. Accounting for classification uncertainty is important to accurately understand the composition of populations and communities in ecological studies. John Wiley and Sons Inc. 2019-03-02 /pmc/articles/PMC6434567/ /pubmed/30962886 http://dx.doi.org/10.1002/ece3.4927 Text en © 2019 The Authors. Ecology and Evolution published by John Wiley & Sons Ltd. This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Research
Ketz, Alison C.
Johnson, Therese L.
Hooten, Mevin B.
Hobbs, N. Thompson
A hierarchical Bayesian approach for handling missing classification data
title A hierarchical Bayesian approach for handling missing classification data
title_full A hierarchical Bayesian approach for handling missing classification data
title_fullStr A hierarchical Bayesian approach for handling missing classification data
title_full_unstemmed A hierarchical Bayesian approach for handling missing classification data
title_short A hierarchical Bayesian approach for handling missing classification data
title_sort hierarchical bayesian approach for handling missing classification data
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6434567/
https://www.ncbi.nlm.nih.gov/pubmed/30962886
http://dx.doi.org/10.1002/ece3.4927
work_keys_str_mv AT ketzalisonc ahierarchicalbayesianapproachforhandlingmissingclassificationdata
AT johnsontheresel ahierarchicalbayesianapproachforhandlingmissingclassificationdata
AT hootenmevinb ahierarchicalbayesianapproachforhandlingmissingclassificationdata
AT hobbsnthompson ahierarchicalbayesianapproachforhandlingmissingclassificationdata
AT ketzalisonc hierarchicalbayesianapproachforhandlingmissingclassificationdata
AT johnsontheresel hierarchicalbayesianapproachforhandlingmissingclassificationdata
AT hootenmevinb hierarchicalbayesianapproachforhandlingmissingclassificationdata
AT hobbsnthompson hierarchicalbayesianapproachforhandlingmissingclassificationdata