Cargando…
Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, an...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677568/ https://www.ncbi.nlm.nih.gov/pubmed/33989428 http://dx.doi.org/10.1093/sysbio/syab037 |
_version_ | 1784616167241940992 |
---|---|
author | Parag, Kris V Pybus, Oliver G Wu, Chieh-Hsi |
author_facet | Parag, Kris V Pybus, Oliver G Wu, Chieh-Hsi |
author_sort | Parag, Kris V |
collection | PubMed |
description | In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, [Formula: see text] , to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using [Formula: see text] we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose [Formula: see text] as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.] |
format | Online Article Text |
id | pubmed-8677568 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-86775682021-12-17 Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? Parag, Kris V Pybus, Oliver G Wu, Chieh-Hsi Syst Biol Regular Articles In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, [Formula: see text] , to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using [Formula: see text] we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose [Formula: see text] as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.] Oxford University Press 2021-05-13 /pmc/articles/PMC8677568/ /pubmed/33989428 http://dx.doi.org/10.1093/sysbio/syab037 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Regular Articles Parag, Kris V Pybus, Oliver G Wu, Chieh-Hsi Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? |
title | Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? |
title_full | Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? |
title_fullStr | Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? |
title_full_unstemmed | Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? |
title_short | Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? |
title_sort | are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions? |
topic | Regular Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677568/ https://www.ncbi.nlm.nih.gov/pubmed/33989428 http://dx.doi.org/10.1093/sysbio/syab037 |
work_keys_str_mv | AT paragkrisv areskylineplotbaseddemographicestimatesoverlydependentonsmoothingpriorassumptions AT pybusoliverg areskylineplotbaseddemographicestimatesoverlydependentonsmoothingpriorassumptions AT wuchiehhsi areskylineplotbaseddemographicestimatesoverlydependentonsmoothingpriorassumptions |