Cargando…

Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?

In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, an...

Descripción completa

Detalles Bibliográficos
Autores principales: Parag, Kris V, Pybus, Oliver G, Wu, Chieh-Hsi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677568/
https://www.ncbi.nlm.nih.gov/pubmed/33989428
http://dx.doi.org/10.1093/sysbio/syab037
_version_ 1784616167241940992
author Parag, Kris V
Pybus, Oliver G
Wu, Chieh-Hsi
author_facet Parag, Kris V
Pybus, Oliver G
Wu, Chieh-Hsi
author_sort Parag, Kris V
collection PubMed
description In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, [Formula: see text] , to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using [Formula: see text] we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose [Formula: see text] as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.]
format Online
Article
Text
id pubmed-8677568
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-86775682021-12-17 Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions? Parag, Kris V Pybus, Oliver G Wu, Chieh-Hsi Syst Biol Regular Articles In Bayesian phylogenetics, the coalescent process provides an informative framework for inferring changes in the effective size of a population from a phylogeny (or tree) of sequences sampled from that population. Popular coalescent inference approaches such as the Bayesian Skyline Plot, Skyride, and Skygrid all model these population size changes with a discontinuous, piecewise-constant function but then apply a smoothing prior to ensure that their posterior population size estimates transition gradually with time. These prior distributions implicitly encode extra population size information that is not available from the observed coalescent data or tree. Here, we present a novel statistic, [Formula: see text] , to quantify and disaggregate the relative contributions of the coalescent data and prior assumptions to the resulting posterior estimate precision. Our statistic also measures the additional mutual information introduced by such priors. Using [Formula: see text] we show that, because it is surprisingly easy to overparametrize piecewise-constant population models, common smoothing priors can lead to overconfident and potentially misleading inference, even under robust experimental designs. We propose [Formula: see text] as a useful tool for detecting when effective population size estimates are overly reliant on prior assumptions and for improving quantification of the uncertainty in those estimates.[Coalescent processes; effective population size; information theory; phylodynamics; prior assumptions; skyline plots.] Oxford University Press 2021-05-13 /pmc/articles/PMC8677568/ /pubmed/33989428 http://dx.doi.org/10.1093/sysbio/syab037 Text en © The Author(s) 2021. Published by Oxford University Press on behalf of the Society of Systematic Biologists. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Parag, Kris V
Pybus, Oliver G
Wu, Chieh-Hsi
Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
title Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
title_full Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
title_fullStr Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
title_full_unstemmed Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
title_short Are Skyline Plot-Based Demographic Estimates Overly Dependent on Smoothing Prior Assumptions?
title_sort are skyline plot-based demographic estimates overly dependent on smoothing prior assumptions?
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8677568/
https://www.ncbi.nlm.nih.gov/pubmed/33989428
http://dx.doi.org/10.1093/sysbio/syab037
work_keys_str_mv AT paragkrisv areskylineplotbaseddemographicestimatesoverlydependentonsmoothingpriorassumptions
AT pybusoliverg areskylineplotbaseddemographicestimatesoverlydependentonsmoothingpriorassumptions
AT wuchiehhsi areskylineplotbaseddemographicestimatesoverlydependentonsmoothingpriorassumptions