
Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models

Estimating temporal changes in a target population from phylogenetic or count data is an important problem in ecology and epidemiology. Reliable estimates can provide key insights into the climatic and biological drivers influencing the diversity or structure of that population and evidence hypothes...


Bibliographic Details
Main Authors: Parag, Kris V, Donnelly, Christl A
Format: Online Article Text
Language: English
Published: Oxford University Press 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7584150/
https://www.ncbi.nlm.nih.gov/pubmed/32333789
http://dx.doi.org/10.1093/sysbio/syaa035
_version_ 1783599536488316928
author Parag, Kris V
Donnelly, Christl A
author_facet Parag, Kris V
Donnelly, Christl A
author_sort Parag, Kris V
collection PubMed
description Estimating temporal changes in a target population from phylogenetic or count data is an important problem in ecology and epidemiology. Reliable estimates can provide key insights into the climatic and biological drivers influencing the diversity or structure of that population and evidence hypotheses concerning its future growth or decline. In infectious disease applications, the individuals infected across an epidemic form the target population. The renewal model estimates the effective reproduction number, R, of the epidemic from counts of observed incident cases. The skyline model infers the effective population size, N, underlying a phylogeny of sequences sampled from that epidemic. Practically, R measures ongoing epidemic growth while N informs on historical caseload. While both models solve distinct problems, the reliability of their estimates depends on p-dimensional piecewise-constant functions. If p is misspecified, the model might underfit significant changes or overfit noise and promote a spurious understanding of the epidemic, which might misguide intervention policies or misinform forecasts. Surprisingly, no transparent yet principled approach for optimizing p exists. Usually, p is heuristically set, or obscurely controlled via complex algorithms. We present a computable and interpretable p-selection method based on the minimum description length (MDL) formalism of information theory. Unlike many standard model selection techniques, MDL accounts for the additional statistical complexity induced by how parameters interact. As a result, our method optimizes p so that R and N estimates properly and meaningfully adapt to available data. It also outperforms comparable Akaike and Bayesian information criteria on several classification problems, given minimal knowledge of the parameter space, and exposes statistical similarities among renewal, skyline, and other models in biology. 
Rigorous and interpretable model selection is necessary if trustworthy and justifiable conclusions are to be drawn from piecewise models. [Coalescent processes; epidemiology; information theory; model selection; phylodynamics; renewal models; skyline plots]
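As a rough illustration of the p-selection problem described in the abstract, the sketch below fits a renewal model with p piecewise-constant values of R to incidence counts and picks p by minimizing a criterion. The abstract does not state the MDL criterion itself, so this sketch substitutes the Bayesian information criterion (BIC), one of the criteria the abstract names as a comparison; it is not the authors' method. The Poisson observation model, the serial-interval weights `w`, the equal-width segments, and all function names are assumptions made for the example.

```python
import math


def total_infectiousness(incidence, w):
    """Lambda_t = sum_s w[s-1] * I_{t-s}: past cases weighted by serial interval."""
    lam = []
    for t in range(len(incidence)):
        lam.append(sum(w[s - 1] * incidence[t - s]
                       for s in range(1, min(t, len(w)) + 1)))
    return lam


def fit_piecewise_r(incidence, w, p):
    """Fit p equal-width constant-R segments by maximum likelihood.

    Assumes I_t ~ Poisson(R_j * Lambda_t) within segment j.
    Returns (list of R estimates, BIC).
    """
    lam = total_infectiousness(incidence, w)
    # Simplification: skip time points with zero infectiousness (e.g. t = 0).
    times = [t for t in range(1, len(incidence)) if lam[t] > 0]
    n = len(times)
    if p < 1 or p > n:
        raise ValueError("p must lie between 1 and the number of usable time points")
    r_hat, loglik = [], 0.0
    for j in range(p):
        seg = times[j * n // p:(j + 1) * n // p]
        cases = sum(incidence[t] for t in seg)
        infectiousness = sum(lam[t] for t in seg)
        r = cases / infectiousness  # Poisson maximum-likelihood estimate
        r_hat.append(r)
        for t in seg:
            mean = r * lam[t]
            if mean > 0:  # mean == 0 only if the segment has no cases
                loglik += (incidence[t] * math.log(mean) - mean
                           - math.lgamma(incidence[t] + 1))
    bic = -2.0 * loglik + p * math.log(n)  # BIC stands in for MDL here
    return r_hat, bic


def select_p(incidence, w, max_p):
    """Return (p, R estimates, BIC) for the p in 1..max_p with the lowest BIC."""
    best = None
    for p in range(1, max_p + 1):
        r_hat, bic = fit_piecewise_r(incidence, w, p)
        if best is None or bic < best[2]:
            best = (p, r_hat, bic)
    return best


if __name__ == "__main__":
    # Hypothetical epidemic: cases double each step, then halve.
    cases = [1, 2, 4, 8, 16, 32, 64, 32, 16, 8, 4, 2, 1]
    p, r_hat, bic = select_p(cases, w=[1.0], max_p=4)
    print(p, r_hat, round(bic, 2))
```

Too small a p averages the growth and decline phases into a single R, while a larger p than needed adds parameters without improving the fit; the penalty term is what trades these off. BIC penalizes only the number of parameters, whereas the abstract states that MDL also accounts for how parameters interact.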
format Online
Article
Text
id pubmed-7584150
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-7584150 2020-10-29 Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models Parag, Kris V Donnelly, Christl A Syst Biol Regular Articles Oxford University Press 2020-04-25 /pmc/articles/PMC7584150/ /pubmed/32333789 http://dx.doi.org/10.1093/sysbio/syaa035 Text en © The Author(s) 2020. Published by Oxford University Press on behalf of the Society of Systematic Biologists. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Regular Articles
Parag, Kris V
Donnelly, Christl A
Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models
title Adaptive Estimation for Epidemic Renewal and Phylogenetic Skyline Models
title_sort adaptive estimation for epidemic renewal and phylogenetic skyline models
topic Regular Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7584150/
https://www.ncbi.nlm.nih.gov/pubmed/32333789
http://dx.doi.org/10.1093/sysbio/syaa035
work_keys_str_mv AT paragkrisv adaptiveestimationforepidemicrenewalandphylogeneticskylinemodels
AT donnellychristla adaptiveestimationforepidemicrenewalandphylogeneticskylinemodels