Cargando…

Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study

Inferring epidemiological parameters such as the R(0) from time-scaled phylogenies is a timely challenge. Most current approaches rely on likelihood functions, which raise specific issues that range from computing these functions to finding their maxima numerically. Here, we present a new regression...

Descripción completa

Detalles Bibliográficos
Autores principales: Saulnier, Emma, Gascuel, Olivier, Alizon, Samuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358897/
https://www.ncbi.nlm.nih.gov/pubmed/28263987
http://dx.doi.org/10.1371/journal.pcbi.1005416
_version_ 1782516301474824192
author Saulnier, Emma
Gascuel, Olivier
Alizon, Samuel
author_facet Saulnier, Emma
Gascuel, Olivier
Alizon, Samuel
author_sort Saulnier, Emma
collection PubMed
description Inferring epidemiological parameters such as the R(0) from time-scaled phylogenies is a timely challenge. Most current approaches rely on likelihood functions, which raise specific issues that range from computing these functions to finding their maxima numerically. Here, we present a new regression-based Approximate Bayesian Computation (ABC) approach, which we base on a large variety of summary statistics intended to capture the information contained in the phylogeny and its corresponding lineage-through-time plot. The regression step involves the Least Absolute Shrinkage and Selection Operator (LASSO) method, which is a robust machine learning technique. It allows us to readily deal with the large number of summary statistics, while avoiding resorting to Markov Chain Monte Carlo (MCMC) techniques. To compare our approach to existing ones, we simulated target trees under a variety of epidemiological models and settings, and inferred parameters of interest using the same priors. We found that, for large phylogenies, the accuracy of our regression-ABC is comparable to that of likelihood-based approaches involving birth-death processes implemented in BEAST2. Our approach even outperformed these when inferring the host population size with a Susceptible-Infected-Removed epidemiological model. It also clearly outperformed a recent kernel-ABC approach when assuming a Susceptible-Infected epidemiological model with two host types. Lastly, by re-analyzing data from the early stages of the recent Ebola epidemic in Sierra Leone, we showed that regression-ABC provides more realistic estimates for the duration parameters (latency and infectiousness) than the likelihood-based method. Overall, ABC based on a large variety of summary statistics and a regression method able to perform variable selection and avoid overfitting is a promising approach to analyze large phylogenies.
format Online
Article
Text
id pubmed-5358897
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-53588972017-04-06 Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study Saulnier, Emma Gascuel, Olivier Alizon, Samuel PLoS Comput Biol Research Article Inferring epidemiological parameters such as the R(0) from time-scaled phylogenies is a timely challenge. Most current approaches rely on likelihood functions, which raise specific issues that range from computing these functions to finding their maxima numerically. Here, we present a new regression-based Approximate Bayesian Computation (ABC) approach, which we base on a large variety of summary statistics intended to capture the information contained in the phylogeny and its corresponding lineage-through-time plot. The regression step involves the Least Absolute Shrinkage and Selection Operator (LASSO) method, which is a robust machine learning technique. It allows us to readily deal with the large number of summary statistics, while avoiding resorting to Markov Chain Monte Carlo (MCMC) techniques. To compare our approach to existing ones, we simulated target trees under a variety of epidemiological models and settings, and inferred parameters of interest using the same priors. We found that, for large phylogenies, the accuracy of our regression-ABC is comparable to that of likelihood-based approaches involving birth-death processes implemented in BEAST2. Our approach even outperformed these when inferring the host population size with a Susceptible-Infected-Removed epidemiological model. It also clearly outperformed a recent kernel-ABC approach when assuming a Susceptible-Infected epidemiological model with two host types. Lastly, by re-analyzing data from the early stages of the recent Ebola epidemic in Sierra Leone, we showed that regression-ABC provides more realistic estimates for the duration parameters (latency and infectiousness) than the likelihood-based method. Overall, ABC based on a large variety of summary statistics and a regression method able to perform variable selection and avoid overfitting is a promising approach to analyze large phylogenies. Public Library of Science 2017-03-06 /pmc/articles/PMC5358897/ /pubmed/28263987 http://dx.doi.org/10.1371/journal.pcbi.1005416 Text en © 2017 Saulnier et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Saulnier, Emma
Gascuel, Olivier
Alizon, Samuel
Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study
title Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study
title_full Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study
title_fullStr Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study
title_full_unstemmed Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study
title_short Inferring epidemiological parameters from phylogenies using regression-ABC: A comparative study
title_sort inferring epidemiological parameters from phylogenies using regression-abc: a comparative study
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5358897/
https://www.ncbi.nlm.nih.gov/pubmed/28263987
http://dx.doi.org/10.1371/journal.pcbi.1005416
work_keys_str_mv AT saulnieremma inferringepidemiologicalparametersfromphylogeniesusingregressionabcacomparativestudy
AT gascuelolivier inferringepidemiologicalparametersfromphylogeniesusingregressionabcacomparativestudy
AT alizonsamuel inferringepidemiologicalparametersfromphylogeniesusingregressionabcacomparativestudy