Cargando…

Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations

The multi-type birth–death model with sampling is a phylodynamic model which enables the quantification of past population dynamics in structured populations based on phylogenetic trees. The BEAST 2 package bdmm implements an algorithm for numerically computing the probability density of a phylogene...

Descripción completa

Detalles Bibliográficos
Autores principales: Scire, Jérémie, Barido-Sottani, Joëlle, Kühnert, Denise, Vaughan, Timothy G., Stadler, Tanja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9413058/
https://www.ncbi.nlm.nih.gov/pubmed/36016270
http://dx.doi.org/10.3390/v14081648
_version_ 1784775643299315712
author Scire, Jérémie
Barido-Sottani, Joëlle
Kühnert, Denise
Vaughan, Timothy G.
Stadler, Tanja
author_facet Scire, Jérémie
Barido-Sottani, Joëlle
Kühnert, Denise
Vaughan, Timothy G.
Stadler, Tanja
author_sort Scire, Jérémie
collection PubMed
description The multi-type birth–death model with sampling is a phylodynamic model which enables the quantification of past population dynamics in structured populations based on phylogenetic trees. The BEAST 2 package bdmm implements an algorithm for numerically computing the probability density of a phylogenetic tree given the population dynamic parameters under this model. In the initial release of bdmm, analyses were computationally limited to trees consisting of up to approximately 250 genetic samples. We implemented important algorithmic changes to bdmm which dramatically increased the number of genetic samples that could be analyzed and which improved the numerical robustness and efficiency of the calculations. Including more samples led to the improved precision of parameter estimates, particularly for structured models with a high number of inferred parameters. Furthermore, we report on several model extensions to bdmm, inspired by properties common to empirical datasets. We applied this improved algorithm to two partly overlapping datasets of the Influenza A virus HA sequences sampled around the world—one with 500 samples and the other with only 175—for comparison. We report and compare the global migration patterns and seasonal dynamics inferred from each dataset. In this way, we show the information that is gained by analyzing the bigger dataset, which became possible with the presented algorithmic changes to bdmm. In summary, bdmm allows for the robust, faster, and more general phylodynamic inference of larger datasets.
format Online
Article
Text
id pubmed-9413058
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-94130582022-08-27 Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations Scire, Jérémie Barido-Sottani, Joëlle Kühnert, Denise Vaughan, Timothy G. Stadler, Tanja Viruses Article The multi-type birth–death model with sampling is a phylodynamic model which enables the quantification of past population dynamics in structured populations based on phylogenetic trees. The BEAST 2 package bdmm implements an algorithm for numerically computing the probability density of a phylogenetic tree given the population dynamic parameters under this model. In the initial release of bdmm, analyses were computationally limited to trees consisting of up to approximately 250 genetic samples. We implemented important algorithmic changes to bdmm which dramatically increased the number of genetic samples that could be analyzed and which improved the numerical robustness and efficiency of the calculations. Including more samples led to the improved precision of parameter estimates, particularly for structured models with a high number of inferred parameters. Furthermore, we report on several model extensions to bdmm, inspired by properties common to empirical datasets. We applied this improved algorithm to two partly overlapping datasets of the Influenza A virus HA sequences sampled around the world—one with 500 samples and the other with only 175—for comparison. We report and compare the global migration patterns and seasonal dynamics inferred from each dataset. In this way, we show the information that is gained by analyzing the bigger dataset, which became possible with the presented algorithmic changes to bdmm. In summary, bdmm allows for the robust, faster, and more general phylodynamic inference of larger datasets. MDPI 2022-07-27 /pmc/articles/PMC9413058/ /pubmed/36016270 http://dx.doi.org/10.3390/v14081648 Text en © 2022 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Scire, Jérémie
Barido-Sottani, Joëlle
Kühnert, Denise
Vaughan, Timothy G.
Stadler, Tanja
Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations
title Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations
title_full Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations
title_fullStr Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations
title_full_unstemmed Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations
title_short Robust Phylodynamic Analysis of Genetic Sequencing Data from Structured Populations
title_sort robust phylodynamic analysis of genetic sequencing data from structured populations
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9413058/
https://www.ncbi.nlm.nih.gov/pubmed/36016270
http://dx.doi.org/10.3390/v14081648
work_keys_str_mv AT scirejeremie robustphylodynamicanalysisofgeneticsequencingdatafromstructuredpopulations
AT baridosottanijoelle robustphylodynamicanalysisofgeneticsequencingdatafromstructuredpopulations
AT kuhnertdenise robustphylodynamicanalysisofgeneticsequencingdatafromstructuredpopulations
AT vaughantimothyg robustphylodynamicanalysisofgeneticsequencingdatafromstructuredpopulations
AT stadlertanja robustphylodynamicanalysisofgeneticsequencingdatafromstructuredpopulations