Cargando…

DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)

Rapidly evolving microbes are a challenge to model because of the volatile, complex, and dynamic nature of their populations. We developed the DISSEQT pipeline (DIStribution-based SEQuence space Time dynamics) for analyzing, visualizing, and predicting the evolution of heterogeneous biological popul...

Descripción completa

Detalles Bibliográficos
Autores principales: Henningsson, R, Moratorio, G, Bordería, A V, Vignuzzi, M, Fontes, M
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6680062/
https://www.ncbi.nlm.nih.gov/pubmed/31392032
http://dx.doi.org/10.1093/ve/vez028
_version_ 1783441430225616896
author Henningsson, R
Moratorio, G
Bordería, A V
Vignuzzi, M
Fontes, M
author_facet Henningsson, R
Moratorio, G
Bordería, A V
Vignuzzi, M
Fontes, M
author_sort Henningsson, R
collection PubMed
description Rapidly evolving microbes are a challenge to model because of the volatile, complex, and dynamic nature of their populations. We developed the DISSEQT pipeline (DIStribution-based SEQuence space Time dynamics) for analyzing, visualizing, and predicting the evolution of heterogeneous biological populations in multidimensional genetic space, suited for population-based modeling of deep sequencing and high-throughput data. The pipeline is openly available on GitHub (https://github.com/rasmushenningsson/DISSEQT.jl, accessed 23 June 2019) and Synapse (https://www.synapse.org/#!Synapse: syn11425758, accessed 23 June 2019), covering the entire workflow from read alignment to visualization of results. Our pipeline is centered around robust dimension and model reduction algorithms for analysis of genotypic data with additional capabilities for including phenotypic features to explore dynamic genotype–phenotype maps. We illustrate its utility and capacity with examples from evolving RNA virus populations, which present one of the highest degrees of genetic heterogeneity within a given population found in nature. Using our pipeline, we empirically reconstruct the evolutionary trajectories of evolving populations in sequence space and genotype–phenotype fitness landscapes. We show that while sequence space is vastly multidimensional, the relevant genetic space of evolving microbial populations is of intrinsically low dimension. In addition, evolutionary trajectories of these populations can be faithfully monitored to identify the key minority genotypes contributing most to evolution. Finally, we show that empirical fitness landscapes, when reconstructed to include minority variants, can predict phenotype from genotype with high accuracy.
format Online
Article
Text
id pubmed-6680062
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-66800622019-08-07 DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†) Henningsson, R Moratorio, G Bordería, A V Vignuzzi, M Fontes, M Virus Evol Resources Rapidly evolving microbes are a challenge to model because of the volatile, complex, and dynamic nature of their populations. We developed the DISSEQT pipeline (DIStribution-based SEQuence space Time dynamics) for analyzing, visualizing, and predicting the evolution of heterogeneous biological populations in multidimensional genetic space, suited for population-based modeling of deep sequencing and high-throughput data. The pipeline is openly available on GitHub (https://github.com/rasmushenningsson/DISSEQT.jl, accessed 23 June 2019) and Synapse (https://www.synapse.org/#!Synapse: syn11425758, accessed 23 June 2019), covering the entire workflow from read alignment to visualization of results. Our pipeline is centered around robust dimension and model reduction algorithms for analysis of genotypic data with additional capabilities for including phenotypic features to explore dynamic genotype–phenotype maps. We illustrate its utility and capacity with examples from evolving RNA virus populations, which present one of the highest degrees of genetic heterogeneity within a given population found in nature. Using our pipeline, we empirically reconstruct the evolutionary trajectories of evolving populations in sequence space and genotype–phenotype fitness landscapes. We show that while sequence space is vastly multidimensional, the relevant genetic space of evolving microbial populations is of intrinsically low dimension. In addition, evolutionary trajectories of these populations can be faithfully monitored to identify the key minority genotypes contributing most to evolution. Finally, we show that empirical fitness landscapes, when reconstructed to include minority variants, can predict phenotype from genotype with high accuracy. Oxford University Press 2019-08-05 /pmc/articles/PMC6680062/ /pubmed/31392032 http://dx.doi.org/10.1093/ve/vez028 Text en © The Author(s) 2019. Published by Oxford University Press. http://creativecommons.org/licenses/by/4.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Resources
Henningsson, R
Moratorio, G
Bordería, A V
Vignuzzi, M
Fontes, M
DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)
title DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)
title_full DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)
title_fullStr DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)
title_full_unstemmed DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)
title_short DISSEQT—DIStribution-based modeling of SEQuence space Time dynamics(†)
title_sort disseqt—distribution-based modeling of sequence space time dynamics(†)
topic Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6680062/
https://www.ncbi.nlm.nih.gov/pubmed/31392032
http://dx.doi.org/10.1093/ve/vez028
work_keys_str_mv AT henningssonr disseqtdistributionbasedmodelingofsequencespacetimedynamics
AT moratoriog disseqtdistributionbasedmodelingofsequencespacetimedynamics
AT borderiaav disseqtdistributionbasedmodelingofsequencespacetimedynamics
AT vignuzzim disseqtdistributionbasedmodelingofsequencespacetimedynamics
AT fontesm disseqtdistributionbasedmodelingofsequencespacetimedynamics