Cargando…

Predicting the evolution of Escherichia coli by a data-driven approach

A tantalizing question in evolutionary biology is whether evolution can be predicted from past experiences. To address this question, we created a coherent compendium of more than 15,000 mutation events for the bacterium Escherichia coli under 178 distinct environmental settings. Compendium analysis...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Xiaokang, Zorraquino, Violeta, Kim, Minseung, Tsoukalas, Athanasios, Tagkopoulos, Ilias
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6120903/
https://www.ncbi.nlm.nih.gov/pubmed/30177705
http://dx.doi.org/10.1038/s41467-018-05807-z
_version_ 1783352345967460352
author Wang, Xiaokang
Zorraquino, Violeta
Kim, Minseung
Tsoukalas, Athanasios
Tagkopoulos, Ilias
author_facet Wang, Xiaokang
Zorraquino, Violeta
Kim, Minseung
Tsoukalas, Athanasios
Tagkopoulos, Ilias
author_sort Wang, Xiaokang
collection PubMed
description A tantalizing question in evolutionary biology is whether evolution can be predicted from past experiences. To address this question, we created a coherent compendium of more than 15,000 mutation events for the bacterium Escherichia coli under 178 distinct environmental settings. Compendium analysis provides a comprehensive view of the explored environments, mutation hotspots and mutation co-occurrence. While the mutations shared across all replicates decrease with the number of replicates, our results argue that the pairwise overlapping ratio remains the same, regardless of the number of replicates. An ensemble of predictors trained on the mutation compendium and tested in forward validation over 35 evolution replicates achieves a 49.2 ± 5.8% (mean ± std) precision and 34.5 ± 5.7% recall in predicting mutation targets. This work demonstrates how integrated datasets can be harnessed to create predictive models of evolution at a gene level and elucidate the effect of evolutionary processes in well-defined environments.
format Online
Article
Text
id pubmed-6120903
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-61209032018-09-05 Predicting the evolution of Escherichia coli by a data-driven approach Wang, Xiaokang Zorraquino, Violeta Kim, Minseung Tsoukalas, Athanasios Tagkopoulos, Ilias Nat Commun Article A tantalizing question in evolutionary biology is whether evolution can be predicted from past experiences. To address this question, we created a coherent compendium of more than 15,000 mutation events for the bacterium Escherichia coli under 178 distinct environmental settings. Compendium analysis provides a comprehensive view of the explored environments, mutation hotspots and mutation co-occurrence. While the mutations shared across all replicates decrease with the number of replicates, our results argue that the pairwise overlapping ratio remains the same, regardless of the number of replicates. An ensemble of predictors trained on the mutation compendium and tested in forward validation over 35 evolution replicates achieves a 49.2 ± 5.8% (mean ± std) precision and 34.5 ± 5.7% recall in predicting mutation targets. This work demonstrates how integrated datasets can be harnessed to create predictive models of evolution at a gene level and elucidate the effect of evolutionary processes in well-defined environments. Nature Publishing Group UK 2018-09-03 /pmc/articles/PMC6120903/ /pubmed/30177705 http://dx.doi.org/10.1038/s41467-018-05807-z Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Wang, Xiaokang
Zorraquino, Violeta
Kim, Minseung
Tsoukalas, Athanasios
Tagkopoulos, Ilias
Predicting the evolution of Escherichia coli by a data-driven approach
title Predicting the evolution of Escherichia coli by a data-driven approach
title_full Predicting the evolution of Escherichia coli by a data-driven approach
title_fullStr Predicting the evolution of Escherichia coli by a data-driven approach
title_full_unstemmed Predicting the evolution of Escherichia coli by a data-driven approach
title_short Predicting the evolution of Escherichia coli by a data-driven approach
title_sort predicting the evolution of escherichia coli by a data-driven approach
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6120903/
https://www.ncbi.nlm.nih.gov/pubmed/30177705
http://dx.doi.org/10.1038/s41467-018-05807-z
work_keys_str_mv AT wangxiaokang predictingtheevolutionofescherichiacolibyadatadrivenapproach
AT zorraquinovioleta predictingtheevolutionofescherichiacolibyadatadrivenapproach
AT kimminseung predictingtheevolutionofescherichiacolibyadatadrivenapproach
AT tsoukalasathanasios predictingtheevolutionofescherichiacolibyadatadrivenapproach
AT tagkopoulosilias predictingtheevolutionofescherichiacolibyadatadrivenapproach