Cargando…

Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks

The genomic diversity across strains of a species forms the genetic basis for differences in their behavior. A large-scale assessment of sequence variation has been made possible by the growing availability of strain-specific whole-genome sequences (WGS) and with the advent of large-scale databases...

Descripción completa

Detalles Bibliográficos
Autores principales: Catoiu, Edward Alexander, Phaneuf, Patrick, Monk, Jonathan, Palsson, Bernhard O.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: National Academy of Sciences 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104531/
https://www.ncbi.nlm.nih.gov/pubmed/37011218
http://dx.doi.org/10.1073/pnas.2218835120
_version_ 1785026058995630080
author Catoiu, Edward Alexander
Phaneuf, Patrick
Monk, Jonathan
Palsson, Bernhard O.
author_facet Catoiu, Edward Alexander
Phaneuf, Patrick
Monk, Jonathan
Palsson, Bernhard O.
author_sort Catoiu, Edward Alexander
collection PubMed
description The genomic diversity across strains of a species forms the genetic basis for differences in their behavior. A large-scale assessment of sequence variation has been made possible by the growing availability of strain-specific whole-genome sequences (WGS) and with the advent of large-scale databases of laboratory-acquired mutations. We define the Escherichia coli “alleleome” through a genome-scale assessment of amino acid (AA) sequence diversity in open reading frames across 2,661 WGS from wild-type strains. We observe a highly conserved alleleome enriched in mutations unlikely to affect protein function. In contrast, 33,000 mutations acquired in laboratory evolution experiments result in more severe AA substitutions that are rarely achieved by natural selection. Large-scale assessment of the alleleome establishes a method for the quantification of bacterial allelic diversity, reveals opportunities for synthetic biology to explore novel sequence space, and offers insights into the constraints governing evolution.
format Online
Article
Text
id pubmed-10104531
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher National Academy of Sciences
record_format MEDLINE/PubMed
spelling pubmed-101045312023-04-15 Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks Catoiu, Edward Alexander Phaneuf, Patrick Monk, Jonathan Palsson, Bernhard O. Proc Natl Acad Sci U S A Biological Sciences The genomic diversity across strains of a species forms the genetic basis for differences in their behavior. A large-scale assessment of sequence variation has been made possible by the growing availability of strain-specific whole-genome sequences (WGS) and with the advent of large-scale databases of laboratory-acquired mutations. We define the Escherichia coli “alleleome” through a genome-scale assessment of amino acid (AA) sequence diversity in open reading frames across 2,661 WGS from wild-type strains. We observe a highly conserved alleleome enriched in mutations unlikely to affect protein function. In contrast, 33,000 mutations acquired in laboratory evolution experiments result in more severe AA substitutions that are rarely achieved by natural selection. Large-scale assessment of the alleleome establishes a method for the quantification of bacterial allelic diversity, reveals opportunities for synthetic biology to explore novel sequence space, and offers insights into the constraints governing evolution. National Academy of Sciences 2023-04-03 2023-04-11 /pmc/articles/PMC10104531/ /pubmed/37011218 http://dx.doi.org/10.1073/pnas.2218835120 Text en Copyright © 2023 the Author(s). Published by PNAS. https://creativecommons.org/licenses/by-nc-nd/4.0/This open access article is distributed under Creative Commons Attribution-NonCommercial-NoDerivatives License 4.0 (CC BY-NC-ND) (https://creativecommons.org/licenses/by-nc-nd/4.0/) .
spellingShingle Biological Sciences
Catoiu, Edward Alexander
Phaneuf, Patrick
Monk, Jonathan
Palsson, Bernhard O.
Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
title Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
title_full Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
title_fullStr Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
title_full_unstemmed Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
title_short Whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
title_sort whole-genome sequences from wild-type and laboratory-evolved strains define the alleleome and establish its hallmarks
topic Biological Sciences
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10104531/
https://www.ncbi.nlm.nih.gov/pubmed/37011218
http://dx.doi.org/10.1073/pnas.2218835120
work_keys_str_mv AT catoiuedwardalexander wholegenomesequencesfromwildtypeandlaboratoryevolvedstrainsdefinethealleleomeandestablishitshallmarks
AT phaneufpatrick wholegenomesequencesfromwildtypeandlaboratoryevolvedstrainsdefinethealleleomeandestablishitshallmarks
AT monkjonathan wholegenomesequencesfromwildtypeandlaboratoryevolvedstrainsdefinethealleleomeandestablishitshallmarks
AT palssonbernhardo wholegenomesequencesfromwildtypeandlaboratoryevolvedstrainsdefinethealleleomeandestablishitshallmarks