Comparison of 61 Sequenced Escherichia coli Genomes
Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees...
Main authors: | Lukjancenko, Oksana; Wassenaar, Trudy M.; Ussery, David W. |
---|---|
Format: | Text |
Language: | English |
Published: | Springer-Verlag 2010 |
Subjects: | Minireviews |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2974192/ https://www.ncbi.nlm.nih.gov/pubmed/20623278 http://dx.doi.org/10.1007/s00248-010-9717-3 |
_version_ | 1782190865697996800 |
---|---|
author | Lukjancenko, Oksana Wassenaar, Trudy M. Ussery, David W. |
collection | PubMed |
description | Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees, and to identify the pan- and core genomes of this set of sequenced strains. A hierarchical clustering of variable genes allowed clear separation of the strains into clusters, including known pathotypes; clinically relevant serotypes can also be resolved in this way. In contrast, when in silico MLST was performed, many of the various strains appear jumbled and less well resolved. The predicted pan-genome comprises 15,741 gene families, and only 993 (6%) of the families are represented in every genome, comprising the core genome. The variable or ‘accessory’ genes thus make up more than 90% of the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of Enterobacteriaceae. |
format | Text |
id | pubmed-2974192 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | Springer-Verlag |
record_format | MEDLINE/PubMed |
spelling | pubmed-2974192 2010-11-29 Comparison of 61 Sequenced Escherichia coli Genomes Lukjancenko, Oksana Wassenaar, Trudy M. Ussery, David W. Microb Ecol Minireviews Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees, and to identify the pan- and core genomes of this set of sequenced strains. A hierarchical clustering of variable genes allowed clear separation of the strains into clusters, including known pathotypes; clinically relevant serotypes can also be resolved in this way. In contrast, when in silico MLST was performed, many of the various strains appear jumbled and less well resolved. The predicted pan-genome comprises 15,741 gene families, and only 993 (6%) of the families are represented in every genome, comprising the core genome. The variable or ‘accessory’ genes thus make up more than 90% of the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggests a continuum rather than sharp species borders in this group of Enterobacteriaceae. Springer-Verlag 2010-07-11 2010 /pmc/articles/PMC2974192/ /pubmed/20623278 http://dx.doi.org/10.1007/s00248-010-9717-3 Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited. |
title | Comparison of 61 Sequenced Escherichia coli Genomes |
topic | Minireviews |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2974192/ https://www.ncbi.nlm.nih.gov/pubmed/20623278 http://dx.doi.org/10.1007/s00248-010-9717-3 |
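
The description field in this record defines the pan-genome as all gene families found in the set of strains and the core genome as the families present in every genome. The sketch below illustrates those two definitions with plain set operations and rechecks the reported proportions (993 of 15,741 families). It is not the authors' pipeline: the function name `pan_and_core`, the strain labels, and the family IDs are hypothetical, and the toy data are invented purely for illustration.

```python
def pan_and_core(genomes):
    """Return (pan, core) for a dict mapping strain name -> set of gene-family IDs.

    pan  = families present in at least one genome (set union)
    core = families present in every genome (set intersection)
    """
    families = list(genomes.values())
    if not families:
        raise ValueError("at least one genome is required")
    pan = set().union(*families)
    core = set.intersection(*families)
    return pan, core


# Hypothetical toy input: three strains, five gene families.
toy = {
    "strain_A": {"fam1", "fam2", "fam3"},
    "strain_B": {"fam1", "fam2", "fam4"},
    "strain_C": {"fam1", "fam5"},
}
pan, core = pan_and_core(toy)
accessory = pan - core  # variable ('accessory') families

# Recheck the proportions quoted in the description field.
PAN_FAMILIES = 15741   # reported pan-genome size
CORE_FAMILIES = 993    # reported core-genome size
core_share = CORE_FAMILIES / PAN_FAMILIES
accessory_share = 1 - core_share

print(f"toy pan: {len(pan)}, toy core: {len(core)}, toy accessory: {len(accessory)}")
print(f"core share: {core_share:.1%}, accessory share: {accessory_share:.1%}")
```

With the reported counts, the core share comes to roughly 6% and the accessory share to a little under 94%, consistent with the "more than 90% of the pan-genome" statement in the description. The "about 80% of a typical genome" figure depends on per-genome family counts that this record does not give, so it is not recomputed here.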