
Comparison of 61 Sequenced Escherichia coli Genomes

Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees...


Bibliographic Details
Main authors: Lukjancenko, Oksana; Wassenaar, Trudy M.; Ussery, David W.
Format: Text
Language: English
Published: Springer-Verlag 2010
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2974192/
https://www.ncbi.nlm.nih.gov/pubmed/20623278
http://dx.doi.org/10.1007/s00248-010-9717-3
author Lukjancenko, Oksana
Wassenaar, Trudy M.
Ussery, David W.
collection PubMed
description Escherichia coli is an important component of the biosphere and is an ideal model for studies of processes involved in bacterial genome evolution. Sixty-one publicly available E. coli and Shigella spp. sequenced genomes are compared, using basic methods to produce phylogenetic and proteomics trees, and to identify the pan- and core genomes of this set of sequenced strains. A hierarchical clustering of variable genes allowed clear separation of the strains into clusters, including known pathotypes; clinically relevant serotypes can also be resolved in this way. In contrast, when in silico MLST was performed, many of the strains appeared jumbled and less well resolved. The predicted pan-genome comprises 15,741 gene families, and only 993 (6%) of the families are represented in every genome, comprising the core genome. The variable or ‘accessory’ genes thus make up more than 90% of the pan-genome and about 80% of a typical genome; some of these variable genes tend to be co-localized on genomic islands. The diversity within the species E. coli, and the overlap in gene content between this and related species, suggest a continuum rather than sharp species borders in this group of Enterobacteriaceae.
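The pan- and core-genome figures in the description can be read as set operations over gene families: the pan-genome is the union of the families found in any genome, and the core genome is the intersection of the families found in every genome. The following is a minimal Python sketch of that idea, not the authors' pipeline. The function name `pan_and_core`, the strain labels, and the family identifiers are hypothetical and chosen only for illustration; only the counts 993 and 15,741 come from the record.

```python
def pan_and_core(genomes):
    """Return (pan_genome, core_genome) for a mapping of strain -> set of gene families.

    The pan-genome is the union of all families; the core genome is the
    intersection, i.e. the families present in every strain.
    """
    if not genomes:
        return set(), set()
    family_sets = list(genomes.values())
    pan = set().union(*family_sets)
    core = set.intersection(*family_sets)
    return pan, core


# Hypothetical toy data (not taken from the paper): three strains, five families.
toy = {
    "strain_A": {"fam1", "fam2", "fam3"},
    "strain_B": {"fam1", "fam2", "fam4"},
    "strain_C": {"fam1", "fam5"},
}

pan, core = pan_and_core(toy)
accessory = pan - core  # variable ('accessory') families

# Core share of the pan-genome, using the counts reported in the description.
core_fraction = 993 / 15741

print(sorted(pan))
print(sorted(core))
print(f"{core_fraction:.1%}")
```

With the reported counts, the core share works out to roughly 6% of the pan-genome, which is consistent with the statement that accessory genes make up more than 90% of it.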
format Text
id pubmed-2974192
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Springer-Verlag
record_format MEDLINE/PubMed
spelling pubmed-2974192 2010-11-29 Comparison of 61 Sequenced Escherichia coli Genomes Lukjancenko, Oksana; Wassenaar, Trudy M.; Ussery, David W. Microb Ecol Minireviews Springer-Verlag 2010-07-11 2010 /pmc/articles/PMC2974192/ /pubmed/20623278 http://dx.doi.org/10.1007/s00248-010-9717-3 Text en © The Author(s) 2010 https://creativecommons.org/licenses/by-nc/4.0/ This article is distributed under the terms of the Creative Commons Attribution Noncommercial License, which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
topic Minireviews