Cargando…
To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
BACKGROUND: Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205054/ https://www.ncbi.nlm.nih.gov/pubmed/35710371 http://dx.doi.org/10.1186/s12915-022-01347-7 |
_version_ | 1784729049639157760 |
---|---|
author | Tantoso, Erwin Eisenhaber, Birgit Kirsch, Miles Shitov, Vladimir Zhao, Zhiya Eisenhaber, Frank |
author_facet | Tantoso, Erwin Eisenhaber, Birgit Kirsch, Miles Shitov, Vladimir Zhao, Zhiya Eisenhaber, Frank |
author_sort | Tantoso, Erwin |
collection | PubMed |
description | BACKGROUND: Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome analysis has been attempted several times, there is no systematic functional characterization of the E. coli subgroups according to the gene profile. RESULTS: Systematically scanning for optimal parametrization, we have built the E. coli pangenome from 1324 complete genomes. The pangenome size is estimated to be ~25,000 gene families (GFs). Whereas the core genome diminishes as more genomes are added, the softcore genome (≥95% of strains) is stable with ~3000 GFs regardless of the total number of genomes. Apparently, the softcore genome (with a 92% or 95% generation threshold) can define the genome of a bacterial species listing the critically relevant, evolutionarily most conserved or important classes of GFs. Unsupervised clustering of common E. coli sequence types using the presence/absence GF matrix reveals distinct characteristics of E. coli phylogroups B1, B2, and E. We highlight the bi-lineage nature of B1, the variation of the secretion and of the iron acquisition systems in ST11 (E), and the incorporation of a highly conserved prophage into the genome of ST131 (B2). The tail structure of the prophage is evolutionarily related to R2-pyocin (a tailocin) from Pseudomonas aeruginosa PAO1. We hypothesize that this molecular machinery is highly likely to play an important role in protecting its own colonies; thus, contributing towards the rapid rise of pandemic E. coli ST131. CONCLUSIONS: This study has explored the optimized pangenome development in E. coli. We provide complete GF lists and the pangenome matrix as supplementary data for further studies. We identified biological characteristics of different E. coli subtypes, specifically for phylogroups B1, B2, and E. We found an operon-like genome region coding for a tailocin specific for ST131 strains. The latter is a potential killer weapon providing pandemic E. coli ST131 with an advantage in inter-bacterial competition and, suggestively, explains their dominance as human pathogen among E. coli strains. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01347-7. |
format | Online Article Text |
id | pubmed-9205054 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-92050542022-06-18 To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 Tantoso, Erwin Eisenhaber, Birgit Kirsch, Miles Shitov, Vladimir Zhao, Zhiya Eisenhaber, Frank BMC Biol Research Article BACKGROUND: Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome analysis has been attempted several times, there is no systematic functional characterization of the E. coli subgroups according to the gene profile. RESULTS: Systematically scanning for optimal parametrization, we have built the E. coli pangenome from 1324 complete genomes. The pangenome size is estimated to be ~25,000 gene families (GFs). Whereas the core genome diminishes as more genomes are added, the softcore genome (≥95% of strains) is stable with ~3000 GFs regardless of the total number of genomes. Apparently, the softcore genome (with a 92% or 95% generation threshold) can define the genome of a bacterial species listing the critically relevant, evolutionarily most conserved or important classes of GFs. Unsupervised clustering of common E. coli sequence types using the presence/absence GF matrix reveals distinct characteristics of E. coli phylogroups B1, B2, and E. We highlight the bi-lineage nature of B1, the variation of the secretion and of the iron acquisition systems in ST11 (E), and the incorporation of a highly conserved prophage into the genome of ST131 (B2). The tail structure of the prophage is evolutionarily related to R2-pyocin (a tailocin) from Pseudomonas aeruginosa PAO1. We hypothesize that this molecular machinery is highly likely to play an important role in protecting its own colonies; thus, contributing towards the rapid rise of pandemic E. coli ST131. CONCLUSIONS: This study has explored the optimized pangenome development in E. coli. We provide complete GF lists and the pangenome matrix as supplementary data for further studies. We identified biological characteristics of different E. coli subtypes, specifically for phylogroups B1, B2, and E. We found an operon-like genome region coding for a tailocin specific for ST131 strains. The latter is a potential killer weapon providing pandemic E. coli ST131 with an advantage in inter-bacterial competition and, suggestively, explains their dominance as human pathogen among E. coli strains. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01347-7. BioMed Central 2022-06-16 /pmc/articles/PMC9205054/ /pubmed/35710371 http://dx.doi.org/10.1186/s12915-022-01347-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Article Tantoso, Erwin Eisenhaber, Birgit Kirsch, Miles Shitov, Vladimir Zhao, Zhiya Eisenhaber, Frank To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 |
title | To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 |
title_full | To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 |
title_fullStr | To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 |
title_full_unstemmed | To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 |
title_short | To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 |
title_sort | to kill or to be killed: pangenome analysis of escherichia coli strains reveals a tailocin specific for pandemic st131 |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205054/ https://www.ncbi.nlm.nih.gov/pubmed/35710371 http://dx.doi.org/10.1186/s12915-022-01347-7 |
work_keys_str_mv | AT tantosoerwin tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131 AT eisenhaberbirgit tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131 AT kirschmiles tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131 AT shitovvladimir tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131 AT zhaozhiya tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131 AT eisenhaberfrank tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131 |