Cargando…

To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131

BACKGROUND: Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome...

Descripción completa

Detalles Bibliográficos
Autores principales: Tantoso, Erwin, Eisenhaber, Birgit, Kirsch, Miles, Shitov, Vladimir, Zhao, Zhiya, Eisenhaber, Frank
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205054/
https://www.ncbi.nlm.nih.gov/pubmed/35710371
http://dx.doi.org/10.1186/s12915-022-01347-7
_version_ 1784729049639157760
author Tantoso, Erwin
Eisenhaber, Birgit
Kirsch, Miles
Shitov, Vladimir
Zhao, Zhiya
Eisenhaber, Frank
author_facet Tantoso, Erwin
Eisenhaber, Birgit
Kirsch, Miles
Shitov, Vladimir
Zhao, Zhiya
Eisenhaber, Frank
author_sort Tantoso, Erwin
collection PubMed
description BACKGROUND: Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome analysis has been attempted several times, there is no systematic functional characterization of the E. coli subgroups according to the gene profile. RESULTS: Systematically scanning for optimal parametrization, we have built the E. coli pangenome from 1324 complete genomes. The pangenome size is estimated to be ~25,000 gene families (GFs). Whereas the core genome diminishes as more genomes are added, the softcore genome (≥95% of strains) is stable with ~3000 GFs regardless of the total number of genomes. Apparently, the softcore genome (with a 92% or 95% generation threshold) can define the genome of a bacterial species listing the critically relevant, evolutionarily most conserved or important classes of GFs. Unsupervised clustering of common E. coli sequence types using the presence/absence GF matrix reveals distinct characteristics of E. coli phylogroups B1, B2, and E. We highlight the bi-lineage nature of B1, the variation of the secretion and of the iron acquisition systems in ST11 (E), and the incorporation of a highly conserved prophage into the genome of ST131 (B2). The tail structure of the prophage is evolutionarily related to R2-pyocin (a tailocin) from Pseudomonas aeruginosa PAO1. We hypothesize that this molecular machinery is highly likely to play an important role in protecting its own colonies; thus, contributing towards the rapid rise of pandemic E. coli ST131. CONCLUSIONS: This study has explored the optimized pangenome development in E. coli. We provide complete GF lists and the pangenome matrix as supplementary data for further studies. We identified biological characteristics of different E. coli subtypes, specifically for phylogroups B1, B2, and E. We found an operon-like genome region coding for a tailocin specific for ST131 strains. The latter is a potential killer weapon providing pandemic E. coli ST131 with an advantage in inter-bacterial competition and, suggestively, explains their dominance as human pathogen among E. coli strains. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01347-7.
format Online
Article
Text
id pubmed-9205054
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-92050542022-06-18 To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131 Tantoso, Erwin Eisenhaber, Birgit Kirsch, Miles Shitov, Vladimir Zhao, Zhiya Eisenhaber, Frank BMC Biol Research Article BACKGROUND: Escherichia coli (E. coli) has been one of the most studied model organisms in the history of life sciences. Initially thought just to be commensal bacteria, E. coli has shown wide phenotypic diversity including pathogenic isolates with great relevance to public health. Though pangenome analysis has been attempted several times, there is no systematic functional characterization of the E. coli subgroups according to the gene profile. RESULTS: Systematically scanning for optimal parametrization, we have built the E. coli pangenome from 1324 complete genomes. The pangenome size is estimated to be ~25,000 gene families (GFs). Whereas the core genome diminishes as more genomes are added, the softcore genome (≥95% of strains) is stable with ~3000 GFs regardless of the total number of genomes. Apparently, the softcore genome (with a 92% or 95% generation threshold) can define the genome of a bacterial species listing the critically relevant, evolutionarily most conserved or important classes of GFs. Unsupervised clustering of common E. coli sequence types using the presence/absence GF matrix reveals distinct characteristics of E. coli phylogroups B1, B2, and E. We highlight the bi-lineage nature of B1, the variation of the secretion and of the iron acquisition systems in ST11 (E), and the incorporation of a highly conserved prophage into the genome of ST131 (B2). The tail structure of the prophage is evolutionarily related to R2-pyocin (a tailocin) from Pseudomonas aeruginosa PAO1. We hypothesize that this molecular machinery is highly likely to play an important role in protecting its own colonies; thus, contributing towards the rapid rise of pandemic E. coli ST131. CONCLUSIONS: This study has explored the optimized pangenome development in E. coli. We provide complete GF lists and the pangenome matrix as supplementary data for further studies. We identified biological characteristics of different E. coli subtypes, specifically for phylogroups B1, B2, and E. We found an operon-like genome region coding for a tailocin specific for ST131 strains. The latter is a potential killer weapon providing pandemic E. coli ST131 with an advantage in inter-bacterial competition and, suggestively, explains their dominance as human pathogen among E. coli strains. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12915-022-01347-7. BioMed Central 2022-06-16 /pmc/articles/PMC9205054/ /pubmed/35710371 http://dx.doi.org/10.1186/s12915-022-01347-7 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Tantoso, Erwin
Eisenhaber, Birgit
Kirsch, Miles
Shitov, Vladimir
Zhao, Zhiya
Eisenhaber, Frank
To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
title To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
title_full To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
title_fullStr To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
title_full_unstemmed To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
title_short To kill or to be killed: pangenome analysis of Escherichia coli strains reveals a tailocin specific for pandemic ST131
title_sort to kill or to be killed: pangenome analysis of escherichia coli strains reveals a tailocin specific for pandemic st131
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9205054/
https://www.ncbi.nlm.nih.gov/pubmed/35710371
http://dx.doi.org/10.1186/s12915-022-01347-7
work_keys_str_mv AT tantosoerwin tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131
AT eisenhaberbirgit tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131
AT kirschmiles tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131
AT shitovvladimir tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131
AT zhaozhiya tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131
AT eisenhaberfrank tokillortobekilledpangenomeanalysisofescherichiacolistrainsrevealsatailocinspecificforpandemicst131