Cargando…

Gene duplications in the E. coli genome: common themes among pathotypes

BACKGROUND: Gene duplication underlies a significant proportion of gene functional diversity and genome complexity in both eukaryotes and prokaryotes. Although several reports in the literature described the duplication of specific genes in E. coli, a detailed analysis of the extent of gene duplicat...

Descripción completa

Detalles Bibliográficos
Autores principales: Bernabeu, Manuel, Sánchez-Herrero, José Francisco, Huedo, Pol, Prieto, Alejandro, Hüttener, Mário, Rozas, Julio, Juárez, Antonio
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480617/
https://www.ncbi.nlm.nih.gov/pubmed/31014240
http://dx.doi.org/10.1186/s12864-019-5683-4
_version_ 1783413606594904064
author Bernabeu, Manuel
Sánchez-Herrero, José Francisco
Huedo, Pol
Prieto, Alejandro
Hüttener, Mário
Rozas, Julio
Juárez, Antonio
author_facet Bernabeu, Manuel
Sánchez-Herrero, José Francisco
Huedo, Pol
Prieto, Alejandro
Hüttener, Mário
Rozas, Julio
Juárez, Antonio
author_sort Bernabeu, Manuel
collection PubMed
description BACKGROUND: Gene duplication underlies a significant proportion of gene functional diversity and genome complexity in both eukaryotes and prokaryotes. Although several reports in the literature described the duplication of specific genes in E. coli, a detailed analysis of the extent of gene duplications in this microorganism is needed. RESULTS: The genomes of the E. coli enteroaggregative strain 042 and other pathogenic strains contain duplications of the gene that codes for the global regulator Hha. To determine whether the presence of additional copies of the hha gene correlates with the presence of other genes, we performed a comparative genomic analysis between E. coli strains with and without hha duplications. The results showed that strains harboring additional copies of the hha gene also encode the yeeR irmA (aec69) gene cluster, which, in turn, is also duplicated in strain 042 and several other strains. The identification of these duplications prompted us to obtain a global map of gene duplications, first in strain 042 and later in other E. coli genomes. Duplications in the genomes of the enteroaggregative strain 042, the uropathogenic strain CFT073 and the enterohemorrhagic strain O145:H28 have been identified by a BLASTp protein similarity search. This algorithm was also used to evaluate the distribution of the identified duplicates among the genomes of a set of 28 representative E. coli strains. Despite the high genomic diversity of E. coli strains, we identified several duplicates in the genomes of almost all studied pathogenic strains. Most duplicated genes have no known function. Transcriptomic analysis also showed that most of these duplications are regulated by the H-NS/Hha proteins. CONCLUSIONS: Several duplicated genes are widely distributed among pathogenic E. coli strains. In addition, some duplicated genes are present only in specific pathotypes, and others are strain specific. This gene duplication analysis shows novel relationships between E. coli pathotypes and suggests that newly identified genes that are duplicated in a high percentage of pathogenic E. coli isolates may play a role in virulence. Our study also shows a relationship between the duplication of genes encoding regulators and genes encoding their targets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5683-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6480617
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-64806172019-05-01 Gene duplications in the E. coli genome: common themes among pathotypes Bernabeu, Manuel Sánchez-Herrero, José Francisco Huedo, Pol Prieto, Alejandro Hüttener, Mário Rozas, Julio Juárez, Antonio BMC Genomics Research Article BACKGROUND: Gene duplication underlies a significant proportion of gene functional diversity and genome complexity in both eukaryotes and prokaryotes. Although several reports in the literature described the duplication of specific genes in E. coli, a detailed analysis of the extent of gene duplications in this microorganism is needed. RESULTS: The genomes of the E. coli enteroaggregative strain 042 and other pathogenic strains contain duplications of the gene that codes for the global regulator Hha. To determine whether the presence of additional copies of the hha gene correlates with the presence of other genes, we performed a comparative genomic analysis between E. coli strains with and without hha duplications. The results showed that strains harboring additional copies of the hha gene also encode the yeeR irmA (aec69) gene cluster, which, in turn, is also duplicated in strain 042 and several other strains. The identification of these duplications prompted us to obtain a global map of gene duplications, first in strain 042 and later in other E. coli genomes. Duplications in the genomes of the enteroaggregative strain 042, the uropathogenic strain CFT073 and the enterohemorrhagic strain O145:H28 have been identified by a BLASTp protein similarity search. This algorithm was also used to evaluate the distribution of the identified duplicates among the genomes of a set of 28 representative E. coli strains. Despite the high genomic diversity of E. coli strains, we identified several duplicates in the genomes of almost all studied pathogenic strains. Most duplicated genes have no known function. Transcriptomic analysis also showed that most of these duplications are regulated by the H-NS/Hha proteins. CONCLUSIONS: Several duplicated genes are widely distributed among pathogenic E. coli strains. In addition, some duplicated genes are present only in specific pathotypes, and others are strain specific. This gene duplication analysis shows novel relationships between E. coli pathotypes and suggests that newly identified genes that are duplicated in a high percentage of pathogenic E. coli isolates may play a role in virulence. Our study also shows a relationship between the duplication of genes encoding regulators and genes encoding their targets. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-019-5683-4) contains supplementary material, which is available to authorized users. BioMed Central 2019-04-24 /pmc/articles/PMC6480617/ /pubmed/31014240 http://dx.doi.org/10.1186/s12864-019-5683-4 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Bernabeu, Manuel
Sánchez-Herrero, José Francisco
Huedo, Pol
Prieto, Alejandro
Hüttener, Mário
Rozas, Julio
Juárez, Antonio
Gene duplications in the E. coli genome: common themes among pathotypes
title Gene duplications in the E. coli genome: common themes among pathotypes
title_full Gene duplications in the E. coli genome: common themes among pathotypes
title_fullStr Gene duplications in the E. coli genome: common themes among pathotypes
title_full_unstemmed Gene duplications in the E. coli genome: common themes among pathotypes
title_short Gene duplications in the E. coli genome: common themes among pathotypes
title_sort gene duplications in the e. coli genome: common themes among pathotypes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6480617/
https://www.ncbi.nlm.nih.gov/pubmed/31014240
http://dx.doi.org/10.1186/s12864-019-5683-4
work_keys_str_mv AT bernabeumanuel geneduplicationsintheecoligenomecommonthemesamongpathotypes
AT sanchezherrerojosefrancisco geneduplicationsintheecoligenomecommonthemesamongpathotypes
AT huedopol geneduplicationsintheecoligenomecommonthemesamongpathotypes
AT prietoalejandro geneduplicationsintheecoligenomecommonthemesamongpathotypes
AT huttenermario geneduplicationsintheecoligenomecommonthemesamongpathotypes
AT rozasjulio geneduplicationsintheecoligenomecommonthemesamongpathotypes
AT juarezantonio geneduplicationsintheecoligenomecommonthemesamongpathotypes