Cargando…

The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles

BACKGROUND: Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a burgeoning class of natural products with diverse activity that share a similar origin and common features in their biosynthetic pathways. The precursor peptides of these natural products are ribosomally pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Cox, Courtney L., Doroghazi, James R., Mitchell, Douglas A.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4603692/
https://www.ncbi.nlm.nih.gov/pubmed/26462797
http://dx.doi.org/10.1186/s12864-015-2008-0
_version_ 1782394937655951360
author Cox, Courtney L.
Doroghazi, James R.
Mitchell, Douglas A.
author_facet Cox, Courtney L.
Doroghazi, James R.
Mitchell, Douglas A.
author_sort Cox, Courtney L.
collection PubMed
description BACKGROUND: Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a burgeoning class of natural products with diverse activity that share a similar origin and common features in their biosynthetic pathways. The precursor peptides of these natural products are ribosomally produced, upon which a combination of modification enzymes installs diverse functional groups. This genetically encoded peptide-based strategy allows for rapid diversification of these natural products by mutation in the precursor genes merged with unique combinations of modification enzymes. Thiazole/oxazole-modified microcins (TOMMs) are a class of RiPPs defined by the presence of heterocycles derived from cysteine, serine, and threonine residues in the precursor peptide. TOMMs encompass a number of different families, including but not limited to the linear azol(in)e-containing peptides (streptolysin S, microcin B17, and plantazolicin), cyanobactins, thiopeptides, and bottromycins. Although many TOMMs have been explored, the increased availability of genome sequences has illuminated several unexplored TOMM producers. METHODS: All YcaO domain-containing proteins (D protein) and the surrounding genomic regions were were obtained from the European Molecular Biology Laboratory (EMBL) and the European Bioinformatics Institute (EBI). MultiGeneBlast was used to group gene clusters contain a D protein. A number of techniques were used to identify TOMM biosynthetic gene clusters from the D protein containing gene clusters. Precursor peptides from these gene clusters were also identified. Both sequence similarity and phylogenetic analysis were used to classify the 20 diverse TOMM clusters identified. RESULTS: Given the remarkable structural and functional diversity displayed by known TOMMs, a comprehensive bioinformatic study to catalog and classify the entire RiPP class was undertaken. Here we report the bioinformatic characterization of nearly 1,500 TOMM gene clusters from genomes in the European Molecular Biology Laboratory (EMBL) and the European Bioinformatics Institute (EBI) sequence repository. Genome mining suggests a complex diversification of modification enzymes and precursor peptides to create more than 20 distinct families of TOMMs, nine of which have not heretofore been described. Many of the identified TOMM families have an abundance of diverse precursor peptide sequences as well as unfamiliar combinations of modification enzymes, signifying a potential wealth of novel natural products on known and unknown biosynthetic scaffolds. Phylogenetic analysis suggests a widespread distribution of TOMMs across multiple phyla; however, producers of similar TOMMs are generally found in the same phylum with few exceptions. CONCLUSIONS: The comprehensive genome mining study described herein has uncovered a myriad of unique TOMM biosynthetic clusters and provides an atlas to guide future discovery efforts. These biosynthetic gene clusters are predicted to produce diverse final products, and the identification of additional combinations of modification enzymes could expand the potential of combinatorial natural product biosynthesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2008-0) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4603692
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-46036922015-10-14 The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles Cox, Courtney L. Doroghazi, James R. Mitchell, Douglas A. BMC Genomics Research Article BACKGROUND: Ribosomally synthesized and post-translationally modified peptides (RiPPs) are a burgeoning class of natural products with diverse activity that share a similar origin and common features in their biosynthetic pathways. The precursor peptides of these natural products are ribosomally produced, upon which a combination of modification enzymes installs diverse functional groups. This genetically encoded peptide-based strategy allows for rapid diversification of these natural products by mutation in the precursor genes merged with unique combinations of modification enzymes. Thiazole/oxazole-modified microcins (TOMMs) are a class of RiPPs defined by the presence of heterocycles derived from cysteine, serine, and threonine residues in the precursor peptide. TOMMs encompass a number of different families, including but not limited to the linear azol(in)e-containing peptides (streptolysin S, microcin B17, and plantazolicin), cyanobactins, thiopeptides, and bottromycins. Although many TOMMs have been explored, the increased availability of genome sequences has illuminated several unexplored TOMM producers. METHODS: All YcaO domain-containing proteins (D protein) and the surrounding genomic regions were were obtained from the European Molecular Biology Laboratory (EMBL) and the European Bioinformatics Institute (EBI). MultiGeneBlast was used to group gene clusters contain a D protein. A number of techniques were used to identify TOMM biosynthetic gene clusters from the D protein containing gene clusters. Precursor peptides from these gene clusters were also identified. Both sequence similarity and phylogenetic analysis were used to classify the 20 diverse TOMM clusters identified. RESULTS: Given the remarkable structural and functional diversity displayed by known TOMMs, a comprehensive bioinformatic study to catalog and classify the entire RiPP class was undertaken. Here we report the bioinformatic characterization of nearly 1,500 TOMM gene clusters from genomes in the European Molecular Biology Laboratory (EMBL) and the European Bioinformatics Institute (EBI) sequence repository. Genome mining suggests a complex diversification of modification enzymes and precursor peptides to create more than 20 distinct families of TOMMs, nine of which have not heretofore been described. Many of the identified TOMM families have an abundance of diverse precursor peptide sequences as well as unfamiliar combinations of modification enzymes, signifying a potential wealth of novel natural products on known and unknown biosynthetic scaffolds. Phylogenetic analysis suggests a widespread distribution of TOMMs across multiple phyla; however, producers of similar TOMMs are generally found in the same phylum with few exceptions. CONCLUSIONS: The comprehensive genome mining study described herein has uncovered a myriad of unique TOMM biosynthetic clusters and provides an atlas to guide future discovery efforts. These biosynthetic gene clusters are predicted to produce diverse final products, and the identification of additional combinations of modification enzymes could expand the potential of combinatorial natural product biosynthesis. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2008-0) contains supplementary material, which is available to authorized users. BioMed Central 2015-10-13 /pmc/articles/PMC4603692/ /pubmed/26462797 http://dx.doi.org/10.1186/s12864-015-2008-0 Text en © Cox et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Cox, Courtney L.
Doroghazi, James R.
Mitchell, Douglas A.
The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
title The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
title_full The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
title_fullStr The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
title_full_unstemmed The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
title_short The genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
title_sort genomic landscape of ribosomal peptides containing thiazole and oxazole heterocycles
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4603692/
https://www.ncbi.nlm.nih.gov/pubmed/26462797
http://dx.doi.org/10.1186/s12864-015-2008-0
work_keys_str_mv AT coxcourtneyl thegenomiclandscapeofribosomalpeptidescontainingthiazoleandoxazoleheterocycles
AT doroghazijamesr thegenomiclandscapeofribosomalpeptidescontainingthiazoleandoxazoleheterocycles
AT mitchelldouglasa thegenomiclandscapeofribosomalpeptidescontainingthiazoleandoxazoleheterocycles
AT coxcourtneyl genomiclandscapeofribosomalpeptidescontainingthiazoleandoxazoleheterocycles
AT doroghazijamesr genomiclandscapeofribosomalpeptidescontainingthiazoleandoxazoleheterocycles
AT mitchelldouglasa genomiclandscapeofribosomalpeptidescontainingthiazoleandoxazoleheterocycles