Cargando…

Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides

Microbial natural products constitute a wide variety of chemical compounds, many which can have antibiotic, antiviral, or anticancer properties that make them interesting for clinical purposes. Natural product classes include polyketides (PKs), nonribosomal peptides (NRPs), and ribosomally synthesiz...

Descripción completa

Detalles Bibliográficos
Autores principales: Kloosterman, Alexander M., Cimermancic, Peter, Elsayed, Somayah S., Du, Chao, Hadjithomas, Michalis, Donia, Mohamed S., Fischbach, Michael A., van Wezel, Gilles P., Medema, Marnix H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7794033/
https://www.ncbi.nlm.nih.gov/pubmed/33351797
http://dx.doi.org/10.1371/journal.pbio.3001026
_version_ 1783634125129777152
author Kloosterman, Alexander M.
Cimermancic, Peter
Elsayed, Somayah S.
Du, Chao
Hadjithomas, Michalis
Donia, Mohamed S.
Fischbach, Michael A.
van Wezel, Gilles P.
Medema, Marnix H.
author_facet Kloosterman, Alexander M.
Cimermancic, Peter
Elsayed, Somayah S.
Du, Chao
Hadjithomas, Michalis
Donia, Mohamed S.
Fischbach, Michael A.
van Wezel, Gilles P.
Medema, Marnix H.
author_sort Kloosterman, Alexander M.
collection PubMed
description Microbial natural products constitute a wide variety of chemical compounds, many which can have antibiotic, antiviral, or anticancer properties that make them interesting for clinical purposes. Natural product classes include polyketides (PKs), nonribosomal peptides (NRPs), and ribosomally synthesized and post-translationally modified peptides (RiPPs). While variants of biosynthetic gene clusters (BGCs) for known classes of natural products are easy to identify in genome sequences, BGCs for new compound classes escape attention. In particular, evidence is accumulating that for RiPPs, subclasses known thus far may only represent the tip of an iceberg. Here, we present decRiPPter (Data-driven Exploratory Class-independent RiPP TrackER), a RiPP genome mining algorithm aimed at the discovery of novel RiPP classes. DecRiPPter combines a Support Vector Machine (SVM) that identifies candidate RiPP precursors with pan-genomic analyses to identify which of these are encoded within operon-like structures that are part of the accessory genome of a genus. Subsequently, it prioritizes such regions based on the presence of new enzymology and based on patterns of gene cluster and precursor peptide conservation across species. We then applied decRiPPter to mine 1,295 Streptomyces genomes, which led to the identification of 42 new candidate RiPP families that could not be found by existing programs. One of these was studied further and elucidated as a representative of a novel subfamily of lanthipeptides, which we designate class V. The 2D structure of the new RiPP, which we name pristinin A3 (1), was solved using nuclear magnetic resonance (NMR), tandem mass spectrometry (MS/MS) data, and chemical labeling. Two previously unidentified modifying enzymes are proposed to create the hallmark lanthionine bridges. Taken together, our work highlights how novel natural product families can be discovered by methods going beyond sequence similarity searches to integrate multiple pathway discovery criteria.
format Online
Article
Text
id pubmed-7794033
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-77940332021-01-21 Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides Kloosterman, Alexander M. Cimermancic, Peter Elsayed, Somayah S. Du, Chao Hadjithomas, Michalis Donia, Mohamed S. Fischbach, Michael A. van Wezel, Gilles P. Medema, Marnix H. PLoS Biol Methods and Resources Microbial natural products constitute a wide variety of chemical compounds, many which can have antibiotic, antiviral, or anticancer properties that make them interesting for clinical purposes. Natural product classes include polyketides (PKs), nonribosomal peptides (NRPs), and ribosomally synthesized and post-translationally modified peptides (RiPPs). While variants of biosynthetic gene clusters (BGCs) for known classes of natural products are easy to identify in genome sequences, BGCs for new compound classes escape attention. In particular, evidence is accumulating that for RiPPs, subclasses known thus far may only represent the tip of an iceberg. Here, we present decRiPPter (Data-driven Exploratory Class-independent RiPP TrackER), a RiPP genome mining algorithm aimed at the discovery of novel RiPP classes. DecRiPPter combines a Support Vector Machine (SVM) that identifies candidate RiPP precursors with pan-genomic analyses to identify which of these are encoded within operon-like structures that are part of the accessory genome of a genus. Subsequently, it prioritizes such regions based on the presence of new enzymology and based on patterns of gene cluster and precursor peptide conservation across species. We then applied decRiPPter to mine 1,295 Streptomyces genomes, which led to the identification of 42 new candidate RiPP families that could not be found by existing programs. One of these was studied further and elucidated as a representative of a novel subfamily of lanthipeptides, which we designate class V. The 2D structure of the new RiPP, which we name pristinin A3 (1), was solved using nuclear magnetic resonance (NMR), tandem mass spectrometry (MS/MS) data, and chemical labeling. Two previously unidentified modifying enzymes are proposed to create the hallmark lanthionine bridges. Taken together, our work highlights how novel natural product families can be discovered by methods going beyond sequence similarity searches to integrate multiple pathway discovery criteria. Public Library of Science 2020-12-22 /pmc/articles/PMC7794033/ /pubmed/33351797 http://dx.doi.org/10.1371/journal.pbio.3001026 Text en © 2020 Kloosterman et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Methods and Resources
Kloosterman, Alexander M.
Cimermancic, Peter
Elsayed, Somayah S.
Du, Chao
Hadjithomas, Michalis
Donia, Mohamed S.
Fischbach, Michael A.
van Wezel, Gilles P.
Medema, Marnix H.
Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
title Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
title_full Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
title_fullStr Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
title_full_unstemmed Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
title_short Expansion of RiPP biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
title_sort expansion of ripp biosynthetic space through integration of pan-genomics and machine learning uncovers a novel class of lanthipeptides
topic Methods and Resources
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7794033/
https://www.ncbi.nlm.nih.gov/pubmed/33351797
http://dx.doi.org/10.1371/journal.pbio.3001026
work_keys_str_mv AT kloostermanalexanderm expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT cimermancicpeter expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT elsayedsomayahs expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT duchao expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT hadjithomasmichalis expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT doniamohameds expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT fischbachmichaela expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT vanwezelgillesp expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides
AT medemamarnixh expansionofrippbiosyntheticspacethroughintegrationofpangenomicsandmachinelearninguncoversanovelclassoflanthipeptides