Cargando…

Navigating and expanding the roadmap of natural product genome mining tools

Natural products are structurally highly diverse and exhibit a wide array of biological activities. As a result, they serve as an important source of new drug leads. Traditionally, natural products have been discovered by bioactivity-guided fractionation. The advent of genome sequencing technology h...

Descripción completa

Detalles Bibliográficos
Autores principales: Biermann, Friederike, Wenski, Sebastian L, Helfrich, Eric J N
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Beilstein-Institut 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749553/
https://www.ncbi.nlm.nih.gov/pubmed/36570563
http://dx.doi.org/10.3762/bjoc.18.178
_version_ 1784850058373496832
author Biermann, Friederike
Wenski, Sebastian L
Helfrich, Eric J N
author_facet Biermann, Friederike
Wenski, Sebastian L
Helfrich, Eric J N
author_sort Biermann, Friederike
collection PubMed
description Natural products are structurally highly diverse and exhibit a wide array of biological activities. As a result, they serve as an important source of new drug leads. Traditionally, natural products have been discovered by bioactivity-guided fractionation. The advent of genome sequencing technology has resulted in the introduction of an alternative approach towards novel natural product scaffolds: Genome mining. Genome mining is an in-silico natural product discovery strategy in which sequenced genomes are analyzed for the potential of the associated organism to produce natural products. Seemingly universal biosynthetic principles have been deciphered for most natural product classes that are used to detect natural product biosynthetic gene clusters using pathway-encoded conserved key enzymes, domains, or motifs as bait. Several generations of highly sophisticated tools have been developed for the biosynthetic rule-based identification of natural product gene clusters. Apart from these hard-coded algorithms, multiple tools that use machine learning-based approaches have been designed to complement the existing genome mining tool set and focus on natural product gene clusters that lack genes with conserved signature sequences. In this perspective, we take a closer look at state-of-the-art genome mining tools that are based on either hard-coded rules or machine learning algorithms, with an emphasis on the confidence of their predictions and potential to identify non-canonical natural product biosynthetic gene clusters. We highlight the genome mining pipelines' current strengths and limitations by contrasting their advantages and disadvantages. Moreover, we introduce two indirect biosynthetic gene cluster identification strategies that complement current workflows. The combination of all genome mining approaches will pave the way towards a more comprehensive understanding of the full biosynthetic repertoire encoded in microbial genome sequences.
format Online
Article
Text
id pubmed-9749553
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Beilstein-Institut
record_format MEDLINE/PubMed
spelling pubmed-97495532022-12-23 Navigating and expanding the roadmap of natural product genome mining tools Biermann, Friederike Wenski, Sebastian L Helfrich, Eric J N Beilstein J Org Chem Commentary Natural products are structurally highly diverse and exhibit a wide array of biological activities. As a result, they serve as an important source of new drug leads. Traditionally, natural products have been discovered by bioactivity-guided fractionation. The advent of genome sequencing technology has resulted in the introduction of an alternative approach towards novel natural product scaffolds: Genome mining. Genome mining is an in-silico natural product discovery strategy in which sequenced genomes are analyzed for the potential of the associated organism to produce natural products. Seemingly universal biosynthetic principles have been deciphered for most natural product classes that are used to detect natural product biosynthetic gene clusters using pathway-encoded conserved key enzymes, domains, or motifs as bait. Several generations of highly sophisticated tools have been developed for the biosynthetic rule-based identification of natural product gene clusters. Apart from these hard-coded algorithms, multiple tools that use machine learning-based approaches have been designed to complement the existing genome mining tool set and focus on natural product gene clusters that lack genes with conserved signature sequences. In this perspective, we take a closer look at state-of-the-art genome mining tools that are based on either hard-coded rules or machine learning algorithms, with an emphasis on the confidence of their predictions and potential to identify non-canonical natural product biosynthetic gene clusters. We highlight the genome mining pipelines' current strengths and limitations by contrasting their advantages and disadvantages. Moreover, we introduce two indirect biosynthetic gene cluster identification strategies that complement current workflows. The combination of all genome mining approaches will pave the way towards a more comprehensive understanding of the full biosynthetic repertoire encoded in microbial genome sequences. Beilstein-Institut 2022-12-06 /pmc/articles/PMC9749553/ /pubmed/36570563 http://dx.doi.org/10.3762/bjoc.18.178 Text en Copyright © 2022, Biermann et al. https://creativecommons.org/licenses/by/4.0/This is an open access article licensed under the terms of the Beilstein-Institut Open Access License Agreement (https://www.beilstein-journals.org/bjoc/terms/terms), which is identical to the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0 (https://creativecommons.org/licenses/by/4.0/) ). The reuse of material under this license requires that the author(s), source and license are credited. Third-party material in this article could be subject to other licenses (typically indicated in the credit line), and in this case, users are required to obtain permission from the license holder to reuse the material.
spellingShingle Commentary
Biermann, Friederike
Wenski, Sebastian L
Helfrich, Eric J N
Navigating and expanding the roadmap of natural product genome mining tools
title Navigating and expanding the roadmap of natural product genome mining tools
title_full Navigating and expanding the roadmap of natural product genome mining tools
title_fullStr Navigating and expanding the roadmap of natural product genome mining tools
title_full_unstemmed Navigating and expanding the roadmap of natural product genome mining tools
title_short Navigating and expanding the roadmap of natural product genome mining tools
title_sort navigating and expanding the roadmap of natural product genome mining tools
topic Commentary
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9749553/
https://www.ncbi.nlm.nih.gov/pubmed/36570563
http://dx.doi.org/10.3762/bjoc.18.178
work_keys_str_mv AT biermannfriederike navigatingandexpandingtheroadmapofnaturalproductgenomeminingtools
AT wenskisebastianl navigatingandexpandingtheroadmapofnaturalproductgenomeminingtools
AT helfrichericjn navigatingandexpandingtheroadmapofnaturalproductgenomeminingtools