Cargando…

Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters

BACKGROUND: A wide range of bioactive compounds is produced by enzymes and enzymatic complexes encoded in biosynthetic gene clusters (BGCs). These BGCs can be identified and functionally annotated based on their DNA sequence. Candidates for further research and development may be prioritized based o...

Descripción completa

Detalles Bibliográficos
Autores principales: Sulheim, Snorre, Fossheim, Fredrik A., Wentzel, Alexander, Almaas, Eivind
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7901079/
https://www.ncbi.nlm.nih.gov/pubmed/33622234
http://dx.doi.org/10.1186/s12859-021-03985-0
_version_ 1783654330851655680
author Sulheim, Snorre
Fossheim, Fredrik A.
Wentzel, Alexander
Almaas, Eivind
author_facet Sulheim, Snorre
Fossheim, Fredrik A.
Wentzel, Alexander
Almaas, Eivind
author_sort Sulheim, Snorre
collection PubMed
description BACKGROUND: A wide range of bioactive compounds is produced by enzymes and enzymatic complexes encoded in biosynthetic gene clusters (BGCs). These BGCs can be identified and functionally annotated based on their DNA sequence. Candidates for further research and development may be prioritized based on properties such as their functional annotation, (dis)similarity to known BGCs, and bioactivity assays. Production of the target compound in the native strain is often not achievable, rendering heterologous expression in an optimized host strain as a promising alternative. Genome-scale metabolic models are frequently used to guide strain development, but large-scale incorporation and testing of heterologous production of complex natural products in this framework is hampered by the amount of manual work required to translate annotated BGCs to metabolic pathways. To this end, we have developed a pipeline for an automated reconstruction of BGC associated metabolic pathways responsible for the synthesis of non-ribosomal peptides and polyketides, two of the dominant classes of bioactive compounds. RESULTS: The developed pipeline correctly predicts 72.8% of the metabolic reactions in a detailed evaluation of 8 different BGCs comprising 228 functional domains. By introducing the reconstructed pathways into a genome-scale metabolic model we demonstrate that this level of accuracy is sufficient to make reliable in silico predictions with respect to production rate and gene knockout targets. Furthermore, we apply the pipeline to a large BGC database and reconstruct 943 metabolic pathways. We identify 17 enzymatic reactions using high-throughput assessment of potential knockout targets for increasing the production of any of the associated compounds. However, the targets only provide a relative increase of up to 6% compared to wild-type production rates. CONCLUSION: With this pipeline we pave the way for an extended use of genome-scale metabolic models in strain design of heterologous expression hosts. In this context, we identified generic knockout targets for the increased production of heterologous compounds. However, as the predicted increase is minor for any of the single-reaction knockout targets, these results indicate that more sophisticated strain-engineering strategies are necessary for the development of efficient BGC expression hosts.
format Online
Article
Text
id pubmed-7901079
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-79010792021-02-23 Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters Sulheim, Snorre Fossheim, Fredrik A. Wentzel, Alexander Almaas, Eivind BMC Bioinformatics Research Article BACKGROUND: A wide range of bioactive compounds is produced by enzymes and enzymatic complexes encoded in biosynthetic gene clusters (BGCs). These BGCs can be identified and functionally annotated based on their DNA sequence. Candidates for further research and development may be prioritized based on properties such as their functional annotation, (dis)similarity to known BGCs, and bioactivity assays. Production of the target compound in the native strain is often not achievable, rendering heterologous expression in an optimized host strain as a promising alternative. Genome-scale metabolic models are frequently used to guide strain development, but large-scale incorporation and testing of heterologous production of complex natural products in this framework is hampered by the amount of manual work required to translate annotated BGCs to metabolic pathways. To this end, we have developed a pipeline for an automated reconstruction of BGC associated metabolic pathways responsible for the synthesis of non-ribosomal peptides and polyketides, two of the dominant classes of bioactive compounds. RESULTS: The developed pipeline correctly predicts 72.8% of the metabolic reactions in a detailed evaluation of 8 different BGCs comprising 228 functional domains. By introducing the reconstructed pathways into a genome-scale metabolic model we demonstrate that this level of accuracy is sufficient to make reliable in silico predictions with respect to production rate and gene knockout targets. Furthermore, we apply the pipeline to a large BGC database and reconstruct 943 metabolic pathways. We identify 17 enzymatic reactions using high-throughput assessment of potential knockout targets for increasing the production of any of the associated compounds. However, the targets only provide a relative increase of up to 6% compared to wild-type production rates. CONCLUSION: With this pipeline we pave the way for an extended use of genome-scale metabolic models in strain design of heterologous expression hosts. In this context, we identified generic knockout targets for the increased production of heterologous compounds. However, as the predicted increase is minor for any of the single-reaction knockout targets, these results indicate that more sophisticated strain-engineering strategies are necessary for the development of efficient BGC expression hosts. BioMed Central 2021-02-23 /pmc/articles/PMC7901079/ /pubmed/33622234 http://dx.doi.org/10.1186/s12859-021-03985-0 Text en © The Author(s) 2021, corrected publication 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Sulheim, Snorre
Fossheim, Fredrik A.
Wentzel, Alexander
Almaas, Eivind
Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
title Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
title_full Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
title_fullStr Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
title_full_unstemmed Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
title_short Automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
title_sort automatic reconstruction of metabolic pathways from identified biosynthetic gene clusters
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7901079/
https://www.ncbi.nlm.nih.gov/pubmed/33622234
http://dx.doi.org/10.1186/s12859-021-03985-0
work_keys_str_mv AT sulheimsnorre automaticreconstructionofmetabolicpathwaysfromidentifiedbiosyntheticgeneclusters
AT fossheimfredrika automaticreconstructionofmetabolicpathwaysfromidentifiedbiosyntheticgeneclusters
AT wentzelalexander automaticreconstructionofmetabolicpathwaysfromidentifiedbiosyntheticgeneclusters
AT almaaseivind automaticreconstructionofmetabolicpathwaysfromidentifiedbiosyntheticgeneclusters