Cargando…

FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution

Secondary metabolites (SMs) are a vast group of compounds with different structures and properties that have been utilized as drugs, food additives, dyes, and as monomers for novel plastics. In many cases, the biosynthesis of SMs is catalysed by enzymes whose corresponding genes are co-localized in...

Descripción completa

Detalles Bibliográficos
Autores principales: Vignolle, Gabriel A., Schaffer, Denise, Zehetner, Leopold, Mach, Robert L., Mach-Aigner, Astrid R., Derntl, Christian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8476034/
https://www.ncbi.nlm.nih.gov/pubmed/34570757
http://dx.doi.org/10.1371/journal.pcbi.1009372
_version_ 1784575519395676160
author Vignolle, Gabriel A.
Schaffer, Denise
Zehetner, Leopold
Mach, Robert L.
Mach-Aigner, Astrid R.
Derntl, Christian
author_facet Vignolle, Gabriel A.
Schaffer, Denise
Zehetner, Leopold
Mach, Robert L.
Mach-Aigner, Astrid R.
Derntl, Christian
author_sort Vignolle, Gabriel A.
collection PubMed
description Secondary metabolites (SMs) are a vast group of compounds with different structures and properties that have been utilized as drugs, food additives, dyes, and as monomers for novel plastics. In many cases, the biosynthesis of SMs is catalysed by enzymes whose corresponding genes are co-localized in the genome in biosynthetic gene clusters (BGCs). Notably, BGCs may contain so-called gap genes, that are not involved in the biosynthesis of the SM. Current genome mining tools can identify BGCs, but they have problems with distinguishing essential genes from gap genes. This can and must be done by expensive, laborious, and time-consuming comparative genomic approaches or transcriptome analyses. In this study, we developed a method that allows semi-automated identification of essential genes in a BGC based on co-evolution analysis. To this end, the protein sequences of a BGC are blasted against a suitable proteome database. For each protein, a phylogenetic tree is created. The trees are compared by treeKO to detect co-evolution. The results of this comparison are visualized in different output formats, which are compared visually. Our results suggest that co-evolution is commonly occurring within BGCs, albeit not all, and that especially those genes that encode for enzymes of the biosynthetic pathway are co-evolutionary linked and can be identified with FunOrder. In light of the growing number of genomic data available, this will contribute to the studies of BGCs in native hosts and facilitate heterologous expression in other organisms with the aim of the discovery of novel SMs.
format Online
Article
Text
id pubmed-8476034
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-84760342021-09-28 FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution Vignolle, Gabriel A. Schaffer, Denise Zehetner, Leopold Mach, Robert L. Mach-Aigner, Astrid R. Derntl, Christian PLoS Comput Biol Research Article Secondary metabolites (SMs) are a vast group of compounds with different structures and properties that have been utilized as drugs, food additives, dyes, and as monomers for novel plastics. In many cases, the biosynthesis of SMs is catalysed by enzymes whose corresponding genes are co-localized in the genome in biosynthetic gene clusters (BGCs). Notably, BGCs may contain so-called gap genes, that are not involved in the biosynthesis of the SM. Current genome mining tools can identify BGCs, but they have problems with distinguishing essential genes from gap genes. This can and must be done by expensive, laborious, and time-consuming comparative genomic approaches or transcriptome analyses. In this study, we developed a method that allows semi-automated identification of essential genes in a BGC based on co-evolution analysis. To this end, the protein sequences of a BGC are blasted against a suitable proteome database. For each protein, a phylogenetic tree is created. The trees are compared by treeKO to detect co-evolution. The results of this comparison are visualized in different output formats, which are compared visually. Our results suggest that co-evolution is commonly occurring within BGCs, albeit not all, and that especially those genes that encode for enzymes of the biosynthetic pathway are co-evolutionary linked and can be identified with FunOrder. In light of the growing number of genomic data available, this will contribute to the studies of BGCs in native hosts and facilitate heterologous expression in other organisms with the aim of the discovery of novel SMs. Public Library of Science 2021-09-27 /pmc/articles/PMC8476034/ /pubmed/34570757 http://dx.doi.org/10.1371/journal.pcbi.1009372 Text en © 2021 Vignolle et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Vignolle, Gabriel A.
Schaffer, Denise
Zehetner, Leopold
Mach, Robert L.
Mach-Aigner, Astrid R.
Derntl, Christian
FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
title FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
title_full FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
title_fullStr FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
title_full_unstemmed FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
title_short FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
title_sort funorder: a robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8476034/
https://www.ncbi.nlm.nih.gov/pubmed/34570757
http://dx.doi.org/10.1371/journal.pcbi.1009372
work_keys_str_mv AT vignollegabriela funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution
AT schafferdenise funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution
AT zehetnerleopold funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution
AT machrobertl funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution
AT machaignerastridr funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution
AT derntlchristian funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution