Cargando…
FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution
Secondary metabolites (SMs) are a vast group of compounds with different structures and properties that have been utilized as drugs, food additives, dyes, and as monomers for novel plastics. In many cases, the biosynthesis of SMs is catalysed by enzymes whose corresponding genes are co-localized in...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8476034/ https://www.ncbi.nlm.nih.gov/pubmed/34570757 http://dx.doi.org/10.1371/journal.pcbi.1009372 |
_version_ | 1784575519395676160 |
---|---|
author | Vignolle, Gabriel A. Schaffer, Denise Zehetner, Leopold Mach, Robert L. Mach-Aigner, Astrid R. Derntl, Christian |
author_facet | Vignolle, Gabriel A. Schaffer, Denise Zehetner, Leopold Mach, Robert L. Mach-Aigner, Astrid R. Derntl, Christian |
author_sort | Vignolle, Gabriel A. |
collection | PubMed |
description | Secondary metabolites (SMs) are a vast group of compounds with different structures and properties that have been utilized as drugs, food additives, dyes, and as monomers for novel plastics. In many cases, the biosynthesis of SMs is catalysed by enzymes whose corresponding genes are co-localized in the genome in biosynthetic gene clusters (BGCs). Notably, BGCs may contain so-called gap genes, that are not involved in the biosynthesis of the SM. Current genome mining tools can identify BGCs, but they have problems with distinguishing essential genes from gap genes. This can and must be done by expensive, laborious, and time-consuming comparative genomic approaches or transcriptome analyses. In this study, we developed a method that allows semi-automated identification of essential genes in a BGC based on co-evolution analysis. To this end, the protein sequences of a BGC are blasted against a suitable proteome database. For each protein, a phylogenetic tree is created. The trees are compared by treeKO to detect co-evolution. The results of this comparison are visualized in different output formats, which are compared visually. Our results suggest that co-evolution is commonly occurring within BGCs, albeit not all, and that especially those genes that encode for enzymes of the biosynthetic pathway are co-evolutionary linked and can be identified with FunOrder. In light of the growing number of genomic data available, this will contribute to the studies of BGCs in native hosts and facilitate heterologous expression in other organisms with the aim of the discovery of novel SMs. |
format | Online Article Text |
id | pubmed-8476034 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-84760342021-09-28 FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution Vignolle, Gabriel A. Schaffer, Denise Zehetner, Leopold Mach, Robert L. Mach-Aigner, Astrid R. Derntl, Christian PLoS Comput Biol Research Article Secondary metabolites (SMs) are a vast group of compounds with different structures and properties that have been utilized as drugs, food additives, dyes, and as monomers for novel plastics. In many cases, the biosynthesis of SMs is catalysed by enzymes whose corresponding genes are co-localized in the genome in biosynthetic gene clusters (BGCs). Notably, BGCs may contain so-called gap genes, that are not involved in the biosynthesis of the SM. Current genome mining tools can identify BGCs, but they have problems with distinguishing essential genes from gap genes. This can and must be done by expensive, laborious, and time-consuming comparative genomic approaches or transcriptome analyses. In this study, we developed a method that allows semi-automated identification of essential genes in a BGC based on co-evolution analysis. To this end, the protein sequences of a BGC are blasted against a suitable proteome database. For each protein, a phylogenetic tree is created. The trees are compared by treeKO to detect co-evolution. The results of this comparison are visualized in different output formats, which are compared visually. Our results suggest that co-evolution is commonly occurring within BGCs, albeit not all, and that especially those genes that encode for enzymes of the biosynthetic pathway are co-evolutionary linked and can be identified with FunOrder. In light of the growing number of genomic data available, this will contribute to the studies of BGCs in native hosts and facilitate heterologous expression in other organisms with the aim of the discovery of novel SMs. Public Library of Science 2021-09-27 /pmc/articles/PMC8476034/ /pubmed/34570757 http://dx.doi.org/10.1371/journal.pcbi.1009372 Text en © 2021 Vignolle et al https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Vignolle, Gabriel A. Schaffer, Denise Zehetner, Leopold Mach, Robert L. Mach-Aigner, Astrid R. Derntl, Christian FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
title | FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
title_full | FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
title_fullStr | FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
title_full_unstemmed | FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
title_short | FunOrder: A robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
title_sort | funorder: a robust and semi-automated method for the identification of essential biosynthetic genes through computational molecular co-evolution |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8476034/ https://www.ncbi.nlm.nih.gov/pubmed/34570757 http://dx.doi.org/10.1371/journal.pcbi.1009372 |
work_keys_str_mv | AT vignollegabriela funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution AT schafferdenise funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution AT zehetnerleopold funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution AT machrobertl funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution AT machaignerastridr funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution AT derntlchristian funorderarobustandsemiautomatedmethodfortheidentificationofessentialbiosyntheticgenesthroughcomputationalmolecularcoevolution |