Cargando…

Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase

Countless reports describe the isolation and structural characterization of natural products, yet this information remains disconnected and underutilized. Using a cheminformatics approach, we leverage the reported observations of iridoid glucosides with the known phylogeny of a large iridoid produci...

Descripción completa

Detalles Bibliográficos
Autores principales: Rodríguez-López, Carlos E., Jiang, Yindi, Kamileen, Mohamed O., Lichman, Benjamin R., Hong, Benke, Vaillancourt, Brieanne, Buell, C. Robin, O'Connor, Sarah E.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048965/
https://www.ncbi.nlm.nih.gov/pubmed/35298643
http://dx.doi.org/10.1093/molbev/msac057
_version_ 1784696038243696640
author Rodríguez-López, Carlos E.
Jiang, Yindi
Kamileen, Mohamed O.
Lichman, Benjamin R.
Hong, Benke
Vaillancourt, Brieanne
Buell, C. Robin
O'Connor, Sarah E.
author_facet Rodríguez-López, Carlos E.
Jiang, Yindi
Kamileen, Mohamed O.
Lichman, Benjamin R.
Hong, Benke
Vaillancourt, Brieanne
Buell, C. Robin
O'Connor, Sarah E.
author_sort Rodríguez-López, Carlos E.
collection PubMed
description Countless reports describe the isolation and structural characterization of natural products, yet this information remains disconnected and underutilized. Using a cheminformatics approach, we leverage the reported observations of iridoid glucosides with the known phylogeny of a large iridoid producing plant family (Lamiaceae) to generate a set of biosynthetic pathways that best explain the extant iridoid chemical diversity. We developed a pathway reconstruction algorithm that connects iridoid reports via reactions and prunes this solution space by considering phylogenetic relationships between genera. We formulate a model that emulates the evolution of iridoid glucosides to create a synthetic data set, used to select the parameters that would best reconstruct the pathways, and apply them to the iridoid data set to generate pathway hypotheses. These computationally generated pathways were then used as the basis by which to select and screen biosynthetic enzyme candidates. Our model was successfully applied to discover a cytochrome P450 enzyme from Callicarpa americana that catalyzes the oxidation of bartsioside to aucubin, predicted by our model despite neither molecule having been observed in the genus. We also demonstrate aucubin synthase activity in orthologues of Vitex agnus-castus, and the outgroup Paulownia tomentosa, further strengthening the hypothesis, enabled by our model, that the reaction was present in the ancestral biosynthetic pathway. This is the first systematic hypothesis on the epi-iridoid glucosides biosynthesis in 25 years and sets the stage for streamlined work on the iridoid pathway. This work highlights how curation and computational analysis of widely available structural data can facilitate hypothesis-based gene discovery.
format Online
Article
Text
id pubmed-9048965
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-90489652022-04-29 Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase Rodríguez-López, Carlos E. Jiang, Yindi Kamileen, Mohamed O. Lichman, Benjamin R. Hong, Benke Vaillancourt, Brieanne Buell, C. Robin O'Connor, Sarah E. Mol Biol Evol Discoveries Countless reports describe the isolation and structural characterization of natural products, yet this information remains disconnected and underutilized. Using a cheminformatics approach, we leverage the reported observations of iridoid glucosides with the known phylogeny of a large iridoid producing plant family (Lamiaceae) to generate a set of biosynthetic pathways that best explain the extant iridoid chemical diversity. We developed a pathway reconstruction algorithm that connects iridoid reports via reactions and prunes this solution space by considering phylogenetic relationships between genera. We formulate a model that emulates the evolution of iridoid glucosides to create a synthetic data set, used to select the parameters that would best reconstruct the pathways, and apply them to the iridoid data set to generate pathway hypotheses. These computationally generated pathways were then used as the basis by which to select and screen biosynthetic enzyme candidates. Our model was successfully applied to discover a cytochrome P450 enzyme from Callicarpa americana that catalyzes the oxidation of bartsioside to aucubin, predicted by our model despite neither molecule having been observed in the genus. We also demonstrate aucubin synthase activity in orthologues of Vitex agnus-castus, and the outgroup Paulownia tomentosa, further strengthening the hypothesis, enabled by our model, that the reaction was present in the ancestral biosynthetic pathway. This is the first systematic hypothesis on the epi-iridoid glucosides biosynthesis in 25 years and sets the stage for streamlined work on the iridoid pathway. This work highlights how curation and computational analysis of widely available structural data can facilitate hypothesis-based gene discovery. Oxford University Press 2022-03-17 /pmc/articles/PMC9048965/ /pubmed/35298643 http://dx.doi.org/10.1093/molbev/msac057 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Society for Molecular Biology and Evolution. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Discoveries
Rodríguez-López, Carlos E.
Jiang, Yindi
Kamileen, Mohamed O.
Lichman, Benjamin R.
Hong, Benke
Vaillancourt, Brieanne
Buell, C. Robin
O'Connor, Sarah E.
Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase
title Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase
title_full Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase
title_fullStr Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase
title_full_unstemmed Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase
title_short Phylogeny-Aware Chemoinformatic Analysis of Chemical Diversity in Lamiaceae Enables Iridoid Pathway Assembly and Discovery of Aucubin Synthase
title_sort phylogeny-aware chemoinformatic analysis of chemical diversity in lamiaceae enables iridoid pathway assembly and discovery of aucubin synthase
topic Discoveries
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9048965/
https://www.ncbi.nlm.nih.gov/pubmed/35298643
http://dx.doi.org/10.1093/molbev/msac057
work_keys_str_mv AT rodriguezlopezcarlose phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT jiangyindi phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT kamileenmohamedo phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT lichmanbenjaminr phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT hongbenke phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT vaillancourtbrieanne phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT buellcrobin phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase
AT oconnorsarahe phylogenyawarechemoinformaticanalysisofchemicaldiversityinlamiaceaeenablesiridoidpathwayassemblyanddiscoveryofaucubinsynthase