Cargando…

Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases

Glycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions. The evolutionary basis for their complex and diverse modes of catalytic functions remain enigmatic. Here, based on deep mining of over half million GT-A fold sequences, we define a...

Descripción completa

Detalles Bibliográficos
Autores principales: Taujale, Rahil, Venkat, Aarya, Huang, Liang-Chin, Zhou, Zhongliang, Yeung, Wayland, Rasheed, Khaled M, Li, Sheng, Edison, Arthur S, Moremen, Kelley W, Kannan, Natarajan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: eLife Sciences Publications, Ltd 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7185993/
https://www.ncbi.nlm.nih.gov/pubmed/32234211
http://dx.doi.org/10.7554/eLife.54532
_version_ 1783526866914639872
author Taujale, Rahil
Venkat, Aarya
Huang, Liang-Chin
Zhou, Zhongliang
Yeung, Wayland
Rasheed, Khaled M
Li, Sheng
Edison, Arthur S
Moremen, Kelley W
Kannan, Natarajan
author_facet Taujale, Rahil
Venkat, Aarya
Huang, Liang-Chin
Zhou, Zhongliang
Yeung, Wayland
Rasheed, Khaled M
Li, Sheng
Edison, Arthur S
Moremen, Kelley W
Kannan, Natarajan
author_sort Taujale, Rahil
collection PubMed
description Glycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions. The evolutionary basis for their complex and diverse modes of catalytic functions remain enigmatic. Here, based on deep mining of over half million GT-A fold sequences, we define a minimal core component shared among functionally diverse enzymes. We find that variations in the common core and emergence of hypervariable loops extending from the core contributed to GT-A diversity. We provide a phylogenetic framework relating diverse GT-A fold families for the first time and show that inverting and retaining mechanisms emerged multiple times independently during evolution. Using evolutionary information encoded in primary sequences, we trained a machine learning classifier to predict donor specificity with nearly 90% accuracy and deployed it for the annotation of understudied GTs. Our studies provide an evolutionary framework for investigating complex relationships connecting GT-A fold sequence, structure, function and regulation.
format Online
Article
Text
id pubmed-7185993
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher eLife Sciences Publications, Ltd
record_format MEDLINE/PubMed
spelling pubmed-71859932020-04-29 Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases Taujale, Rahil Venkat, Aarya Huang, Liang-Chin Zhou, Zhongliang Yeung, Wayland Rasheed, Khaled M Li, Sheng Edison, Arthur S Moremen, Kelley W Kannan, Natarajan eLife Computational and Systems Biology Glycosyltransferases (GTs) are prevalent across the tree of life and regulate nearly all aspects of cellular functions. The evolutionary basis for their complex and diverse modes of catalytic functions remain enigmatic. Here, based on deep mining of over half million GT-A fold sequences, we define a minimal core component shared among functionally diverse enzymes. We find that variations in the common core and emergence of hypervariable loops extending from the core contributed to GT-A diversity. We provide a phylogenetic framework relating diverse GT-A fold families for the first time and show that inverting and retaining mechanisms emerged multiple times independently during evolution. Using evolutionary information encoded in primary sequences, we trained a machine learning classifier to predict donor specificity with nearly 90% accuracy and deployed it for the annotation of understudied GTs. Our studies provide an evolutionary framework for investigating complex relationships connecting GT-A fold sequence, structure, function and regulation. eLife Sciences Publications, Ltd 2020-04-01 /pmc/articles/PMC7185993/ /pubmed/32234211 http://dx.doi.org/10.7554/eLife.54532 Text en © 2020, Taujale et al http://creativecommons.org/licenses/by/4.0/ http://creativecommons.org/licenses/by/4.0/This article is distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use and redistribution provided that the original author and source are credited.
spellingShingle Computational and Systems Biology
Taujale, Rahil
Venkat, Aarya
Huang, Liang-Chin
Zhou, Zhongliang
Yeung, Wayland
Rasheed, Khaled M
Li, Sheng
Edison, Arthur S
Moremen, Kelley W
Kannan, Natarajan
Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases
title Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases
title_full Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases
title_fullStr Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases
title_full_unstemmed Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases
title_short Deep evolutionary analysis reveals the design principles of fold A glycosyltransferases
title_sort deep evolutionary analysis reveals the design principles of fold a glycosyltransferases
topic Computational and Systems Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7185993/
https://www.ncbi.nlm.nih.gov/pubmed/32234211
http://dx.doi.org/10.7554/eLife.54532
work_keys_str_mv AT taujalerahil deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT venkataarya deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT huangliangchin deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT zhouzhongliang deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT yeungwayland deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT rasheedkhaledm deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT lisheng deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT edisonarthurs deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT moremenkelleyw deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases
AT kannannatarajan deepevolutionaryanalysisrevealsthedesignprinciplesoffoldaglycosyltransferases