Cargando…

Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes

The asparagine-X-serine/threonine (NXS/T) motif, where X is any amino acid except proline, is the consensus motif for N-linked glycosylation. Significant numbers of high-resolution crystal structures of glycosylated proteins allow us to carry out structural analysis of the N-linked glycosylation sit...

Descripción completa

Detalles Bibliográficos
Autores principales: Lam, Phuc Vinh Nguyen, Goldman, Radoslav, Karagiannis, Konstantinos, Narsule, Tejas, Simonyan, Vahan, Soika, Valerii, Mazumder, Raja
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3914773/
https://www.ncbi.nlm.nih.gov/pubmed/23459159
http://dx.doi.org/10.1016/j.gpb.2012.11.003
_version_ 1782302461130702848
author Lam, Phuc Vinh Nguyen
Goldman, Radoslav
Karagiannis, Konstantinos
Narsule, Tejas
Simonyan, Vahan
Soika, Valerii
Mazumder, Raja
author_facet Lam, Phuc Vinh Nguyen
Goldman, Radoslav
Karagiannis, Konstantinos
Narsule, Tejas
Simonyan, Vahan
Soika, Valerii
Mazumder, Raja
author_sort Lam, Phuc Vinh Nguyen
collection PubMed
description The asparagine-X-serine/threonine (NXS/T) motif, where X is any amino acid except proline, is the consensus motif for N-linked glycosylation. Significant numbers of high-resolution crystal structures of glycosylated proteins allow us to carry out structural analysis of the N-linked glycosylation sites (NGS). Our analysis shows that there is enough structural information from diverse glycoproteins to allow the development of rules which can be used to predict NGS. A Python-based tool was developed to investigate asparagines implicated in N-glycosylation in five species: Homo sapiens, Mus musculus, Drosophila melanogaster, Arabidopsis thaliana and Saccharomyces cerevisiae. Our analysis shows that 78% of all asparagines of NXS/T motif involved in N-glycosylation are localized in the loop/turn conformation in the human proteome. Similar distribution was revealed for all the other species examined. Comparative analysis of the occurrence of NXS/T motifs not known to be glycosylated and their reverse sequence (S/TXN) shows a similar distribution across the secondary structural elements, indicating that the NXS/T motif in itself is not biologically relevant. Based on our analysis, we have defined rules to determine NGS. Using machine learning methods based on these rules we can predict with 93% accuracy if a particular site will be glycosylated. If structural information is not available the tool uses structural prediction results resulting in 74% accuracy. The tool was used to identify glycosylation sites in 108 human proteins with structures and 2247 proteins without structures that have acquired NXS/T site/s due to non-synonymous variation. The tool, Structure Feature Analysis Tool (SFAT), is freely available to the public at http://hive.biochemistry.gwu.edu/tools/sfat.
format Online
Article
Text
id pubmed-3914773
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-39147732014-02-05 Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes Lam, Phuc Vinh Nguyen Goldman, Radoslav Karagiannis, Konstantinos Narsule, Tejas Simonyan, Vahan Soika, Valerii Mazumder, Raja Genomics Proteomics Bioinformatics Original Research The asparagine-X-serine/threonine (NXS/T) motif, where X is any amino acid except proline, is the consensus motif for N-linked glycosylation. Significant numbers of high-resolution crystal structures of glycosylated proteins allow us to carry out structural analysis of the N-linked glycosylation sites (NGS). Our analysis shows that there is enough structural information from diverse glycoproteins to allow the development of rules which can be used to predict NGS. A Python-based tool was developed to investigate asparagines implicated in N-glycosylation in five species: Homo sapiens, Mus musculus, Drosophila melanogaster, Arabidopsis thaliana and Saccharomyces cerevisiae. Our analysis shows that 78% of all asparagines of NXS/T motif involved in N-glycosylation are localized in the loop/turn conformation in the human proteome. Similar distribution was revealed for all the other species examined. Comparative analysis of the occurrence of NXS/T motifs not known to be glycosylated and their reverse sequence (S/TXN) shows a similar distribution across the secondary structural elements, indicating that the NXS/T motif in itself is not biologically relevant. Based on our analysis, we have defined rules to determine NGS. Using machine learning methods based on these rules we can predict with 93% accuracy if a particular site will be glycosylated. If structural information is not available the tool uses structural prediction results resulting in 74% accuracy. The tool was used to identify glycosylation sites in 108 human proteins with structures and 2247 proteins without structures that have acquired NXS/T site/s due to non-synonymous variation. The tool, Structure Feature Analysis Tool (SFAT), is freely available to the public at http://hive.biochemistry.gwu.edu/tools/sfat. Elsevier 2013-04 2013-02-28 /pmc/articles/PMC3914773/ /pubmed/23459159 http://dx.doi.org/10.1016/j.gpb.2012.11.003 Text en © 2013 Beijing Institute of Genomics, Chinese Academy of Sciences and Genetics Society of China. Production and hosting by Elsevier B.V. All rights reserved. http://creativecommons.org/licenses/by-nc-sa/3.0/ This is an open access article under the CC BY-NC-SA license (http://creativecommons.org/licenses/by-nc-sa/3.0/).
spellingShingle Original Research
Lam, Phuc Vinh Nguyen
Goldman, Radoslav
Karagiannis, Konstantinos
Narsule, Tejas
Simonyan, Vahan
Soika, Valerii
Mazumder, Raja
Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes
title Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes
title_full Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes
title_fullStr Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes
title_full_unstemmed Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes
title_short Structure-based Comparative Analysis and Prediction of N-linked Glycosylation Sites in Evolutionarily Distant Eukaryotes
title_sort structure-based comparative analysis and prediction of n-linked glycosylation sites in evolutionarily distant eukaryotes
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3914773/
https://www.ncbi.nlm.nih.gov/pubmed/23459159
http://dx.doi.org/10.1016/j.gpb.2012.11.003
work_keys_str_mv AT lamphucvinhnguyen structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes
AT goldmanradoslav structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes
AT karagianniskonstantinos structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes
AT narsuletejas structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes
AT simonyanvahan structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes
AT soikavalerii structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes
AT mazumderraja structurebasedcomparativeanalysisandpredictionofnlinkedglycosylationsitesinevolutionarilydistanteukaryotes