Cargando…

GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features

Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural fea...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Fuyi, Li, Chen, Revote, Jerico, Zhang, Yang, Webb, Geoffrey I., Li, Jian, Song, Jiangning, Lithgow, Trevor
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052564/
https://www.ncbi.nlm.nih.gov/pubmed/27708373
http://dx.doi.org/10.1038/srep34595
_version_ 1782458249015984128
author Li, Fuyi
Li, Chen
Revote, Jerico
Zhang, Yang
Webb, Geoffrey I.
Li, Jian
Song, Jiangning
Lithgow, Trevor
author_facet Li, Fuyi
Li, Chen
Revote, Jerico
Zhang, Yang
Webb, Geoffrey I.
Li, Jian
Song, Jiangning
Lithgow, Trevor
author_sort Li, Fuyi
collection PubMed
description Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMine(struct)(http://glycomine.erc.monash.edu/Lab/GlycoMine_Struct/) for improved prediction of human N- and O-linked glycosylation sites by combining sequence and structural features in an integrated computational framework with a two-step feature-selection strategy. Experiments indicated that GlycoMine(struct) outperformed NGlycPred, the only predictor that incorporated both sequence and structure features, achieving AUC values of 0.941 and 0.922 for N- and O-linked glycosylation, respectively, on an independent test dataset. We applied GlycoMine(struct) to screen the human structural proteome and obtained high-confidence predictions for N- and O-linked glycosylation sites. GlycoMine(struct) can be used as a powerful tool to expedite the discovery of glycosylation events and substrates to facilitate hypothesis-driven experimental studies.
format Online
Article
Text
id pubmed-5052564
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-50525642016-10-19 GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features Li, Fuyi Li, Chen Revote, Jerico Zhang, Yang Webb, Geoffrey I. Li, Jian Song, Jiangning Lithgow, Trevor Sci Rep Article Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMine(struct)(http://glycomine.erc.monash.edu/Lab/GlycoMine_Struct/) for improved prediction of human N- and O-linked glycosylation sites by combining sequence and structural features in an integrated computational framework with a two-step feature-selection strategy. Experiments indicated that GlycoMine(struct) outperformed NGlycPred, the only predictor that incorporated both sequence and structure features, achieving AUC values of 0.941 and 0.922 for N- and O-linked glycosylation, respectively, on an independent test dataset. We applied GlycoMine(struct) to screen the human structural proteome and obtained high-confidence predictions for N- and O-linked glycosylation sites. GlycoMine(struct) can be used as a powerful tool to expedite the discovery of glycosylation events and substrates to facilitate hypothesis-driven experimental studies. Nature Publishing Group 2016-10-06 /pmc/articles/PMC5052564/ /pubmed/27708373 http://dx.doi.org/10.1038/srep34595 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Li, Fuyi
Li, Chen
Revote, Jerico
Zhang, Yang
Webb, Geoffrey I.
Li, Jian
Song, Jiangning
Lithgow, Trevor
GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
title GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
title_full GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
title_fullStr GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
title_full_unstemmed GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
title_short GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
title_sort glycomine(struct): a new bioinformatics tool for highly accurate mapping of the human n-linked and o-linked glycoproteomes by incorporating structural features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052564/
https://www.ncbi.nlm.nih.gov/pubmed/27708373
http://dx.doi.org/10.1038/srep34595
work_keys_str_mv AT lifuyi glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT lichen glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT revotejerico glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT zhangyang glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT webbgeoffreyi glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT lijian glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT songjiangning glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures
AT lithgowtrevor glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures