Cargando…
GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features
Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural fea...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052564/ https://www.ncbi.nlm.nih.gov/pubmed/27708373 http://dx.doi.org/10.1038/srep34595 |
_version_ | 1782458249015984128 |
---|---|
author | Li, Fuyi Li, Chen Revote, Jerico Zhang, Yang Webb, Geoffrey I. Li, Jian Song, Jiangning Lithgow, Trevor |
author_facet | Li, Fuyi Li, Chen Revote, Jerico Zhang, Yang Webb, Geoffrey I. Li, Jian Song, Jiangning Lithgow, Trevor |
author_sort | Li, Fuyi |
collection | PubMed |
description | Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMine(struct)(http://glycomine.erc.monash.edu/Lab/GlycoMine_Struct/) for improved prediction of human N- and O-linked glycosylation sites by combining sequence and structural features in an integrated computational framework with a two-step feature-selection strategy. Experiments indicated that GlycoMine(struct) outperformed NGlycPred, the only predictor that incorporated both sequence and structure features, achieving AUC values of 0.941 and 0.922 for N- and O-linked glycosylation, respectively, on an independent test dataset. We applied GlycoMine(struct) to screen the human structural proteome and obtained high-confidence predictions for N- and O-linked glycosylation sites. GlycoMine(struct) can be used as a powerful tool to expedite the discovery of glycosylation events and substrates to facilitate hypothesis-driven experimental studies. |
format | Online Article Text |
id | pubmed-5052564 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Nature Publishing Group |
record_format | MEDLINE/PubMed |
spelling | pubmed-50525642016-10-19 GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features Li, Fuyi Li, Chen Revote, Jerico Zhang, Yang Webb, Geoffrey I. Li, Jian Song, Jiangning Lithgow, Trevor Sci Rep Article Glycosylation plays an important role in cell-cell adhesion, ligand-binding and subcellular recognition. Current approaches for predicting protein glycosylation are primarily based on sequence-derived features, while little work has been done to systematically assess the importance of structural features to glycosylation prediction. Here, we propose a novel bioinformatics method called GlycoMine(struct)(http://glycomine.erc.monash.edu/Lab/GlycoMine_Struct/) for improved prediction of human N- and O-linked glycosylation sites by combining sequence and structural features in an integrated computational framework with a two-step feature-selection strategy. Experiments indicated that GlycoMine(struct) outperformed NGlycPred, the only predictor that incorporated both sequence and structure features, achieving AUC values of 0.941 and 0.922 for N- and O-linked glycosylation, respectively, on an independent test dataset. We applied GlycoMine(struct) to screen the human structural proteome and obtained high-confidence predictions for N- and O-linked glycosylation sites. GlycoMine(struct) can be used as a powerful tool to expedite the discovery of glycosylation events and substrates to facilitate hypothesis-driven experimental studies. Nature Publishing Group 2016-10-06 /pmc/articles/PMC5052564/ /pubmed/27708373 http://dx.doi.org/10.1038/srep34595 Text en Copyright © 2016, The Author(s) http://creativecommons.org/licenses/by/4.0/ This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ |
spellingShingle | Article Li, Fuyi Li, Chen Revote, Jerico Zhang, Yang Webb, Geoffrey I. Li, Jian Song, Jiangning Lithgow, Trevor GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features |
title | GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features |
title_full | GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features |
title_fullStr | GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features |
title_full_unstemmed | GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features |
title_short | GlycoMine(struct): a new bioinformatics tool for highly accurate mapping of the human N-linked and O-linked glycoproteomes by incorporating structural features |
title_sort | glycomine(struct): a new bioinformatics tool for highly accurate mapping of the human n-linked and o-linked glycoproteomes by incorporating structural features |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5052564/ https://www.ncbi.nlm.nih.gov/pubmed/27708373 http://dx.doi.org/10.1038/srep34595 |
work_keys_str_mv | AT lifuyi glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT lichen glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT revotejerico glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT zhangyang glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT webbgeoffreyi glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT lijian glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT songjiangning glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures AT lithgowtrevor glycominestructanewbioinformaticstoolforhighlyaccuratemappingofthehumannlinkedandolinkedglycoproteomesbyincorporatingstructuralfeatures |