Cargando…

DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction

Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glyco...

Descripción completa

Detalles Bibliográficos
Autores principales: Pakhrin, Subash C., Aoki-Kinoshita, Kiyoko F., Caragea, Doina, KC, Dukka B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658957/
https://www.ncbi.nlm.nih.gov/pubmed/34885895
http://dx.doi.org/10.3390/molecules26237314
_version_ 1784612851605831680
author Pakhrin, Subash C.
Aoki-Kinoshita, Kiyoko F.
Caragea, Doina
KC, Dukka B.
author_facet Pakhrin, Subash C.
Aoki-Kinoshita, Kiyoko F.
Caragea, Doina
KC, Dukka B.
author_sort Pakhrin, Subash C.
collection PubMed
description Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glycosylation utilize the information that the glycosylation site occurs at the N-X-[S/T] sequon, where X is any amino acid except proline. Not all N-X-[S/T] sequons are glycosylated, thus the N-X-[S/T] sequon is a necessary but not sufficient determinant for protein glycosylation. In that regard, computational prediction of N-linked glycosylation sites confined to N-X-[S/T] sequons is an important problem. Here, we report DeepNGlyPred a deep learning-based approach that encodes the positive and negative sequences in the human proteome dataset (extracted from N-GlycositeAtlas) using sequence-based features (gapped-dipeptide), predicted structural features, and evolutionary information. DeepNGlyPred produces SN, SP, MCC, and ACC of 88.62%, 73.92%, 0.60, and 79.41%, respectively on N-GlyDE independent test set, which is better than the compared approaches. These results demonstrate that DeepNGlyPred is a robust computational technique to predict N-Linked glycosylation sites confined to N-X-[S/T] sequon. DeepNGlyPred will be a useful resource for the glycobiology community.
format Online
Article
Text
id pubmed-8658957
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-86589572021-12-10 DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction Pakhrin, Subash C. Aoki-Kinoshita, Kiyoko F. Caragea, Doina KC, Dukka B. Molecules Article Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glycosylation utilize the information that the glycosylation site occurs at the N-X-[S/T] sequon, where X is any amino acid except proline. Not all N-X-[S/T] sequons are glycosylated, thus the N-X-[S/T] sequon is a necessary but not sufficient determinant for protein glycosylation. In that regard, computational prediction of N-linked glycosylation sites confined to N-X-[S/T] sequons is an important problem. Here, we report DeepNGlyPred a deep learning-based approach that encodes the positive and negative sequences in the human proteome dataset (extracted from N-GlycositeAtlas) using sequence-based features (gapped-dipeptide), predicted structural features, and evolutionary information. DeepNGlyPred produces SN, SP, MCC, and ACC of 88.62%, 73.92%, 0.60, and 79.41%, respectively on N-GlyDE independent test set, which is better than the compared approaches. These results demonstrate that DeepNGlyPred is a robust computational technique to predict N-Linked glycosylation sites confined to N-X-[S/T] sequon. DeepNGlyPred will be a useful resource for the glycobiology community. MDPI 2021-12-02 /pmc/articles/PMC8658957/ /pubmed/34885895 http://dx.doi.org/10.3390/molecules26237314 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Pakhrin, Subash C.
Aoki-Kinoshita, Kiyoko F.
Caragea, Doina
KC, Dukka B.
DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
title DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
title_full DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
title_fullStr DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
title_full_unstemmed DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
title_short DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
title_sort deepnglypred: a deep neural network-based approach for human n-linked glycosylation site prediction
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658957/
https://www.ncbi.nlm.nih.gov/pubmed/34885895
http://dx.doi.org/10.3390/molecules26237314
work_keys_str_mv AT pakhrinsubashc deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction
AT aokikinoshitakiyokof deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction
AT carageadoina deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction
AT kcdukkab deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction