Cargando…
DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction
Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glyco...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658957/ https://www.ncbi.nlm.nih.gov/pubmed/34885895 http://dx.doi.org/10.3390/molecules26237314 |
_version_ | 1784612851605831680 |
---|---|
author | Pakhrin, Subash C. Aoki-Kinoshita, Kiyoko F. Caragea, Doina KC, Dukka B. |
author_facet | Pakhrin, Subash C. Aoki-Kinoshita, Kiyoko F. Caragea, Doina KC, Dukka B. |
author_sort | Pakhrin, Subash C. |
collection | PubMed |
description | Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glycosylation utilize the information that the glycosylation site occurs at the N-X-[S/T] sequon, where X is any amino acid except proline. Not all N-X-[S/T] sequons are glycosylated, thus the N-X-[S/T] sequon is a necessary but not sufficient determinant for protein glycosylation. In that regard, computational prediction of N-linked glycosylation sites confined to N-X-[S/T] sequons is an important problem. Here, we report DeepNGlyPred a deep learning-based approach that encodes the positive and negative sequences in the human proteome dataset (extracted from N-GlycositeAtlas) using sequence-based features (gapped-dipeptide), predicted structural features, and evolutionary information. DeepNGlyPred produces SN, SP, MCC, and ACC of 88.62%, 73.92%, 0.60, and 79.41%, respectively on N-GlyDE independent test set, which is better than the compared approaches. These results demonstrate that DeepNGlyPred is a robust computational technique to predict N-Linked glycosylation sites confined to N-X-[S/T] sequon. DeepNGlyPred will be a useful resource for the glycobiology community. |
format | Online Article Text |
id | pubmed-8658957 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-86589572021-12-10 DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction Pakhrin, Subash C. Aoki-Kinoshita, Kiyoko F. Caragea, Doina KC, Dukka B. Molecules Article Protein N-linked glycosylation is a post-translational modification that plays an important role in a myriad of biological processes. Computational prediction approaches serve as complementary methods for the characterization of glycosylation sites. Most of the existing predictors for N-linked glycosylation utilize the information that the glycosylation site occurs at the N-X-[S/T] sequon, where X is any amino acid except proline. Not all N-X-[S/T] sequons are glycosylated, thus the N-X-[S/T] sequon is a necessary but not sufficient determinant for protein glycosylation. In that regard, computational prediction of N-linked glycosylation sites confined to N-X-[S/T] sequons is an important problem. Here, we report DeepNGlyPred a deep learning-based approach that encodes the positive and negative sequences in the human proteome dataset (extracted from N-GlycositeAtlas) using sequence-based features (gapped-dipeptide), predicted structural features, and evolutionary information. DeepNGlyPred produces SN, SP, MCC, and ACC of 88.62%, 73.92%, 0.60, and 79.41%, respectively on N-GlyDE independent test set, which is better than the compared approaches. These results demonstrate that DeepNGlyPred is a robust computational technique to predict N-Linked glycosylation sites confined to N-X-[S/T] sequon. DeepNGlyPred will be a useful resource for the glycobiology community. MDPI 2021-12-02 /pmc/articles/PMC8658957/ /pubmed/34885895 http://dx.doi.org/10.3390/molecules26237314 Text en © 2021 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Pakhrin, Subash C. Aoki-Kinoshita, Kiyoko F. Caragea, Doina KC, Dukka B. DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction |
title | DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction |
title_full | DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction |
title_fullStr | DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction |
title_full_unstemmed | DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction |
title_short | DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction |
title_sort | deepnglypred: a deep neural network-based approach for human n-linked glycosylation site prediction |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8658957/ https://www.ncbi.nlm.nih.gov/pubmed/34885895 http://dx.doi.org/10.3390/molecules26237314 |
work_keys_str_mv | AT pakhrinsubashc deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction AT aokikinoshitakiyokof deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction AT carageadoina deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction AT kcdukkab deepnglypredadeepneuralnetworkbasedapproachforhumannlinkedglycosylationsiteprediction |