Cargando…

Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins

O-glycosylation of mammalian proteins is one of the important posttranslational modifications. We applied a support vector machine (SVM) to predict whether Ser or Thr is glycosylated, in order to elucidate the O-glycosylation mechanism. O-glycosylated sites were often found clustered along the seque...

Descripción completa

Detalles Bibliográficos
Autores principales: Nishikawa, Ikuko, Nakajima, Yukiko, Ito, Masahiro, Fukuchi, Satoshi, Homma, Keiichi, Nishikawa, Ken
Formato: Texto
Lenguaje:English
Publicado: Molecular Diversity Preservation International (MDPI) 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3100847/
https://www.ncbi.nlm.nih.gov/pubmed/21614187
http://dx.doi.org/10.3390/ijms11124991
_version_ 1782204221401071616
author Nishikawa, Ikuko
Nakajima, Yukiko
Ito, Masahiro
Fukuchi, Satoshi
Homma, Keiichi
Nishikawa, Ken
author_facet Nishikawa, Ikuko
Nakajima, Yukiko
Ito, Masahiro
Fukuchi, Satoshi
Homma, Keiichi
Nishikawa, Ken
author_sort Nishikawa, Ikuko
collection PubMed
description O-glycosylation of mammalian proteins is one of the important posttranslational modifications. We applied a support vector machine (SVM) to predict whether Ser or Thr is glycosylated, in order to elucidate the O-glycosylation mechanism. O-glycosylated sites were often found clustered along the sequence, whereas other sites were located sporadically. Therefore, we developed two types of SVMs for predicting clustered and isolated sites separately. We found that the amino acid composition was effective for predicting the clustered type, whereas the site-specific algorithm was effective for the isolated type. The highest prediction accuracy for the clustered type was 74%, while that for the isolated type was 79%. The existence frequency of amino acids around the O-glycosylation sites was different in the two types: namely, Pro, Val and Ala had high existence probabilities at each specific position relative to a glycosylation site, especially for the isolated type. Independent component analyses for the amino acid sequences around O-glycosylation sites showed the position-specific existences of the identified amino acids as independent components. The O-glycosylation sites were preferentially located within intrinsically disordered regions of extracellular proteins: particularly, more than 90% of the clustered O-GalNAc glycosylation sites were observed in intrinsically disordered regions. This feature could be the key for understanding the non-conservation property of O-glycosylation, and its role in functional diversity and structural stability.
format Text
id pubmed-3100847
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Molecular Diversity Preservation International (MDPI)
record_format MEDLINE/PubMed
spelling pubmed-31008472011-05-25 Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins Nishikawa, Ikuko Nakajima, Yukiko Ito, Masahiro Fukuchi, Satoshi Homma, Keiichi Nishikawa, Ken Int J Mol Sci Article O-glycosylation of mammalian proteins is one of the important posttranslational modifications. We applied a support vector machine (SVM) to predict whether Ser or Thr is glycosylated, in order to elucidate the O-glycosylation mechanism. O-glycosylated sites were often found clustered along the sequence, whereas other sites were located sporadically. Therefore, we developed two types of SVMs for predicting clustered and isolated sites separately. We found that the amino acid composition was effective for predicting the clustered type, whereas the site-specific algorithm was effective for the isolated type. The highest prediction accuracy for the clustered type was 74%, while that for the isolated type was 79%. The existence frequency of amino acids around the O-glycosylation sites was different in the two types: namely, Pro, Val and Ala had high existence probabilities at each specific position relative to a glycosylation site, especially for the isolated type. Independent component analyses for the amino acid sequences around O-glycosylation sites showed the position-specific existences of the identified amino acids as independent components. The O-glycosylation sites were preferentially located within intrinsically disordered regions of extracellular proteins: particularly, more than 90% of the clustered O-GalNAc glycosylation sites were observed in intrinsically disordered regions. This feature could be the key for understanding the non-conservation property of O-glycosylation, and its role in functional diversity and structural stability. Molecular Diversity Preservation International (MDPI) 2010-12-03 /pmc/articles/PMC3100847/ /pubmed/21614187 http://dx.doi.org/10.3390/ijms11124991 Text en © 2010 by the authors; licensee MDPI, Basel, Switzerland. http://creativecommons.org/licenses/by/3.0 This article is an open-access article distributed under the terms and conditions of the Creative Commons Attribution license (http://creativecommons.org/licenses/by/3.0/).
spellingShingle Article
Nishikawa, Ikuko
Nakajima, Yukiko
Ito, Masahiro
Fukuchi, Satoshi
Homma, Keiichi
Nishikawa, Ken
Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins
title Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins
title_full Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins
title_fullStr Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins
title_full_unstemmed Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins
title_short Computational Prediction of O-linked Glycosylation Sites That Preferentially Map on Intrinsically Disordered Regions of Extracellular Proteins
title_sort computational prediction of o-linked glycosylation sites that preferentially map on intrinsically disordered regions of extracellular proteins
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3100847/
https://www.ncbi.nlm.nih.gov/pubmed/21614187
http://dx.doi.org/10.3390/ijms11124991
work_keys_str_mv AT nishikawaikuko computationalpredictionofolinkedglycosylationsitesthatpreferentiallymaponintrinsicallydisorderedregionsofextracellularproteins
AT nakajimayukiko computationalpredictionofolinkedglycosylationsitesthatpreferentiallymaponintrinsicallydisorderedregionsofextracellularproteins
AT itomasahiro computationalpredictionofolinkedglycosylationsitesthatpreferentiallymaponintrinsicallydisorderedregionsofextracellularproteins
AT fukuchisatoshi computationalpredictionofolinkedglycosylationsitesthatpreferentiallymaponintrinsicallydisorderedregionsofextracellularproteins
AT hommakeiichi computationalpredictionofolinkedglycosylationsitesthatpreferentiallymaponintrinsicallydisorderedregionsofextracellularproteins
AT nishikawaken computationalpredictionofolinkedglycosylationsitesthatpreferentiallymaponintrinsicallydisorderedregionsofextracellularproteins