Cargando…

Investigation and identification of protein γ-glutamyl carboxylation sites

BACKGROUND: Carboxylation is a modification of glutamate (Glu) residues which occurs post-translation that is catalyzed by γ-glutamyl carboxylase in the lumen of the endoplasmic reticulum. Vitamin K is a critical co-factor in the post-translational conversion of Glu residues to γ-carboxyglutamate (G...

Descripción completa

Detalles Bibliográficos
Autores principales: Lee, Tzong-Yi, Lu, Cheng-Tsung, Chen, Shu-An, Bretaña, Neil Arvin, Cheng, Tzu-Hsiu, Su, Min-Gang, Huang, Kai-Yao
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278826/
https://www.ncbi.nlm.nih.gov/pubmed/22372765
http://dx.doi.org/10.1186/1471-2105-12-S13-S10
_version_ 1782223610582138880
author Lee, Tzong-Yi
Lu, Cheng-Tsung
Chen, Shu-An
Bretaña, Neil Arvin
Cheng, Tzu-Hsiu
Su, Min-Gang
Huang, Kai-Yao
author_facet Lee, Tzong-Yi
Lu, Cheng-Tsung
Chen, Shu-An
Bretaña, Neil Arvin
Cheng, Tzu-Hsiu
Su, Min-Gang
Huang, Kai-Yao
author_sort Lee, Tzong-Yi
collection PubMed
description BACKGROUND: Carboxylation is a modification of glutamate (Glu) residues which occurs post-translation that is catalyzed by γ-glutamyl carboxylase in the lumen of the endoplasmic reticulum. Vitamin K is a critical co-factor in the post-translational conversion of Glu residues to γ-carboxyglutamate (Gla) residues. It has been shown that the process of carboxylation is involved in the blood clotting cascade, bone growth, and extraosseous calcification. However, studies in this field have been limited by the difficulty of experimentally studying substrate site specificity in γ-glutamyl carboxylation. In silico investigations have the potential for characterizing carboxylated sites before experiments are carried out. RESULTS: Because of the importance of γ-glutamyl carboxylation in biological mechanisms, this study investigates the substrate site specificity in carboxylation sites. It considers not only the composition of amino acids that surround carboxylation sites, but also the structural characteristics of these sites, including secondary structure and solvent-accessible surface area (ASA). The explored features are used to establish a predictive model for differentiating between carboxylation sites and non-carboxylation sites. A support vector machine (SVM) is employed to establish a predictive model with various features. A five-fold cross-validation evaluation reveals that the SVM model, trained with the combined features of positional weighted matrix (PWM), amino acid composition (AAC), and ASA, yields the highest accuracy (0.892). Furthermore, an independent testing set is constructed to evaluate whether the predictive model is over-fitted to the training set. CONCLUSIONS: Independent testing data that did not undergo the cross-validation process shows that the proposed model can differentiate between carboxylation sites and non-carboxylation sites. This investigation is the first to study carboxylation sites and to develop a system for identifying them. The proposed method is a practical means of preliminary analysis and greatly diminishes the total number of potential carboxylation sites requiring further experimental confirmation.
format Online
Article
Text
id pubmed-3278826
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-32788262012-02-14 Investigation and identification of protein γ-glutamyl carboxylation sites Lee, Tzong-Yi Lu, Cheng-Tsung Chen, Shu-An Bretaña, Neil Arvin Cheng, Tzu-Hsiu Su, Min-Gang Huang, Kai-Yao BMC Bioinformatics Proceedings BACKGROUND: Carboxylation is a modification of glutamate (Glu) residues which occurs post-translation that is catalyzed by γ-glutamyl carboxylase in the lumen of the endoplasmic reticulum. Vitamin K is a critical co-factor in the post-translational conversion of Glu residues to γ-carboxyglutamate (Gla) residues. It has been shown that the process of carboxylation is involved in the blood clotting cascade, bone growth, and extraosseous calcification. However, studies in this field have been limited by the difficulty of experimentally studying substrate site specificity in γ-glutamyl carboxylation. In silico investigations have the potential for characterizing carboxylated sites before experiments are carried out. RESULTS: Because of the importance of γ-glutamyl carboxylation in biological mechanisms, this study investigates the substrate site specificity in carboxylation sites. It considers not only the composition of amino acids that surround carboxylation sites, but also the structural characteristics of these sites, including secondary structure and solvent-accessible surface area (ASA). The explored features are used to establish a predictive model for differentiating between carboxylation sites and non-carboxylation sites. A support vector machine (SVM) is employed to establish a predictive model with various features. A five-fold cross-validation evaluation reveals that the SVM model, trained with the combined features of positional weighted matrix (PWM), amino acid composition (AAC), and ASA, yields the highest accuracy (0.892). Furthermore, an independent testing set is constructed to evaluate whether the predictive model is over-fitted to the training set. CONCLUSIONS: Independent testing data that did not undergo the cross-validation process shows that the proposed model can differentiate between carboxylation sites and non-carboxylation sites. This investigation is the first to study carboxylation sites and to develop a system for identifying them. The proposed method is a practical means of preliminary analysis and greatly diminishes the total number of potential carboxylation sites requiring further experimental confirmation. BioMed Central 2011-11-30 /pmc/articles/PMC3278826/ /pubmed/22372765 http://dx.doi.org/10.1186/1471-2105-12-S13-S10 Text en Copyright ©2011 Lee et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Proceedings
Lee, Tzong-Yi
Lu, Cheng-Tsung
Chen, Shu-An
Bretaña, Neil Arvin
Cheng, Tzu-Hsiu
Su, Min-Gang
Huang, Kai-Yao
Investigation and identification of protein γ-glutamyl carboxylation sites
title Investigation and identification of protein γ-glutamyl carboxylation sites
title_full Investigation and identification of protein γ-glutamyl carboxylation sites
title_fullStr Investigation and identification of protein γ-glutamyl carboxylation sites
title_full_unstemmed Investigation and identification of protein γ-glutamyl carboxylation sites
title_short Investigation and identification of protein γ-glutamyl carboxylation sites
title_sort investigation and identification of protein γ-glutamyl carboxylation sites
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3278826/
https://www.ncbi.nlm.nih.gov/pubmed/22372765
http://dx.doi.org/10.1186/1471-2105-12-S13-S10
work_keys_str_mv AT leetzongyi investigationandidentificationofproteingglutamylcarboxylationsites
AT luchengtsung investigationandidentificationofproteingglutamylcarboxylationsites
AT chenshuan investigationandidentificationofproteingglutamylcarboxylationsites
AT bretananeilarvin investigationandidentificationofproteingglutamylcarboxylationsites
AT chengtzuhsiu investigationandidentificationofproteingglutamylcarboxylationsites
AT sumingang investigationandidentificationofproteingglutamylcarboxylationsites
AT huangkaiyao investigationandidentificationofproteingglutamylcarboxylationsites