Cargando…

iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC

Carbonylation is a posttranslational modification (PTM or PTLM), where a carbonyl group is added to lysine (K), proline (P), arginine (R), and threonine (T) residue of a protein molecule. Carbonylation plays an important role in orchestrating various biological processes but it is also associated wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Jianhua, Liu, Zi, Xiao, Xuan, Liu, Bingxiang, Chou, Kuo-Chen
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Impact Journals LLC 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5085176/
https://www.ncbi.nlm.nih.gov/pubmed/27153555
http://dx.doi.org/10.18632/oncotarget.9148
_version_ 1782463521493090304
author Jia, Jianhua
Liu, Zi
Xiao, Xuan
Liu, Bingxiang
Chou, Kuo-Chen
author_facet Jia, Jianhua
Liu, Zi
Xiao, Xuan
Liu, Bingxiang
Chou, Kuo-Chen
author_sort Jia, Jianhua
collection PubMed
description Carbonylation is a posttranslational modification (PTM or PTLM), where a carbonyl group is added to lysine (K), proline (P), arginine (R), and threonine (T) residue of a protein molecule. Carbonylation plays an important role in orchestrating various biological processes but it is also associated with many diseases such as diabetes, chronic lung disease, Parkinson's disease, Alzheimer's disease, chronic renal failure, and sepsis. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of K, P, R, or T, which ones can be carbonylated, and which ones cannot? To address this problem, we have developed a predictor called iCar-PseCp by incorporating the sequence-coupled information into the general pseudo amino acid composition, and balancing out skewed training dataset by Monte Carlo sampling to expand positive subset. Rigorous target cross-validations on a same set of carbonylation-known proteins indicated that the new predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iCar-PseCp has been established at http://www.jci-bioinfo.cn/iCar-PseCp, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. It has not escaped our notice that the formulation and approach presented here can also be used to analyze many other problems in computational proteomics.
format Online
Article
Text
id pubmed-5085176
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Impact Journals LLC
record_format MEDLINE/PubMed
spelling pubmed-50851762016-10-31 iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC Jia, Jianhua Liu, Zi Xiao, Xuan Liu, Bingxiang Chou, Kuo-Chen Oncotarget Research Paper Carbonylation is a posttranslational modification (PTM or PTLM), where a carbonyl group is added to lysine (K), proline (P), arginine (R), and threonine (T) residue of a protein molecule. Carbonylation plays an important role in orchestrating various biological processes but it is also associated with many diseases such as diabetes, chronic lung disease, Parkinson's disease, Alzheimer's disease, chronic renal failure, and sepsis. Therefore, from the angles of both basic research and drug development, we are facing a challenging problem: for an uncharacterized protein sequence containing many residues of K, P, R, or T, which ones can be carbonylated, and which ones cannot? To address this problem, we have developed a predictor called iCar-PseCp by incorporating the sequence-coupled information into the general pseudo amino acid composition, and balancing out skewed training dataset by Monte Carlo sampling to expand positive subset. Rigorous target cross-validations on a same set of carbonylation-known proteins indicated that the new predictor remarkably outperformed its existing counterparts. For the convenience of most experimental scientists, a user-friendly web-server for iCar-PseCp has been established at http://www.jci-bioinfo.cn/iCar-PseCp, by which users can easily obtain their desired results without the need to go through the complicated mathematical equations involved. It has not escaped our notice that the formulation and approach presented here can also be used to analyze many other problems in computational proteomics. Impact Journals LLC 2016-05-03 /pmc/articles/PMC5085176/ /pubmed/27153555 http://dx.doi.org/10.18632/oncotarget.9148 Text en Copyright: © 2016 Jia et al. http://creativecommons.org/licenses/by/2.5/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Paper
Jia, Jianhua
Liu, Zi
Xiao, Xuan
Liu, Bingxiang
Chou, Kuo-Chen
iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC
title iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC
title_full iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC
title_fullStr iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC
title_full_unstemmed iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC
title_short iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC
title_sort icar-psecp: identify carbonylation sites in proteins by monte carlo sampling and incorporating sequence coupled effects into general pseaac
topic Research Paper
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5085176/
https://www.ncbi.nlm.nih.gov/pubmed/27153555
http://dx.doi.org/10.18632/oncotarget.9148
work_keys_str_mv AT jiajianhua icarpsecpidentifycarbonylationsitesinproteinsbymontecarlosamplingandincorporatingsequencecoupledeffectsintogeneralpseaac
AT liuzi icarpsecpidentifycarbonylationsitesinproteinsbymontecarlosamplingandincorporatingsequencecoupledeffectsintogeneralpseaac
AT xiaoxuan icarpsecpidentifycarbonylationsitesinproteinsbymontecarlosamplingandincorporatingsequencecoupledeffectsintogeneralpseaac
AT liubingxiang icarpsecpidentifycarbonylationsitesinproteinsbymontecarlosamplingandincorporatingsequencecoupledeffectsintogeneralpseaac
AT choukuochen icarpsecpidentifycarbonylationsitesinproteinsbymontecarlosamplingandincorporatingsequencecoupledeffectsintogeneralpseaac