Cargando…

Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features

BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical rol...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Ming-an, Zhang, Qing, Wang, Yejun, Ge, Wei, Guo, Dianjing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995733/
https://www.ncbi.nlm.nih.gov/pubmed/27553667
http://dx.doi.org/10.1186/s12859-016-1185-4
_version_ 1782449530895073280
author Sun, Ming-an
Zhang, Qing
Wang, Yejun
Ge, Wei
Guo, Dianjing
author_facet Sun, Ming-an
Zhang, Qing
Wang, Yejun
Ge, Wei
Guo, Dianjing
author_sort Sun, Ming-an
collection PubMed
description BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical roles for redox regulation and signaling. Recently, several computational tools have been developed for predicting redox-sensitive cysteines; however, those methods either only focus on catalytic redox-sensitive cysteines in thiol oxidoreductases, or heavily depend on protein structural data, thus cannot be widely used. RESULTS: In this study, we analyzed various sequence-based features potentially related to cysteine redox-sensitivity, and identified three types of features for efficient computational prediction of redox-sensitive cysteines. These features are: sequential distance to the nearby cysteines, PSSM profile and predicted secondary structure of flanking residues. After further feature selection using SVM-RFE, we developed Redox-Sensitive Cysteine Predictor (RSCP), a SVM based classifier for redox-sensitive cysteine prediction using primary sequence only. Using 10-fold cross-validation on RSC758 dataset, the accuracy, sensitivity, specificity, MCC and AUC were estimated as 0.679, 0.602, 0.756, 0.362 and 0.727, respectively. When evaluated using 10-fold cross-validation with BALOSCTdb dataset which has structure information, the model achieved performance comparable to current structure-based method. Further validation using an independent dataset indicates it is robust and of relatively better accuracy for predicting redox-sensitive cysteines from non-enzyme proteins. CONCLUSIONS: In this study, we developed a sequence-based classifier for predicting redox-sensitive cysteines. The major advantage of this method is that it does not rely on protein structure data, which ensures more extensive application compared to other current implementations. Accurate prediction of redox-sensitive cysteines not only enhances our understanding about the redox sensitivity of cysteine, it may also complement the proteomics approach and facilitate further experimental investigation of important redox-sensitive cysteines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1185-4) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4995733
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-49957332016-09-07 Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features Sun, Ming-an Zhang, Qing Wang, Yejun Ge, Wei Guo, Dianjing BMC Bioinformatics Methodology Article BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical roles for redox regulation and signaling. Recently, several computational tools have been developed for predicting redox-sensitive cysteines; however, those methods either only focus on catalytic redox-sensitive cysteines in thiol oxidoreductases, or heavily depend on protein structural data, thus cannot be widely used. RESULTS: In this study, we analyzed various sequence-based features potentially related to cysteine redox-sensitivity, and identified three types of features for efficient computational prediction of redox-sensitive cysteines. These features are: sequential distance to the nearby cysteines, PSSM profile and predicted secondary structure of flanking residues. After further feature selection using SVM-RFE, we developed Redox-Sensitive Cysteine Predictor (RSCP), a SVM based classifier for redox-sensitive cysteine prediction using primary sequence only. Using 10-fold cross-validation on RSC758 dataset, the accuracy, sensitivity, specificity, MCC and AUC were estimated as 0.679, 0.602, 0.756, 0.362 and 0.727, respectively. When evaluated using 10-fold cross-validation with BALOSCTdb dataset which has structure information, the model achieved performance comparable to current structure-based method. Further validation using an independent dataset indicates it is robust and of relatively better accuracy for predicting redox-sensitive cysteines from non-enzyme proteins. CONCLUSIONS: In this study, we developed a sequence-based classifier for predicting redox-sensitive cysteines. The major advantage of this method is that it does not rely on protein structure data, which ensures more extensive application compared to other current implementations. Accurate prediction of redox-sensitive cysteines not only enhances our understanding about the redox sensitivity of cysteine, it may also complement the proteomics approach and facilitate further experimental investigation of important redox-sensitive cysteines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1185-4) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-24 /pmc/articles/PMC4995733/ /pubmed/27553667 http://dx.doi.org/10.1186/s12859-016-1185-4 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Sun, Ming-an
Zhang, Qing
Wang, Yejun
Ge, Wei
Guo, Dianjing
Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
title Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
title_full Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
title_fullStr Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
title_full_unstemmed Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
title_short Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
title_sort prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995733/
https://www.ncbi.nlm.nih.gov/pubmed/27553667
http://dx.doi.org/10.1186/s12859-016-1185-4
work_keys_str_mv AT sunmingan predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures
AT zhangqing predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures
AT wangyejun predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures
AT gewei predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures
AT guodianjing predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures