Cargando…
Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features
BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical rol...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995733/ https://www.ncbi.nlm.nih.gov/pubmed/27553667 http://dx.doi.org/10.1186/s12859-016-1185-4 |
_version_ | 1782449530895073280 |
---|---|
author | Sun, Ming-an Zhang, Qing Wang, Yejun Ge, Wei Guo, Dianjing |
author_facet | Sun, Ming-an Zhang, Qing Wang, Yejun Ge, Wei Guo, Dianjing |
author_sort | Sun, Ming-an |
collection | PubMed |
description | BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical roles for redox regulation and signaling. Recently, several computational tools have been developed for predicting redox-sensitive cysteines; however, those methods either only focus on catalytic redox-sensitive cysteines in thiol oxidoreductases, or heavily depend on protein structural data, thus cannot be widely used. RESULTS: In this study, we analyzed various sequence-based features potentially related to cysteine redox-sensitivity, and identified three types of features for efficient computational prediction of redox-sensitive cysteines. These features are: sequential distance to the nearby cysteines, PSSM profile and predicted secondary structure of flanking residues. After further feature selection using SVM-RFE, we developed Redox-Sensitive Cysteine Predictor (RSCP), a SVM based classifier for redox-sensitive cysteine prediction using primary sequence only. Using 10-fold cross-validation on RSC758 dataset, the accuracy, sensitivity, specificity, MCC and AUC were estimated as 0.679, 0.602, 0.756, 0.362 and 0.727, respectively. When evaluated using 10-fold cross-validation with BALOSCTdb dataset which has structure information, the model achieved performance comparable to current structure-based method. Further validation using an independent dataset indicates it is robust and of relatively better accuracy for predicting redox-sensitive cysteines from non-enzyme proteins. CONCLUSIONS: In this study, we developed a sequence-based classifier for predicting redox-sensitive cysteines. The major advantage of this method is that it does not rely on protein structure data, which ensures more extensive application compared to other current implementations. Accurate prediction of redox-sensitive cysteines not only enhances our understanding about the redox sensitivity of cysteine, it may also complement the proteomics approach and facilitate further experimental investigation of important redox-sensitive cysteines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1185-4) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-4995733 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-49957332016-09-07 Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features Sun, Ming-an Zhang, Qing Wang, Yejun Ge, Wei Guo, Dianjing BMC Bioinformatics Methodology Article BACKGROUND: Reactive oxygen species can modify the structure and function of proteins and may also act as important signaling molecules in various cellular processes. Cysteine thiol groups of proteins are particularly susceptible to oxidation. Meanwhile, their reversible oxidation is of critical roles for redox regulation and signaling. Recently, several computational tools have been developed for predicting redox-sensitive cysteines; however, those methods either only focus on catalytic redox-sensitive cysteines in thiol oxidoreductases, or heavily depend on protein structural data, thus cannot be widely used. RESULTS: In this study, we analyzed various sequence-based features potentially related to cysteine redox-sensitivity, and identified three types of features for efficient computational prediction of redox-sensitive cysteines. These features are: sequential distance to the nearby cysteines, PSSM profile and predicted secondary structure of flanking residues. After further feature selection using SVM-RFE, we developed Redox-Sensitive Cysteine Predictor (RSCP), a SVM based classifier for redox-sensitive cysteine prediction using primary sequence only. Using 10-fold cross-validation on RSC758 dataset, the accuracy, sensitivity, specificity, MCC and AUC were estimated as 0.679, 0.602, 0.756, 0.362 and 0.727, respectively. When evaluated using 10-fold cross-validation with BALOSCTdb dataset which has structure information, the model achieved performance comparable to current structure-based method. Further validation using an independent dataset indicates it is robust and of relatively better accuracy for predicting redox-sensitive cysteines from non-enzyme proteins. CONCLUSIONS: In this study, we developed a sequence-based classifier for predicting redox-sensitive cysteines. The major advantage of this method is that it does not rely on protein structure data, which ensures more extensive application compared to other current implementations. Accurate prediction of redox-sensitive cysteines not only enhances our understanding about the redox sensitivity of cysteine, it may also complement the proteomics approach and facilitate further experimental investigation of important redox-sensitive cysteines. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12859-016-1185-4) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-24 /pmc/articles/PMC4995733/ /pubmed/27553667 http://dx.doi.org/10.1186/s12859-016-1185-4 Text en © The Author(s). 2016 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Sun, Ming-an Zhang, Qing Wang, Yejun Ge, Wei Guo, Dianjing Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
title | Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
title_full | Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
title_fullStr | Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
title_full_unstemmed | Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
title_short | Prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
title_sort | prediction of redox-sensitive cysteines using sequential distance and other sequence-based features |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4995733/ https://www.ncbi.nlm.nih.gov/pubmed/27553667 http://dx.doi.org/10.1186/s12859-016-1185-4 |
work_keys_str_mv | AT sunmingan predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures AT zhangqing predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures AT wangyejun predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures AT gewei predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures AT guodianjing predictionofredoxsensitivecysteinesusingsequentialdistanceandothersequencebasedfeatures |