Cargando…
SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the siz...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062547/ https://www.ncbi.nlm.nih.gov/pubmed/30050050 http://dx.doi.org/10.1038/s41598-018-29126-x |
_version_ | 1783342391763140608 |
---|---|
author | AL-barakati, Hussam J. McConnell, Evan W. Hicks, Leslie M. Poole, Leslie B. Newman, Robert H. KC, Dukka B. |
author_facet | AL-barakati, Hussam J. McConnell, Evan W. Hicks, Leslie M. Poole, Leslie B. Newman, Robert H. KC, Dukka B. |
author_sort | AL-barakati, Hussam J. |
collection | PubMed |
description | Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the size and physiochemical properties of modified cysteine residues, sulfenylation can impact the cellular function of proteins in several different ways. Thus, the ability to rapidly and accurately identify putative sulfenylation sites in proteins will provide important insights into redox-dependent regulation of protein function in a variety of cellular contexts. Though bottom-up proteomic approaches, such as tandem mass spectrometry (MS/MS), provide a wealth of information about global changes in the sulfenylation state of proteins, MS/MS-based experiments are often labor-intensive, costly and technically challenging. Therefore, to complement existing proteomic approaches, researchers have developed a series of computational tools to identify putative sulfenylation sites on proteins. However, existing methods often suffer from low accuracy, specificity, and/or sensitivity. In this study, we developed SVM-SulfoSite, a novel sulfenylation prediction tool that uses support vector machines (SVM) to identify key determinants of sulfenylation among five feature classes: binary code, physiochemical properties, k-space amino acid pairs, amino acid composition and high-quality physiochemical indices. Using 10-fold cross-validation, SVM-SulfoSite achieved 95% sensitivity and 83% specificity, with an overall accuracy of 89% and Matthew’s correlation coefficient (MCC) of 0.79. Likewise, using an independent test set of experimentally identified sulfenylation sites, our method achieved scores of 74%, 62%, 80% and 0.42 for accuracy, sensitivity, specificity and MCC, with an area under the receiver operator characteristic (ROC) curve of 0.81. Moreover, in side-by-side comparisons, SVM-SulfoSite performed as well as or better than existing sulfenylation prediction tools. Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein S-sulfenylation. |
format | Online Article Text |
id | pubmed-6062547 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-60625472018-07-31 SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites AL-barakati, Hussam J. McConnell, Evan W. Hicks, Leslie M. Poole, Leslie B. Newman, Robert H. KC, Dukka B. Sci Rep Article Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the size and physiochemical properties of modified cysteine residues, sulfenylation can impact the cellular function of proteins in several different ways. Thus, the ability to rapidly and accurately identify putative sulfenylation sites in proteins will provide important insights into redox-dependent regulation of protein function in a variety of cellular contexts. Though bottom-up proteomic approaches, such as tandem mass spectrometry (MS/MS), provide a wealth of information about global changes in the sulfenylation state of proteins, MS/MS-based experiments are often labor-intensive, costly and technically challenging. Therefore, to complement existing proteomic approaches, researchers have developed a series of computational tools to identify putative sulfenylation sites on proteins. However, existing methods often suffer from low accuracy, specificity, and/or sensitivity. In this study, we developed SVM-SulfoSite, a novel sulfenylation prediction tool that uses support vector machines (SVM) to identify key determinants of sulfenylation among five feature classes: binary code, physiochemical properties, k-space amino acid pairs, amino acid composition and high-quality physiochemical indices. Using 10-fold cross-validation, SVM-SulfoSite achieved 95% sensitivity and 83% specificity, with an overall accuracy of 89% and Matthew’s correlation coefficient (MCC) of 0.79. Likewise, using an independent test set of experimentally identified sulfenylation sites, our method achieved scores of 74%, 62%, 80% and 0.42 for accuracy, sensitivity, specificity and MCC, with an area under the receiver operator characteristic (ROC) curve of 0.81. Moreover, in side-by-side comparisons, SVM-SulfoSite performed as well as or better than existing sulfenylation prediction tools. Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein S-sulfenylation. Nature Publishing Group UK 2018-07-26 /pmc/articles/PMC6062547/ /pubmed/30050050 http://dx.doi.org/10.1038/s41598-018-29126-x Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article AL-barakati, Hussam J. McConnell, Evan W. Hicks, Leslie M. Poole, Leslie B. Newman, Robert H. KC, Dukka B. SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites |
title | SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites |
title_full | SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites |
title_fullStr | SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites |
title_full_unstemmed | SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites |
title_short | SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites |
title_sort | svm-sulfosite: a support vector machine based predictor for sulfenylation sites |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062547/ https://www.ncbi.nlm.nih.gov/pubmed/30050050 http://dx.doi.org/10.1038/s41598-018-29126-x |
work_keys_str_mv | AT albarakatihussamj svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites AT mcconnellevanw svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites AT hickslesliem svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites AT pooleleslieb svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites AT newmanroberth svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites AT kcdukkab svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites |