Cargando…

SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites

Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the siz...

Descripción completa

Detalles Bibliográficos
Autores principales: AL-barakati, Hussam J., McConnell, Evan W., Hicks, Leslie M., Poole, Leslie B., Newman, Robert H., KC, Dukka B.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062547/
https://www.ncbi.nlm.nih.gov/pubmed/30050050
http://dx.doi.org/10.1038/s41598-018-29126-x
_version_ 1783342391763140608
author AL-barakati, Hussam J.
McConnell, Evan W.
Hicks, Leslie M.
Poole, Leslie B.
Newman, Robert H.
KC, Dukka B.
author_facet AL-barakati, Hussam J.
McConnell, Evan W.
Hicks, Leslie M.
Poole, Leslie B.
Newman, Robert H.
KC, Dukka B.
author_sort AL-barakati, Hussam J.
collection PubMed
description Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the size and physiochemical properties of modified cysteine residues, sulfenylation can impact the cellular function of proteins in several different ways. Thus, the ability to rapidly and accurately identify putative sulfenylation sites in proteins will provide important insights into redox-dependent regulation of protein function in a variety of cellular contexts. Though bottom-up proteomic approaches, such as tandem mass spectrometry (MS/MS), provide a wealth of information about global changes in the sulfenylation state of proteins, MS/MS-based experiments are often labor-intensive, costly and technically challenging. Therefore, to complement existing proteomic approaches, researchers have developed a series of computational tools to identify putative sulfenylation sites on proteins. However, existing methods often suffer from low accuracy, specificity, and/or sensitivity. In this study, we developed SVM-SulfoSite, a novel sulfenylation prediction tool that uses support vector machines (SVM) to identify key determinants of sulfenylation among five feature classes: binary code, physiochemical properties, k-space amino acid pairs, amino acid composition and high-quality physiochemical indices. Using 10-fold cross-validation, SVM-SulfoSite achieved 95% sensitivity and 83% specificity, with an overall accuracy of 89% and Matthew’s correlation coefficient (MCC) of 0.79. Likewise, using an independent test set of experimentally identified sulfenylation sites, our method achieved scores of 74%, 62%, 80% and 0.42 for accuracy, sensitivity, specificity and MCC, with an area under the receiver operator characteristic (ROC) curve of 0.81. Moreover, in side-by-side comparisons, SVM-SulfoSite performed as well as or better than existing sulfenylation prediction tools. Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein S-sulfenylation.
format Online
Article
Text
id pubmed-6062547
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-60625472018-07-31 SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites AL-barakati, Hussam J. McConnell, Evan W. Hicks, Leslie M. Poole, Leslie B. Newman, Robert H. KC, Dukka B. Sci Rep Article Protein S-sulfenylation, which results from oxidation of free thiols on cysteine residues, has recently emerged as an important post-translational modification that regulates the structure and function of proteins involved in a variety of physiological and pathological processes. By altering the size and physiochemical properties of modified cysteine residues, sulfenylation can impact the cellular function of proteins in several different ways. Thus, the ability to rapidly and accurately identify putative sulfenylation sites in proteins will provide important insights into redox-dependent regulation of protein function in a variety of cellular contexts. Though bottom-up proteomic approaches, such as tandem mass spectrometry (MS/MS), provide a wealth of information about global changes in the sulfenylation state of proteins, MS/MS-based experiments are often labor-intensive, costly and technically challenging. Therefore, to complement existing proteomic approaches, researchers have developed a series of computational tools to identify putative sulfenylation sites on proteins. However, existing methods often suffer from low accuracy, specificity, and/or sensitivity. In this study, we developed SVM-SulfoSite, a novel sulfenylation prediction tool that uses support vector machines (SVM) to identify key determinants of sulfenylation among five feature classes: binary code, physiochemical properties, k-space amino acid pairs, amino acid composition and high-quality physiochemical indices. Using 10-fold cross-validation, SVM-SulfoSite achieved 95% sensitivity and 83% specificity, with an overall accuracy of 89% and Matthew’s correlation coefficient (MCC) of 0.79. Likewise, using an independent test set of experimentally identified sulfenylation sites, our method achieved scores of 74%, 62%, 80% and 0.42 for accuracy, sensitivity, specificity and MCC, with an area under the receiver operator characteristic (ROC) curve of 0.81. Moreover, in side-by-side comparisons, SVM-SulfoSite performed as well as or better than existing sulfenylation prediction tools. Together, these results suggest that our method represents a robust and complementary technique for advanced exploration of protein S-sulfenylation. Nature Publishing Group UK 2018-07-26 /pmc/articles/PMC6062547/ /pubmed/30050050 http://dx.doi.org/10.1038/s41598-018-29126-x Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
AL-barakati, Hussam J.
McConnell, Evan W.
Hicks, Leslie M.
Poole, Leslie B.
Newman, Robert H.
KC, Dukka B.
SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
title SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
title_full SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
title_fullStr SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
title_full_unstemmed SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
title_short SVM-SulfoSite: A support vector machine based predictor for sulfenylation sites
title_sort svm-sulfosite: a support vector machine based predictor for sulfenylation sites
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6062547/
https://www.ncbi.nlm.nih.gov/pubmed/30050050
http://dx.doi.org/10.1038/s41598-018-29126-x
work_keys_str_mv AT albarakatihussamj svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites
AT mcconnellevanw svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites
AT hickslesliem svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites
AT pooleleslieb svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites
AT newmanroberth svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites
AT kcdukkab svmsulfositeasupportvectormachinebasedpredictorforsulfenylationsites