Cargando…

SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites

BACKGROUND: Protein S-sulfenylation is a type of post-translational modification (PTM) involving the covalent binding of a hydroxyl group to the thiol of a cysteine amino acid. Recent evidence has shown the importance of S-sulfenylation in various biological processes, including transcriptional regu...

Descripción completa

Detalles Bibliográficos
Autores principales: Bui, Van-Minh, Weng, Shun-Long, Lu, Cheng-Tsung, Chang, Tzu-Hao, Weng, Julia Tzu-Ya, Lee, Tzong-Yi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895302/
https://www.ncbi.nlm.nih.gov/pubmed/26819243
http://dx.doi.org/10.1186/s12864-015-2299-1
_version_ 1782435821033357312
author Bui, Van-Minh
Weng, Shun-Long
Lu, Cheng-Tsung
Chang, Tzu-Hao
Weng, Julia Tzu-Ya
Lee, Tzong-Yi
author_facet Bui, Van-Minh
Weng, Shun-Long
Lu, Cheng-Tsung
Chang, Tzu-Hao
Weng, Julia Tzu-Ya
Lee, Tzong-Yi
author_sort Bui, Van-Minh
collection PubMed
description BACKGROUND: Protein S-sulfenylation is a type of post-translational modification (PTM) involving the covalent binding of a hydroxyl group to the thiol of a cysteine amino acid. Recent evidence has shown the importance of S-sulfenylation in various biological processes, including transcriptional regulation, apoptosis and cytokine signaling. Determining the specific sites of S-sulfenylation is fundamental to understanding the structures and functions of S-sulfenylated proteins. However, the current lack of reliable tools often limits researchers to use expensive and time-consuming laboratory techniques for the identification of S-sulfenylation sites. Thus, we were motivated to develop a bioinformatics method for investigating S-sulfenylation sites based on amino acid compositions and physicochemical properties. RESULTS: In this work, physicochemical properties were utilized not only to identify S-sulfenylation sites from 1,096 experimentally verified S-sulfenylated proteins, but also to compare the effectiveness of prediction with other characteristics such as amino acid composition (AAC), amino acid pair composition (AAPC), solvent-accessible surface area (ASA), amino acid substitution matrix (BLOSUM62), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM). Various prediction models were built using support vector machine (SVM) and evaluated by five-fold cross-validation. The model constructed from hybrid features, including PSSM and physicochemical properties, yielded the best performance with sensitivity, specificity, accuracy and MCC measurements of 0.746, 0.737, 0.738 and 0.337, respectively. The selected model also provided a promising accuracy (0.693) on an independent testing dataset. Additionally, we employed TwoSampleLogo to help discover the difference of amino acid composition among S-sulfenylation, S-glutathionylation and S-nitrosylation sites. CONCLUSION: This work proposed a computational method to explore informative features and functions for protein S-sulfenylation. Evaluation by five-fold cross validation indicated that the selected features were effective in the identification of S-sulfenylation sites. Moreover, the independent testing results demonstrated that the proposed method could provide a feasible means for conducting preliminary analyses of protein S-sulfenylation. We also anticipate that the uncovered differences in amino acid composition may facilitate future studies of the extensive crosstalk among S-sulfenylation, S-glutathionylation and S-nitrosylation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2299-1) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4895302
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-48953022016-06-10 SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites Bui, Van-Minh Weng, Shun-Long Lu, Cheng-Tsung Chang, Tzu-Hao Weng, Julia Tzu-Ya Lee, Tzong-Yi BMC Genomics Proceedings BACKGROUND: Protein S-sulfenylation is a type of post-translational modification (PTM) involving the covalent binding of a hydroxyl group to the thiol of a cysteine amino acid. Recent evidence has shown the importance of S-sulfenylation in various biological processes, including transcriptional regulation, apoptosis and cytokine signaling. Determining the specific sites of S-sulfenylation is fundamental to understanding the structures and functions of S-sulfenylated proteins. However, the current lack of reliable tools often limits researchers to use expensive and time-consuming laboratory techniques for the identification of S-sulfenylation sites. Thus, we were motivated to develop a bioinformatics method for investigating S-sulfenylation sites based on amino acid compositions and physicochemical properties. RESULTS: In this work, physicochemical properties were utilized not only to identify S-sulfenylation sites from 1,096 experimentally verified S-sulfenylated proteins, but also to compare the effectiveness of prediction with other characteristics such as amino acid composition (AAC), amino acid pair composition (AAPC), solvent-accessible surface area (ASA), amino acid substitution matrix (BLOSUM62), position-specific scoring matrix (PSSM), and positional weighted matrix (PWM). Various prediction models were built using support vector machine (SVM) and evaluated by five-fold cross-validation. The model constructed from hybrid features, including PSSM and physicochemical properties, yielded the best performance with sensitivity, specificity, accuracy and MCC measurements of 0.746, 0.737, 0.738 and 0.337, respectively. The selected model also provided a promising accuracy (0.693) on an independent testing dataset. Additionally, we employed TwoSampleLogo to help discover the difference of amino acid composition among S-sulfenylation, S-glutathionylation and S-nitrosylation sites. CONCLUSION: This work proposed a computational method to explore informative features and functions for protein S-sulfenylation. Evaluation by five-fold cross validation indicated that the selected features were effective in the identification of S-sulfenylation sites. Moreover, the independent testing results demonstrated that the proposed method could provide a feasible means for conducting preliminary analyses of protein S-sulfenylation. We also anticipate that the uncovered differences in amino acid composition may facilitate future studies of the extensive crosstalk among S-sulfenylation, S-glutathionylation and S-nitrosylation. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-015-2299-1) contains supplementary material, which is available to authorized users. BioMed Central 2016-01-11 /pmc/articles/PMC4895302/ /pubmed/26819243 http://dx.doi.org/10.1186/s12864-015-2299-1 Text en © Bui et al. 2015 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Proceedings
Bui, Van-Minh
Weng, Shun-Long
Lu, Cheng-Tsung
Chang, Tzu-Hao
Weng, Julia Tzu-Ya
Lee, Tzong-Yi
SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites
title SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites
title_full SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites
title_fullStr SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites
title_full_unstemmed SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites
title_short SOHSite: incorporating evolutionary information and physicochemical properties to identify protein S-sulfenylation sites
title_sort sohsite: incorporating evolutionary information and physicochemical properties to identify protein s-sulfenylation sites
topic Proceedings
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4895302/
https://www.ncbi.nlm.nih.gov/pubmed/26819243
http://dx.doi.org/10.1186/s12864-015-2299-1
work_keys_str_mv AT buivanminh sohsiteincorporatingevolutionaryinformationandphysicochemicalpropertiestoidentifyproteinssulfenylationsites
AT wengshunlong sohsiteincorporatingevolutionaryinformationandphysicochemicalpropertiestoidentifyproteinssulfenylationsites
AT luchengtsung sohsiteincorporatingevolutionaryinformationandphysicochemicalpropertiestoidentifyproteinssulfenylationsites
AT changtzuhao sohsiteincorporatingevolutionaryinformationandphysicochemicalpropertiestoidentifyproteinssulfenylationsites
AT wengjuliatzuya sohsiteincorporatingevolutionaryinformationandphysicochemicalpropertiestoidentifyproteinssulfenylationsites
AT leetzongyi sohsiteincorporatingevolutionaryinformationandphysicochemicalpropertiestoidentifyproteinssulfenylationsites