Cargando…
SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models
BACKGROUND: S-sulphenylation is a ubiquitous protein post-translational modification (PTM) where an S-hydroxyl (−SOH) bond is formed via the reversible oxidation on the Sulfhydryl group of cysteine (C). Recent experimental studies have revealed that S-sulphenylation plays critical roles in many biol...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868744/ https://www.ncbi.nlm.nih.gov/pubmed/31752668 http://dx.doi.org/10.1186/s12859-019-3178-6 |
_version_ | 1783472334136410112 |
---|---|
author | Wang, Xiaochuan Li, Chen Li, Fuyi Sharma, Varun S. Song, Jiangning Webb, Geoffrey I. |
author_facet | Wang, Xiaochuan Li, Chen Li, Fuyi Sharma, Varun S. Song, Jiangning Webb, Geoffrey I. |
author_sort | Wang, Xiaochuan |
collection | PubMed |
description | BACKGROUND: S-sulphenylation is a ubiquitous protein post-translational modification (PTM) where an S-hydroxyl (−SOH) bond is formed via the reversible oxidation on the Sulfhydryl group of cysteine (C). Recent experimental studies have revealed that S-sulphenylation plays critical roles in many biological functions, such as protein regulation and cell signaling. State-of-the-art bioinformatic advances have facilitated high-throughput in silico screening of protein S-sulphenylation sites, thereby significantly reducing the time and labour costs traditionally required for the experimental investigation of S-sulphenylation. RESULTS: In this study, we have proposed a novel hybrid computational framework, termed SIMLIN, for accurate prediction of protein S-sulphenylation sites using a multi-stage neural-network based ensemble-learning model integrating both protein sequence derived and protein structural features. Benchmarking experiments against the current state-of-the-art predictors for S-sulphenylation demonstrated that SIMLIN delivered competitive prediction performance. The empirical studies on the independent testing dataset demonstrated that SIMLIN achieved 88.0% prediction accuracy and an AUC score of 0.82, which outperforms currently existing methods. CONCLUSIONS: In summary, SIMLIN predicts human S-sulphenylation sites with high accuracy thereby facilitating biological hypothesis generation and experimental validation. The web server, datasets, and online instructions are freely available at http://simlin.erc.monash.edu/ for academic purposes. |
format | Online Article Text |
id | pubmed-6868744 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-68687442019-12-12 SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models Wang, Xiaochuan Li, Chen Li, Fuyi Sharma, Varun S. Song, Jiangning Webb, Geoffrey I. BMC Bioinformatics Software BACKGROUND: S-sulphenylation is a ubiquitous protein post-translational modification (PTM) where an S-hydroxyl (−SOH) bond is formed via the reversible oxidation on the Sulfhydryl group of cysteine (C). Recent experimental studies have revealed that S-sulphenylation plays critical roles in many biological functions, such as protein regulation and cell signaling. State-of-the-art bioinformatic advances have facilitated high-throughput in silico screening of protein S-sulphenylation sites, thereby significantly reducing the time and labour costs traditionally required for the experimental investigation of S-sulphenylation. RESULTS: In this study, we have proposed a novel hybrid computational framework, termed SIMLIN, for accurate prediction of protein S-sulphenylation sites using a multi-stage neural-network based ensemble-learning model integrating both protein sequence derived and protein structural features. Benchmarking experiments against the current state-of-the-art predictors for S-sulphenylation demonstrated that SIMLIN delivered competitive prediction performance. The empirical studies on the independent testing dataset demonstrated that SIMLIN achieved 88.0% prediction accuracy and an AUC score of 0.82, which outperforms currently existing methods. CONCLUSIONS: In summary, SIMLIN predicts human S-sulphenylation sites with high accuracy thereby facilitating biological hypothesis generation and experimental validation. The web server, datasets, and online instructions are freely available at http://simlin.erc.monash.edu/ for academic purposes. BioMed Central 2019-11-21 /pmc/articles/PMC6868744/ /pubmed/31752668 http://dx.doi.org/10.1186/s12859-019-3178-6 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Software Wang, Xiaochuan Li, Chen Li, Fuyi Sharma, Varun S. Song, Jiangning Webb, Geoffrey I. SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
title | SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
title_full | SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
title_fullStr | SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
title_full_unstemmed | SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
title_short | SIMLIN: a bioinformatics tool for prediction of S-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
title_sort | simlin: a bioinformatics tool for prediction of s-sulphenylation in the human proteome based on multi-stage ensemble-learning models |
topic | Software |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6868744/ https://www.ncbi.nlm.nih.gov/pubmed/31752668 http://dx.doi.org/10.1186/s12859-019-3178-6 |
work_keys_str_mv | AT wangxiaochuan simlinabioinformaticstoolforpredictionofssulphenylationinthehumanproteomebasedonmultistageensemblelearningmodels AT lichen simlinabioinformaticstoolforpredictionofssulphenylationinthehumanproteomebasedonmultistageensemblelearningmodels AT lifuyi simlinabioinformaticstoolforpredictionofssulphenylationinthehumanproteomebasedonmultistageensemblelearningmodels AT sharmavaruns simlinabioinformaticstoolforpredictionofssulphenylationinthehumanproteomebasedonmultistageensemblelearningmodels AT songjiangning simlinabioinformaticstoolforpredictionofssulphenylationinthehumanproteomebasedonmultistageensemblelearningmodels AT webbgeoffreyi simlinabioinformaticstoolforpredictionofssulphenylationinthehumanproteomebasedonmultistageensemblelearningmodels |