Cargando…
SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications
Most modern tools used to predict sites of small ubiquitin-like modifier (SUMO) binding (referred to as SUMOylation) use algorithms, chemical features of the protein, and consensus motifs. However, these tools rarely consider the influence of post-translational modification (PTM) information for oth...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6195521/ https://www.ncbi.nlm.nih.gov/pubmed/30341374 http://dx.doi.org/10.1038/s41598-018-33951-5 |
_version_ | 1783364401131159552 |
---|---|
author | Chang, Chi-Chang Tung, Chi-Hua Chen, Chi-Wei Tu, Chin-Hau Chu, Yen-Wei |
author_facet | Chang, Chi-Chang Tung, Chi-Hua Chen, Chi-Wei Tu, Chin-Hau Chu, Yen-Wei |
author_sort | Chang, Chi-Chang |
collection | PubMed |
description | Most modern tools used to predict sites of small ubiquitin-like modifier (SUMO) binding (referred to as SUMOylation) use algorithms, chemical features of the protein, and consensus motifs. However, these tools rarely consider the influence of post-translational modification (PTM) information for other sites within the same protein on the accuracy of prediction results. This study applied the Random Forest machine learning method, as well as motif screening models and a feature selection combination mechanism, to develop a SUMOylation prediction system, referred to as SUMOgo. With regard to prediction method, PTM sites were coded as new functional features in addition to structural features, such as sequence-based binary coding, encoded chemical features of proteins, and encoded secondary structure information that is important for PTM. Twenty cycles of prediction were conducted with a 1:1 combination of positive test data and random negative data. Matthew’s correlation coefficient of SUMOgo reached 0.511, which is higher than that of current commonly used tools. This study further verified the important role of PTM in SUMOgo and includes a case study on CREB binding protein (CREBBP). The website for the final tool is http://predictor.nchu.edu.tw/SUMOgo. |
format | Online Article Text |
id | pubmed-6195521 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-61955212018-10-24 SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications Chang, Chi-Chang Tung, Chi-Hua Chen, Chi-Wei Tu, Chin-Hau Chu, Yen-Wei Sci Rep Article Most modern tools used to predict sites of small ubiquitin-like modifier (SUMO) binding (referred to as SUMOylation) use algorithms, chemical features of the protein, and consensus motifs. However, these tools rarely consider the influence of post-translational modification (PTM) information for other sites within the same protein on the accuracy of prediction results. This study applied the Random Forest machine learning method, as well as motif screening models and a feature selection combination mechanism, to develop a SUMOylation prediction system, referred to as SUMOgo. With regard to prediction method, PTM sites were coded as new functional features in addition to structural features, such as sequence-based binary coding, encoded chemical features of proteins, and encoded secondary structure information that is important for PTM. Twenty cycles of prediction were conducted with a 1:1 combination of positive test data and random negative data. Matthew’s correlation coefficient of SUMOgo reached 0.511, which is higher than that of current commonly used tools. This study further verified the important role of PTM in SUMOgo and includes a case study on CREB binding protein (CREBBP). The website for the final tool is http://predictor.nchu.edu.tw/SUMOgo. Nature Publishing Group UK 2018-10-19 /pmc/articles/PMC6195521/ /pubmed/30341374 http://dx.doi.org/10.1038/s41598-018-33951-5 Text en © The Author(s) 2018 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Chang, Chi-Chang Tung, Chi-Hua Chen, Chi-Wei Tu, Chin-Hau Chu, Yen-Wei SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
title | SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
title_full | SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
title_fullStr | SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
title_full_unstemmed | SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
title_short | SUMOgo: Prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
title_sort | sumogo: prediction of sumoylation sites on lysines by motif screening models and the effects of various post-translational modifications |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6195521/ https://www.ncbi.nlm.nih.gov/pubmed/30341374 http://dx.doi.org/10.1038/s41598-018-33951-5 |
work_keys_str_mv | AT changchichang sumogopredictionofsumoylationsitesonlysinesbymotifscreeningmodelsandtheeffectsofvariousposttranslationalmodifications AT tungchihua sumogopredictionofsumoylationsitesonlysinesbymotifscreeningmodelsandtheeffectsofvariousposttranslationalmodifications AT chenchiwei sumogopredictionofsumoylationsitesonlysinesbymotifscreeningmodelsandtheeffectsofvariousposttranslationalmodifications AT tuchinhau sumogopredictionofsumoylationsitesonlysinesbymotifscreeningmodelsandtheeffectsofvariousposttranslationalmodifications AT chuyenwei sumogopredictionofsumoylationsitesonlysinesbymotifscreeningmodelsandtheeffectsofvariousposttranslationalmodifications |