Cargando…

Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder

BACKGROUND: Sumoylation, which is a reversible and dynamic post-translational modification, is one of the vital processes in a cell. Before a protein matures to perform its function, sumoylation may alter its localization, interactions, and possibly structural conformation. Abberations in protein su...

Descripción completa

Detalles Bibliográficos
Autores principales: Yavuz, Ahmet Sinan, Sezerman, Osman Ugur
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290605/
https://www.ncbi.nlm.nih.gov/pubmed/25521314
http://dx.doi.org/10.1186/1471-2164-15-S9-S18
_version_ 1782352270928642048
author Yavuz, Ahmet Sinan
Sezerman, Osman Ugur
author_facet Yavuz, Ahmet Sinan
Sezerman, Osman Ugur
author_sort Yavuz, Ahmet Sinan
collection PubMed
description BACKGROUND: Sumoylation, which is a reversible and dynamic post-translational modification, is one of the vital processes in a cell. Before a protein matures to perform its function, sumoylation may alter its localization, interactions, and possibly structural conformation. Abberations in protein sumoylation has been linked with a variety of disorders and developmental anomalies. Experimental approaches to identification of sumoylation sites may not be effective due to the dynamic nature of sumoylation, laborsome experiments and their cost. Therefore, computational approaches may guide experimental identification of sumoylation sites and provide insights for further understanding sumoylation mechanism. RESULTS: In this paper, the effectiveness of using various sequence properties in predicting sumoylation sites was investigated with statistical analyses and machine learning approach employing support vector machines. These sequence properties were derived from windows of size 7 including position-specific amino acid composition, hydrophobicity, estimated sub-window volumes, predicted disorder, and conformational flexibility. 5-fold cross-validation results on experimentally identified sumoylation sites revealed that our method successfully predicts sumoylation sites with a Matthew's correlation coefficient, sensitivity, specificity, and accuracy equal to 0.66, 73%, 98%, and 97%, respectively. Additionally, we have showed that our method compares favorably to the existing prediction methods and basic regular expressions scanner. CONCLUSIONS: By using support vector machines, a new, robust method for sumoylation site prediction was introduced. Besides, the possible effects of predicted conformational flexibility and disorder on sumoylation site recognition were explored computationally for the first time to our knowledge as an additional parameter that could aid in sumoylation site prediction.
format Online
Article
Text
id pubmed-4290605
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-42906052015-01-15 Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder Yavuz, Ahmet Sinan Sezerman, Osman Ugur BMC Genomics Research BACKGROUND: Sumoylation, which is a reversible and dynamic post-translational modification, is one of the vital processes in a cell. Before a protein matures to perform its function, sumoylation may alter its localization, interactions, and possibly structural conformation. Abberations in protein sumoylation has been linked with a variety of disorders and developmental anomalies. Experimental approaches to identification of sumoylation sites may not be effective due to the dynamic nature of sumoylation, laborsome experiments and their cost. Therefore, computational approaches may guide experimental identification of sumoylation sites and provide insights for further understanding sumoylation mechanism. RESULTS: In this paper, the effectiveness of using various sequence properties in predicting sumoylation sites was investigated with statistical analyses and machine learning approach employing support vector machines. These sequence properties were derived from windows of size 7 including position-specific amino acid composition, hydrophobicity, estimated sub-window volumes, predicted disorder, and conformational flexibility. 5-fold cross-validation results on experimentally identified sumoylation sites revealed that our method successfully predicts sumoylation sites with a Matthew's correlation coefficient, sensitivity, specificity, and accuracy equal to 0.66, 73%, 98%, and 97%, respectively. Additionally, we have showed that our method compares favorably to the existing prediction methods and basic regular expressions scanner. CONCLUSIONS: By using support vector machines, a new, robust method for sumoylation site prediction was introduced. Besides, the possible effects of predicted conformational flexibility and disorder on sumoylation site recognition were explored computationally for the first time to our knowledge as an additional parameter that could aid in sumoylation site prediction. BioMed Central 2014-12-08 /pmc/articles/PMC4290605/ /pubmed/25521314 http://dx.doi.org/10.1186/1471-2164-15-S9-S18 Text en Copyright © 2014 Yavuz and Sezerman; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Yavuz, Ahmet Sinan
Sezerman, Osman Ugur
Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
title Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
title_full Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
title_fullStr Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
title_full_unstemmed Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
title_short Predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
title_sort predicting sumoylation sites using support vector machines based on various sequence features, conformational flexibility and disorder
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4290605/
https://www.ncbi.nlm.nih.gov/pubmed/25521314
http://dx.doi.org/10.1186/1471-2164-15-S9-S18
work_keys_str_mv AT yavuzahmetsinan predictingsumoylationsitesusingsupportvectormachinesbasedonvarioussequencefeaturesconformationalflexibilityanddisorder
AT sezermanosmanugur predictingsumoylationsitesusingsupportvectormachinesbasedonvarioussequencefeaturesconformationalflexibilityanddisorder