Cargando…

SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure

Post Translational Modification (PTM) is defined as the modification of amino acids along the protein sequences after the translation process. These modifications significantly impact on the functioning of proteins. Therefore, having a comprehensive understanding of the underlying mechanism of PTMs...

Descripción completa

Detalles Bibliográficos
Autores principales: Dehzangi, Abdollah, López, Yosvany, Taherzadeh, Ghazaleh, Sharma, Alok, Tsunoda, Tatsuhiko
Formato: Online Artículo Texto
Lenguaje:English
Publicado: MDPI 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320791/
https://www.ncbi.nlm.nih.gov/pubmed/30544729
http://dx.doi.org/10.3390/molecules23123260
_version_ 1783385286159368192
author Dehzangi, Abdollah
López, Yosvany
Taherzadeh, Ghazaleh
Sharma, Alok
Tsunoda, Tatsuhiko
author_facet Dehzangi, Abdollah
López, Yosvany
Taherzadeh, Ghazaleh
Sharma, Alok
Tsunoda, Tatsuhiko
author_sort Dehzangi, Abdollah
collection PubMed
description Post Translational Modification (PTM) is defined as the modification of amino acids along the protein sequences after the translation process. These modifications significantly impact on the functioning of proteins. Therefore, having a comprehensive understanding of the underlying mechanism of PTMs turns out to be critical in studying the biological roles of proteins. Among a wide range of PTMs, sumoylation is one of the most important modifications due to its known cellular functions which include transcriptional regulation, protein stability, and protein subcellular localization. Despite its importance, determining sumoylation sites via experimental methods is time-consuming and costly. This has led to a great demand for the development of fast computational methods able to accurately determine sumoylation sites in proteins. In this study, we present a new machine learning-based method for predicting sumoylation sites called SumSec. To do this, we employed the predicted secondary structure of amino acids to extract two types of structural features from neighboring amino acids along the protein sequence which has never been used for this task. As a result, our proposed method is able to enhance the sumoylation site prediction task, outperforming previously proposed methods in the literature. SumSec demonstrated high sensitivity (0.91), accuracy (0.94) and MCC (0.88). The prediction accuracy achieved in this study is 21% better than those reported in previous studies. The script and extracted features are publicly available at: https://github.com/YosvanyLopez/SumSec.
format Online
Article
Text
id pubmed-6320791
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher MDPI
record_format MEDLINE/PubMed
spelling pubmed-63207912019-01-14 SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure Dehzangi, Abdollah López, Yosvany Taherzadeh, Ghazaleh Sharma, Alok Tsunoda, Tatsuhiko Molecules Article Post Translational Modification (PTM) is defined as the modification of amino acids along the protein sequences after the translation process. These modifications significantly impact on the functioning of proteins. Therefore, having a comprehensive understanding of the underlying mechanism of PTMs turns out to be critical in studying the biological roles of proteins. Among a wide range of PTMs, sumoylation is one of the most important modifications due to its known cellular functions which include transcriptional regulation, protein stability, and protein subcellular localization. Despite its importance, determining sumoylation sites via experimental methods is time-consuming and costly. This has led to a great demand for the development of fast computational methods able to accurately determine sumoylation sites in proteins. In this study, we present a new machine learning-based method for predicting sumoylation sites called SumSec. To do this, we employed the predicted secondary structure of amino acids to extract two types of structural features from neighboring amino acids along the protein sequence which has never been used for this task. As a result, our proposed method is able to enhance the sumoylation site prediction task, outperforming previously proposed methods in the literature. SumSec demonstrated high sensitivity (0.91), accuracy (0.94) and MCC (0.88). The prediction accuracy achieved in this study is 21% better than those reported in previous studies. The script and extracted features are publicly available at: https://github.com/YosvanyLopez/SumSec. MDPI 2018-12-10 /pmc/articles/PMC6320791/ /pubmed/30544729 http://dx.doi.org/10.3390/molecules23123260 Text en © 2018 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle Article
Dehzangi, Abdollah
López, Yosvany
Taherzadeh, Ghazaleh
Sharma, Alok
Tsunoda, Tatsuhiko
SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_full SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_fullStr SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_full_unstemmed SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_short SumSec: Accurate Prediction of Sumoylation Sites Using Predicted Secondary Structure
title_sort sumsec: accurate prediction of sumoylation sites using predicted secondary structure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6320791/
https://www.ncbi.nlm.nih.gov/pubmed/30544729
http://dx.doi.org/10.3390/molecules23123260
work_keys_str_mv AT dehzangiabdollah sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT lopezyosvany sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT taherzadehghazaleh sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT sharmaalok sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure
AT tsunodatatsuhiko sumsecaccuratepredictionofsumoylationsitesusingpredictedsecondarystructure