Cargando…

Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis

Post-translational modifications (PTMs) are crucial steps in protein synthesis and are important factors contributing to protein diversity. PTMs play important roles in the regulation of gene expression, protein stability and metabolism. Lysine residues in protein sequences have been found to be tar...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Ning, Zhou, You, Huang, Tao, Zhang, Yu-Chao, Li, Bi-Qing, Chen, Lei, Cai, Yu-Dong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4164654/
https://www.ncbi.nlm.nih.gov/pubmed/25222670
http://dx.doi.org/10.1371/journal.pone.0107464
_version_ 1782334980925423616
author Zhang, Ning
Zhou, You
Huang, Tao
Zhang, Yu-Chao
Li, Bi-Qing
Chen, Lei
Cai, Yu-Dong
author_facet Zhang, Ning
Zhou, You
Huang, Tao
Zhang, Yu-Chao
Li, Bi-Qing
Chen, Lei
Cai, Yu-Dong
author_sort Zhang, Ning
collection PubMed
description Post-translational modifications (PTMs) are crucial steps in protein synthesis and are important factors contributing to protein diversity. PTMs play important roles in the regulation of gene expression, protein stability and metabolism. Lysine residues in protein sequences have been found to be targeted for both types of PTMs: sumoylations and acetylations; however, each PTM has a different cellular role. As experimental approaches are often laborious and time consuming, it is challenging to distinguish the two types of PTMs on lysine residues using computational methods. In this study, we developed a method to discriminate between sumoylated lysine residues and acetylated residues. The method incorporated several features: PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. By using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method, an optimal feature set was selected from all of the incorporated features, with which the classifier achieved 92.14% accuracy with an MCC value of 0.7322. Analysis of the optimal feature set revealed some differences between acetylation and sumoylation. The results from our study also supported the previous finding that there exist different consensus motifs for the two types of PTMs. The results could suggest possible dominant factors governing the acetylation and sumoylation of lysine residues, shedding some light on the modification dynamics and molecular mechanisms of the two types of PTMs, and provide guidelines for experimental validations.
format Online
Article
Text
id pubmed-4164654
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41646542014-09-19 Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis Zhang, Ning Zhou, You Huang, Tao Zhang, Yu-Chao Li, Bi-Qing Chen, Lei Cai, Yu-Dong PLoS One Research Article Post-translational modifications (PTMs) are crucial steps in protein synthesis and are important factors contributing to protein diversity. PTMs play important roles in the regulation of gene expression, protein stability and metabolism. Lysine residues in protein sequences have been found to be targeted for both types of PTMs: sumoylations and acetylations; however, each PTM has a different cellular role. As experimental approaches are often laborious and time consuming, it is challenging to distinguish the two types of PTMs on lysine residues using computational methods. In this study, we developed a method to discriminate between sumoylated lysine residues and acetylated residues. The method incorporated several features: PSSM conservation scores, amino acid factors, secondary structures, solvent accessibilities and disorder scores. By using the mRMR (Maximum Relevance Minimum Redundancy) method and the IFS (Incremental Feature Selection) method, an optimal feature set was selected from all of the incorporated features, with which the classifier achieved 92.14% accuracy with an MCC value of 0.7322. Analysis of the optimal feature set revealed some differences between acetylation and sumoylation. The results from our study also supported the previous finding that there exist different consensus motifs for the two types of PTMs. The results could suggest possible dominant factors governing the acetylation and sumoylation of lysine residues, shedding some light on the modification dynamics and molecular mechanisms of the two types of PTMs, and provide guidelines for experimental validations. Public Library of Science 2014-09-15 /pmc/articles/PMC4164654/ /pubmed/25222670 http://dx.doi.org/10.1371/journal.pone.0107464 Text en © 2014 Zhang et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhang, Ning
Zhou, You
Huang, Tao
Zhang, Yu-Chao
Li, Bi-Qing
Chen, Lei
Cai, Yu-Dong
Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis
title Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis
title_full Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis
title_fullStr Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis
title_full_unstemmed Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis
title_short Discriminating between Lysine Sumoylation and Lysine Acetylation Using mRMR Feature Selection and Analysis
title_sort discriminating between lysine sumoylation and lysine acetylation using mrmr feature selection and analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4164654/
https://www.ncbi.nlm.nih.gov/pubmed/25222670
http://dx.doi.org/10.1371/journal.pone.0107464
work_keys_str_mv AT zhangning discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis
AT zhouyou discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis
AT huangtao discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis
AT zhangyuchao discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis
AT libiqing discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis
AT chenlei discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis
AT caiyudong discriminatingbetweenlysinesumoylationandlysineacetylationusingmrmrfeatureselectionandanalysis