Cargando…

Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features

Lysine acetylation is a reversible post-translational modification, playing an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identificat...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Yuan, Wang, Mingjun, Wang, Huilin, Tan, Hao, Zhang, Ziding, Webb, Geoffrey I., Song, Jiangning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4104576/
https://www.ncbi.nlm.nih.gov/pubmed/25042424
http://dx.doi.org/10.1038/srep05765
_version_ 1782327272584249344
author Li, Yuan
Wang, Mingjun
Wang, Huilin
Tan, Hao
Zhang, Ziding
Webb, Geoffrey I.
Song, Jiangning
author_facet Li, Yuan
Wang, Mingjun
Wang, Huilin
Tan, Hao
Zhang, Ziding
Webb, Geoffrey I.
Song, Jiangning
author_sort Li, Yuan
collection PubMed
description Lysine acetylation is a reversible post-translational modification, playing an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Alternative bioinformatics methods are cost-effective and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed as SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates functional features, applied for lysine acetylation site prediction for the first time, significantly improve the predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates that are not previously identified. The results along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and facilitate hypothesis-driven experimental design and validation.
format Online
Article
Text
id pubmed-4104576
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Nature Publishing Group
record_format MEDLINE/PubMed
spelling pubmed-41045762014-07-22 Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features Li, Yuan Wang, Mingjun Wang, Huilin Tan, Hao Zhang, Ziding Webb, Geoffrey I. Song, Jiangning Sci Rep Article Lysine acetylation is a reversible post-translational modification, playing an important role in cytokine signaling, transcriptional regulation, and apoptosis. To fully understand acetylation mechanisms, identification of substrates and specific acetylation sites is crucial. Experimental identification is often time-consuming and expensive. Alternative bioinformatics methods are cost-effective and can be used in a high-throughput manner to generate relatively precise predictions. Here we develop a method termed as SSPKA for species-specific lysine acetylation prediction, using random forest classifiers that combine sequence-derived and functional features with two-step feature selection. Feature importance analysis indicates functional features, applied for lysine acetylation site prediction for the first time, significantly improve the predictive performance. We apply the SSPKA model to screen the entire human proteome and identify many high-confidence putative substrates that are not previously identified. The results along with the implemented Java tool, serve as useful resources to elucidate the mechanism of lysine acetylation and facilitate hypothesis-driven experimental design and validation. Nature Publishing Group 2014-07-21 /pmc/articles/PMC4104576/ /pubmed/25042424 http://dx.doi.org/10.1038/srep05765 Text en Copyright © 2014, Macmillan Publishers Limited. All rights reserved http://creativecommons.org/licenses/by-nc-sa/4.0/ This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License. The images or other third party material in this article are included in the article's Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder in order to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-sa/4.0/
spellingShingle Article
Li, Yuan
Wang, Mingjun
Wang, Huilin
Tan, Hao
Zhang, Ziding
Webb, Geoffrey I.
Song, Jiangning
Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
title Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
title_full Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
title_fullStr Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
title_full_unstemmed Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
title_short Accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
title_sort accurate in silico identification of species-specific acetylation sites by integrating protein sequence-derived and functional features
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4104576/
https://www.ncbi.nlm.nih.gov/pubmed/25042424
http://dx.doi.org/10.1038/srep05765
work_keys_str_mv AT liyuan accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures
AT wangmingjun accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures
AT wanghuilin accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures
AT tanhao accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures
AT zhangziding accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures
AT webbgeoffreyi accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures
AT songjiangning accurateinsilicoidentificationofspeciesspecificacetylationsitesbyintegratingproteinsequencederivedandfunctionalfeatures