Cargando…
DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
As a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (K(hib)) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of K(hib) sites. Thousands of K(hib) sites have b...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7509169/ https://www.ncbi.nlm.nih.gov/pubmed/33015075 http://dx.doi.org/10.3389/fcell.2020.580217 |
_version_ | 1783585550156955648 |
---|---|
author | Zhang, Luna Zou, Yang He, Ningning Chen, Yu Chen, Zhen Li, Lei |
author_facet | Zhang, Luna Zou, Yang He, Ningning Chen, Yu Chen, Zhen Li, Lei |
author_sort | Zhang, Luna |
collection | PubMed |
description | As a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (K(hib)) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of K(hib) sites. Thousands of K(hib) sites have been experimentally verified across five different species. However, there are only a couple traditional machine-learning algorithms developed to predict K(hib) sites for limited species, lacking a general prediction algorithm. We constructed a deep-learning algorithm based on convolutional neural network with the one-hot encoding approach, dubbed CNN(OH). It performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the ROC curve (AUC) values for CNN(OH) ranged from 0.82 to 0.87 for different organisms, which is superior to the currently available K(hib) predictors. Moreover, we developed the general model based on the integrated data from multiple species and it showed great universality and effectiveness with the AUC values in the range of 0.79–0.87. Accordingly, we constructed the on-line prediction tool dubbed DeepKhib for easily identifying K(hib) sites, which includes both species-specific and general models. DeepKhib is available at http://www.bioinfogo.org/DeepKhib. |
format | Online Article Text |
id | pubmed-7509169 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-75091692020-10-02 DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction Zhang, Luna Zou, Yang He, Ningning Chen, Yu Chen, Zhen Li, Lei Front Cell Dev Biol Cell and Developmental Biology As a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (K(hib)) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of K(hib) sites. Thousands of K(hib) sites have been experimentally verified across five different species. However, there are only a couple traditional machine-learning algorithms developed to predict K(hib) sites for limited species, lacking a general prediction algorithm. We constructed a deep-learning algorithm based on convolutional neural network with the one-hot encoding approach, dubbed CNN(OH). It performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the ROC curve (AUC) values for CNN(OH) ranged from 0.82 to 0.87 for different organisms, which is superior to the currently available K(hib) predictors. Moreover, we developed the general model based on the integrated data from multiple species and it showed great universality and effectiveness with the AUC values in the range of 0.79–0.87. Accordingly, we constructed the on-line prediction tool dubbed DeepKhib for easily identifying K(hib) sites, which includes both species-specific and general models. DeepKhib is available at http://www.bioinfogo.org/DeepKhib. Frontiers Media S.A. 2020-09-09 /pmc/articles/PMC7509169/ /pubmed/33015075 http://dx.doi.org/10.3389/fcell.2020.580217 Text en Copyright © 2020 Zhang, Zou, He, Chen, Chen and Li. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Zhang, Luna Zou, Yang He, Ningning Chen, Yu Chen, Zhen Li, Lei DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction |
title | DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction |
title_full | DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction |
title_fullStr | DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction |
title_full_unstemmed | DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction |
title_short | DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction |
title_sort | deepkhib: a deep-learning framework for lysine 2-hydroxyisobutyrylation sites prediction |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7509169/ https://www.ncbi.nlm.nih.gov/pubmed/33015075 http://dx.doi.org/10.3389/fcell.2020.580217 |
work_keys_str_mv | AT zhangluna deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction AT zouyang deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction AT heningning deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction AT chenyu deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction AT chenzhen deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction AT lilei deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction |