Cargando…

DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction

As a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (K(hib)) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of K(hib) sites. Thousands of K(hib) sites have b...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Luna, Zou, Yang, He, Ningning, Chen, Yu, Chen, Zhen, Li, Lei
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7509169/
https://www.ncbi.nlm.nih.gov/pubmed/33015075
http://dx.doi.org/10.3389/fcell.2020.580217
_version_ 1783585550156955648
author Zhang, Luna
Zou, Yang
He, Ningning
Chen, Yu
Chen, Zhen
Li, Lei
author_facet Zhang, Luna
Zou, Yang
He, Ningning
Chen, Yu
Chen, Zhen
Li, Lei
author_sort Zhang, Luna
collection PubMed
description As a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (K(hib)) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of K(hib) sites. Thousands of K(hib) sites have been experimentally verified across five different species. However, there are only a couple traditional machine-learning algorithms developed to predict K(hib) sites for limited species, lacking a general prediction algorithm. We constructed a deep-learning algorithm based on convolutional neural network with the one-hot encoding approach, dubbed CNN(OH). It performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the ROC curve (AUC) values for CNN(OH) ranged from 0.82 to 0.87 for different organisms, which is superior to the currently available K(hib) predictors. Moreover, we developed the general model based on the integrated data from multiple species and it showed great universality and effectiveness with the AUC values in the range of 0.79–0.87. Accordingly, we constructed the on-line prediction tool dubbed DeepKhib for easily identifying K(hib) sites, which includes both species-specific and general models. DeepKhib is available at http://www.bioinfogo.org/DeepKhib.
format Online
Article
Text
id pubmed-7509169
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-75091692020-10-02 DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction Zhang, Luna Zou, Yang He, Ningning Chen, Yu Chen, Zhen Li, Lei Front Cell Dev Biol Cell and Developmental Biology As a novel type of post-translational modification, lysine 2-Hydroxyisobutyrylation (K(hib)) plays an important role in gene transcription and signal transduction. In order to understand its regulatory mechanism, the essential step is the recognition of K(hib) sites. Thousands of K(hib) sites have been experimentally verified across five different species. However, there are only a couple traditional machine-learning algorithms developed to predict K(hib) sites for limited species, lacking a general prediction algorithm. We constructed a deep-learning algorithm based on convolutional neural network with the one-hot encoding approach, dubbed CNN(OH). It performs favorably to the traditional machine-learning models and other deep-learning models across different species, in terms of cross-validation and independent test. The area under the ROC curve (AUC) values for CNN(OH) ranged from 0.82 to 0.87 for different organisms, which is superior to the currently available K(hib) predictors. Moreover, we developed the general model based on the integrated data from multiple species and it showed great universality and effectiveness with the AUC values in the range of 0.79–0.87. Accordingly, we constructed the on-line prediction tool dubbed DeepKhib for easily identifying K(hib) sites, which includes both species-specific and general models. DeepKhib is available at http://www.bioinfogo.org/DeepKhib. Frontiers Media S.A. 2020-09-09 /pmc/articles/PMC7509169/ /pubmed/33015075 http://dx.doi.org/10.3389/fcell.2020.580217 Text en Copyright © 2020 Zhang, Zou, He, Chen, Chen and Li. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Cell and Developmental Biology
Zhang, Luna
Zou, Yang
He, Ningning
Chen, Yu
Chen, Zhen
Li, Lei
DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
title DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
title_full DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
title_fullStr DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
title_full_unstemmed DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
title_short DeepKhib: A Deep-Learning Framework for Lysine 2-Hydroxyisobutyrylation Sites Prediction
title_sort deepkhib: a deep-learning framework for lysine 2-hydroxyisobutyrylation sites prediction
topic Cell and Developmental Biology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7509169/
https://www.ncbi.nlm.nih.gov/pubmed/33015075
http://dx.doi.org/10.3389/fcell.2020.580217
work_keys_str_mv AT zhangluna deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction
AT zouyang deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction
AT heningning deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction
AT chenyu deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction
AT chenzhen deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction
AT lilei deepkhibadeeplearningframeworkforlysine2hydroxyisobutyrylationsitesprediction