Cargando…

ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning

Lysine 2-hydroxyisobutylation (K(hib)), which was first reported in 2014, has been shown to play vital roles in a myriad of biological processes including gene transcription, regulation of chromatin functions, purine metabolism, pentose phosphate pathway and glycolysis/gluconeogenesis. Identificatio...

Descripción completa

Detalles Bibliográficos
Autores principales: Jia, Xiaoti, Zhao, Pei, Li, Fuyi, Qin, Zhaohui, Ren, Haoran, Li, Junzhou, Miao, Chunbo, Zhao, Quanzhi, Akutsu, Tatsuya, Dou, Gensheng, Chen, Zhen, Song, Jiangning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10185920/
https://www.ncbi.nlm.nih.gov/pubmed/36880172
http://dx.doi.org/10.1093/bib/bbad063
_version_ 1785042463255166976
author Jia, Xiaoti
Zhao, Pei
Li, Fuyi
Qin, Zhaohui
Ren, Haoran
Li, Junzhou
Miao, Chunbo
Zhao, Quanzhi
Akutsu, Tatsuya
Dou, Gensheng
Chen, Zhen
Song, Jiangning
author_facet Jia, Xiaoti
Zhao, Pei
Li, Fuyi
Qin, Zhaohui
Ren, Haoran
Li, Junzhou
Miao, Chunbo
Zhao, Quanzhi
Akutsu, Tatsuya
Dou, Gensheng
Chen, Zhen
Song, Jiangning
author_sort Jia, Xiaoti
collection PubMed
description Lysine 2-hydroxyisobutylation (K(hib)), which was first reported in 2014, has been shown to play vital roles in a myriad of biological processes including gene transcription, regulation of chromatin functions, purine metabolism, pentose phosphate pathway and glycolysis/gluconeogenesis. Identification of K(hib) sites in protein substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein 2-hydroxyisobutylation. Experimental identification of K(hib) sites mainly depends on the combination of liquid chromatography and mass spectrometry. However, experimental approaches for identifying K(hib) sites are often time-consuming and expensive compared with computational approaches. Previous studies have shown that K(hib) sites may have distinct characteristics for different cell types of the same species. Several tools have been developed to identify K(hib) sites, which exhibit high diversity in their algorithms, encoding schemes and feature selection techniques. However, to date, there are no tools designed for predicting cell type-specific K(hib) sites. Therefore, it is highly desirable to develop an effective predictor for cell type-specific K(hib) site prediction. Inspired by the residual connection of ResNet, we develop a deep learning-based approach, termed ResNetKhib, which leverages both the one-dimensional convolution and transfer learning to enable and improve the prediction of cell type-specific 2-hydroxyisobutylation sites. ResNetKhib is capable of predicting K(hib) sites for four human cell types, mouse liver cell and three rice cell types. Its performance is benchmarked against the commonly used random forest (RF) predictor on both 10-fold cross-validation and independent tests. The results show that ResNetKhib achieves the area under the receiver operating characteristic curve values ranging from 0.807 to 0.901, depending on the cell type and species, which performs better than RF-based predictors and other currently available K(hib) site prediction tools. We also implement an online web server of the proposed ResNetKhib algorithm together with all the curated datasets and trained model for the wider research community to use, which is publicly accessible at https://resnetkhib.erc.monash.edu/.
format Online
Article
Text
id pubmed-10185920
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-101859202023-05-17 ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning Jia, Xiaoti Zhao, Pei Li, Fuyi Qin, Zhaohui Ren, Haoran Li, Junzhou Miao, Chunbo Zhao, Quanzhi Akutsu, Tatsuya Dou, Gensheng Chen, Zhen Song, Jiangning Brief Bioinform Problem Solving Protocol Lysine 2-hydroxyisobutylation (K(hib)), which was first reported in 2014, has been shown to play vital roles in a myriad of biological processes including gene transcription, regulation of chromatin functions, purine metabolism, pentose phosphate pathway and glycolysis/gluconeogenesis. Identification of K(hib) sites in protein substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein 2-hydroxyisobutylation. Experimental identification of K(hib) sites mainly depends on the combination of liquid chromatography and mass spectrometry. However, experimental approaches for identifying K(hib) sites are often time-consuming and expensive compared with computational approaches. Previous studies have shown that K(hib) sites may have distinct characteristics for different cell types of the same species. Several tools have been developed to identify K(hib) sites, which exhibit high diversity in their algorithms, encoding schemes and feature selection techniques. However, to date, there are no tools designed for predicting cell type-specific K(hib) sites. Therefore, it is highly desirable to develop an effective predictor for cell type-specific K(hib) site prediction. Inspired by the residual connection of ResNet, we develop a deep learning-based approach, termed ResNetKhib, which leverages both the one-dimensional convolution and transfer learning to enable and improve the prediction of cell type-specific 2-hydroxyisobutylation sites. ResNetKhib is capable of predicting K(hib) sites for four human cell types, mouse liver cell and three rice cell types. Its performance is benchmarked against the commonly used random forest (RF) predictor on both 10-fold cross-validation and independent tests. The results show that ResNetKhib achieves the area under the receiver operating characteristic curve values ranging from 0.807 to 0.901, depending on the cell type and species, which performs better than RF-based predictors and other currently available K(hib) site prediction tools. We also implement an online web server of the proposed ResNetKhib algorithm together with all the curated datasets and trained model for the wider research community to use, which is publicly accessible at https://resnetkhib.erc.monash.edu/. Oxford University Press 2023-03-04 /pmc/articles/PMC10185920/ /pubmed/36880172 http://dx.doi.org/10.1093/bib/bbad063 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Problem Solving Protocol
Jia, Xiaoti
Zhao, Pei
Li, Fuyi
Qin, Zhaohui
Ren, Haoran
Li, Junzhou
Miao, Chunbo
Zhao, Quanzhi
Akutsu, Tatsuya
Dou, Gensheng
Chen, Zhen
Song, Jiangning
ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
title ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
title_full ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
title_fullStr ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
title_full_unstemmed ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
title_short ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
title_sort resnetkhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
topic Problem Solving Protocol
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10185920/
https://www.ncbi.nlm.nih.gov/pubmed/36880172
http://dx.doi.org/10.1093/bib/bbad063
work_keys_str_mv AT jiaxiaoti resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT zhaopei resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT lifuyi resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT qinzhaohui resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT renhaoran resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT lijunzhou resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT miaochunbo resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT zhaoquanzhi resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT akutsutatsuya resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT dougensheng resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT chenzhen resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning
AT songjiangning resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning