Cargando…
ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning
Lysine 2-hydroxyisobutylation (K(hib)), which was first reported in 2014, has been shown to play vital roles in a myriad of biological processes including gene transcription, regulation of chromatin functions, purine metabolism, pentose phosphate pathway and glycolysis/gluconeogenesis. Identificatio...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10185920/ https://www.ncbi.nlm.nih.gov/pubmed/36880172 http://dx.doi.org/10.1093/bib/bbad063 |
_version_ | 1785042463255166976 |
---|---|
author | Jia, Xiaoti Zhao, Pei Li, Fuyi Qin, Zhaohui Ren, Haoran Li, Junzhou Miao, Chunbo Zhao, Quanzhi Akutsu, Tatsuya Dou, Gensheng Chen, Zhen Song, Jiangning |
author_facet | Jia, Xiaoti Zhao, Pei Li, Fuyi Qin, Zhaohui Ren, Haoran Li, Junzhou Miao, Chunbo Zhao, Quanzhi Akutsu, Tatsuya Dou, Gensheng Chen, Zhen Song, Jiangning |
author_sort | Jia, Xiaoti |
collection | PubMed |
description | Lysine 2-hydroxyisobutylation (K(hib)), which was first reported in 2014, has been shown to play vital roles in a myriad of biological processes including gene transcription, regulation of chromatin functions, purine metabolism, pentose phosphate pathway and glycolysis/gluconeogenesis. Identification of K(hib) sites in protein substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein 2-hydroxyisobutylation. Experimental identification of K(hib) sites mainly depends on the combination of liquid chromatography and mass spectrometry. However, experimental approaches for identifying K(hib) sites are often time-consuming and expensive compared with computational approaches. Previous studies have shown that K(hib) sites may have distinct characteristics for different cell types of the same species. Several tools have been developed to identify K(hib) sites, which exhibit high diversity in their algorithms, encoding schemes and feature selection techniques. However, to date, there are no tools designed for predicting cell type-specific K(hib) sites. Therefore, it is highly desirable to develop an effective predictor for cell type-specific K(hib) site prediction. Inspired by the residual connection of ResNet, we develop a deep learning-based approach, termed ResNetKhib, which leverages both the one-dimensional convolution and transfer learning to enable and improve the prediction of cell type-specific 2-hydroxyisobutylation sites. ResNetKhib is capable of predicting K(hib) sites for four human cell types, mouse liver cell and three rice cell types. Its performance is benchmarked against the commonly used random forest (RF) predictor on both 10-fold cross-validation and independent tests. The results show that ResNetKhib achieves the area under the receiver operating characteristic curve values ranging from 0.807 to 0.901, depending on the cell type and species, which performs better than RF-based predictors and other currently available K(hib) site prediction tools. We also implement an online web server of the proposed ResNetKhib algorithm together with all the curated datasets and trained model for the wider research community to use, which is publicly accessible at https://resnetkhib.erc.monash.edu/. |
format | Online Article Text |
id | pubmed-10185920 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-101859202023-05-17 ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning Jia, Xiaoti Zhao, Pei Li, Fuyi Qin, Zhaohui Ren, Haoran Li, Junzhou Miao, Chunbo Zhao, Quanzhi Akutsu, Tatsuya Dou, Gensheng Chen, Zhen Song, Jiangning Brief Bioinform Problem Solving Protocol Lysine 2-hydroxyisobutylation (K(hib)), which was first reported in 2014, has been shown to play vital roles in a myriad of biological processes including gene transcription, regulation of chromatin functions, purine metabolism, pentose phosphate pathway and glycolysis/gluconeogenesis. Identification of K(hib) sites in protein substrates represents an initial but crucial step in elucidating the molecular mechanisms underlying protein 2-hydroxyisobutylation. Experimental identification of K(hib) sites mainly depends on the combination of liquid chromatography and mass spectrometry. However, experimental approaches for identifying K(hib) sites are often time-consuming and expensive compared with computational approaches. Previous studies have shown that K(hib) sites may have distinct characteristics for different cell types of the same species. Several tools have been developed to identify K(hib) sites, which exhibit high diversity in their algorithms, encoding schemes and feature selection techniques. However, to date, there are no tools designed for predicting cell type-specific K(hib) sites. Therefore, it is highly desirable to develop an effective predictor for cell type-specific K(hib) site prediction. Inspired by the residual connection of ResNet, we develop a deep learning-based approach, termed ResNetKhib, which leverages both the one-dimensional convolution and transfer learning to enable and improve the prediction of cell type-specific 2-hydroxyisobutylation sites. ResNetKhib is capable of predicting K(hib) sites for four human cell types, mouse liver cell and three rice cell types. Its performance is benchmarked against the commonly used random forest (RF) predictor on both 10-fold cross-validation and independent tests. The results show that ResNetKhib achieves the area under the receiver operating characteristic curve values ranging from 0.807 to 0.901, depending on the cell type and species, which performs better than RF-based predictors and other currently available K(hib) site prediction tools. We also implement an online web server of the proposed ResNetKhib algorithm together with all the curated datasets and trained model for the wider research community to use, which is publicly accessible at https://resnetkhib.erc.monash.edu/. Oxford University Press 2023-03-04 /pmc/articles/PMC10185920/ /pubmed/36880172 http://dx.doi.org/10.1093/bib/bbad063 Text en © The Author(s) 2023. Published by Oxford University Press. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Problem Solving Protocol Jia, Xiaoti Zhao, Pei Li, Fuyi Qin, Zhaohui Ren, Haoran Li, Junzhou Miao, Chunbo Zhao, Quanzhi Akutsu, Tatsuya Dou, Gensheng Chen, Zhen Song, Jiangning ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
title | ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
title_full | ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
title_fullStr | ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
title_full_unstemmed | ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
title_short | ResNetKhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
title_sort | resnetkhib: a novel cell type-specific tool for predicting lysine 2-hydroxyisobutylation sites via transfer learning |
topic | Problem Solving Protocol |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10185920/ https://www.ncbi.nlm.nih.gov/pubmed/36880172 http://dx.doi.org/10.1093/bib/bbad063 |
work_keys_str_mv | AT jiaxiaoti resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT zhaopei resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT lifuyi resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT qinzhaohui resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT renhaoran resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT lijunzhou resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT miaochunbo resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT zhaoquanzhi resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT akutsutatsuya resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT dougensheng resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT chenzhen resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning AT songjiangning resnetkhibanovelcelltypespecifictoolforpredictinglysine2hydroxyisobutylationsitesviatransferlearning |