Cargando…
iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss
Lysine crotonylation (Kcr) is a newly discovered protein post-translational modification and has been proved to be widely involved in various biological processes and human diseases. Thus, the accurate and fast identification of this modification became the preliminary task in investigating the rela...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9251780/ https://www.ncbi.nlm.nih.gov/pubmed/35832615 http://dx.doi.org/10.1016/j.csbj.2022.06.032 |
_version_ | 1784740106293215232 |
---|---|
author | Dou, Lijun Zhang, Zilong Xu, Lei Zou, Quan |
author_facet | Dou, Lijun Zhang, Zilong Xu, Lei Zou, Quan |
author_sort | Dou, Lijun |
collection | PubMed |
description | Lysine crotonylation (Kcr) is a newly discovered protein post-translational modification and has been proved to be widely involved in various biological processes and human diseases. Thus, the accurate and fast identification of this modification became the preliminary task in investigating the related biological functions. Due to the long duration, high cost and intensity of traditional high-throughput experimental techniques, constructing bioinformatics predictors based on machine learning algorithms is treated as a most popular solution. Although dozens of predictors have been reported to identify Kcr sites, only two, nhKcr and DeepKcrot, focused on human nonhistone protein sequences. Moreover, due to the imbalance nature of data distribution, associated detection performance is severely biased towards the major negative samples and remains much room for improvement. In this research, we developed a convolutional neural network framework, dubbed iKcr_CNN, to identify the human nonhistone Kcr modification. To overcome the imbalance issue (Kcr: 15,274; non-Kcr: 74,018 with imbalance ratio: 1:4), we applied the focal loss function instead of the standard cross-entropy as the indicator to optimize the model, which not only assigns different weights to samples belonging to different categories but also distinguishes easy- and hard-classified samples. Ultimately, the obtained model presents more balanced prediction scores between real-world positive and negative samples than existing tools. The user-friendly web server is accessible at ikcrcnn.webmalab.cn/, and the involved Python scripts can be conveniently downloaded at github.com/lijundou/iKcr_CNN/. The proposed model may serve as an efficient tool to assist academicians with their experimental researches. |
format | Online Article Text |
id | pubmed-9251780 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-92517802022-07-12 iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss Dou, Lijun Zhang, Zilong Xu, Lei Zou, Quan Comput Struct Biotechnol J Research Article Lysine crotonylation (Kcr) is a newly discovered protein post-translational modification and has been proved to be widely involved in various biological processes and human diseases. Thus, the accurate and fast identification of this modification became the preliminary task in investigating the related biological functions. Due to the long duration, high cost and intensity of traditional high-throughput experimental techniques, constructing bioinformatics predictors based on machine learning algorithms is treated as a most popular solution. Although dozens of predictors have been reported to identify Kcr sites, only two, nhKcr and DeepKcrot, focused on human nonhistone protein sequences. Moreover, due to the imbalance nature of data distribution, associated detection performance is severely biased towards the major negative samples and remains much room for improvement. In this research, we developed a convolutional neural network framework, dubbed iKcr_CNN, to identify the human nonhistone Kcr modification. To overcome the imbalance issue (Kcr: 15,274; non-Kcr: 74,018 with imbalance ratio: 1:4), we applied the focal loss function instead of the standard cross-entropy as the indicator to optimize the model, which not only assigns different weights to samples belonging to different categories but also distinguishes easy- and hard-classified samples. Ultimately, the obtained model presents more balanced prediction scores between real-world positive and negative samples than existing tools. The user-friendly web server is accessible at ikcrcnn.webmalab.cn/, and the involved Python scripts can be conveniently downloaded at github.com/lijundou/iKcr_CNN/. The proposed model may serve as an efficient tool to assist academicians with their experimental researches. Research Network of Computational and Structural Biotechnology 2022-06-16 /pmc/articles/PMC9251780/ /pubmed/35832615 http://dx.doi.org/10.1016/j.csbj.2022.06.032 Text en © 2022 The Authors https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Dou, Lijun Zhang, Zilong Xu, Lei Zou, Quan iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
title | iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
title_full | iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
title_fullStr | iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
title_full_unstemmed | iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
title_short | iKcr_CNN: A novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
title_sort | ikcr_cnn: a novel computational tool for imbalance classification of human nonhistone crotonylation sites based on convolutional neural networks with focal loss |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9251780/ https://www.ncbi.nlm.nih.gov/pubmed/35832615 http://dx.doi.org/10.1016/j.csbj.2022.06.032 |
work_keys_str_mv | AT doulijun ikcrcnnanovelcomputationaltoolforimbalanceclassificationofhumannonhistonecrotonylationsitesbasedonconvolutionalneuralnetworkswithfocalloss AT zhangzilong ikcrcnnanovelcomputationaltoolforimbalanceclassificationofhumannonhistonecrotonylationsitesbasedonconvolutionalneuralnetworkswithfocalloss AT xulei ikcrcnnanovelcomputationaltoolforimbalanceclassificationofhumannonhistonecrotonylationsitesbasedonconvolutionalneuralnetworkswithfocalloss AT zouquan ikcrcnnanovelcomputationaltoolforimbalanceclassificationofhumannonhistonecrotonylationsitesbasedonconvolutionalneuralnetworkswithfocalloss |