Cargando…
DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding pro...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Research Network of Computational and Structural Biotechnology
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9065708/ https://www.ncbi.nlm.nih.gov/pubmed/35521556 http://dx.doi.org/10.1016/j.csbj.2022.04.029 |
_version_ | 1784699644717039616 |
---|---|
author | Cui, Feifei Li, Shuang Zhang, Zilong Sui, Miaomiao Cao, Chen El-Latif Hesham, Abd Zou, Quan |
author_facet | Cui, Feifei Li, Shuang Zhang, Zilong Sui, Miaomiao Cao, Chen El-Latif Hesham, Abd Zou, Quan |
author_sort | Cui, Feifei |
collection | PubMed |
description | Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding proteins (DRBPs), and the other is a cross-predicting problem referring to DBP predictors predicting DBPs as RBPs, and vice versa. In this study, we proposed a computational predictor, called DeepMC-iNABP, with the goal of solving these difficulties by utilizing a multiclass classification strategy and deep learning approaches. DBPs, RBPs, DRBPs and non-NABPs as separate classes of data were used for training the DeepMC-iNABP model. The results on test data collected in this study and two independent test datasets showed that DeepMC-iNABP has a strong advantage in identifying the DRBPs and has the ability to alleviate the cross-prediction problem to a certain extent. The web-server of DeepMC-iNABP is freely available at http://www.deepmc-inabp.net/. The datasets used in this research can also be downloaded from the website. |
format | Online Article Text |
id | pubmed-9065708 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Research Network of Computational and Structural Biotechnology |
record_format | MEDLINE/PubMed |
spelling | pubmed-90657082022-05-04 DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins Cui, Feifei Li, Shuang Zhang, Zilong Sui, Miaomiao Cao, Chen El-Latif Hesham, Abd Zou, Quan Comput Struct Biotechnol J Research Article Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding proteins (DRBPs), and the other is a cross-predicting problem referring to DBP predictors predicting DBPs as RBPs, and vice versa. In this study, we proposed a computational predictor, called DeepMC-iNABP, with the goal of solving these difficulties by utilizing a multiclass classification strategy and deep learning approaches. DBPs, RBPs, DRBPs and non-NABPs as separate classes of data were used for training the DeepMC-iNABP model. The results on test data collected in this study and two independent test datasets showed that DeepMC-iNABP has a strong advantage in identifying the DRBPs and has the ability to alleviate the cross-prediction problem to a certain extent. The web-server of DeepMC-iNABP is freely available at http://www.deepmc-inabp.net/. The datasets used in this research can also be downloaded from the website. Research Network of Computational and Structural Biotechnology 2022-04-26 /pmc/articles/PMC9065708/ /pubmed/35521556 http://dx.doi.org/10.1016/j.csbj.2022.04.029 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/). |
spellingShingle | Research Article Cui, Feifei Li, Shuang Zhang, Zilong Sui, Miaomiao Cao, Chen El-Latif Hesham, Abd Zou, Quan DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins |
title | DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins |
title_full | DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins |
title_fullStr | DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins |
title_full_unstemmed | DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins |
title_short | DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins |
title_sort | deepmc-inabp: deep learning for multiclass identification and classification of nucleic acid-binding proteins |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9065708/ https://www.ncbi.nlm.nih.gov/pubmed/35521556 http://dx.doi.org/10.1016/j.csbj.2022.04.029 |
work_keys_str_mv | AT cuifeifei deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins AT lishuang deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins AT zhangzilong deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins AT suimiaomiao deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins AT caochen deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins AT ellatifheshamabd deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins AT zouquan deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins |