Cargando…

DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins

Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding pro...

Descripción completa

Detalles Bibliográficos
Autores principales: Cui, Feifei, Li, Shuang, Zhang, Zilong, Sui, Miaomiao, Cao, Chen, El-Latif Hesham, Abd, Zou, Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Research Network of Computational and Structural Biotechnology 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9065708/
https://www.ncbi.nlm.nih.gov/pubmed/35521556
http://dx.doi.org/10.1016/j.csbj.2022.04.029
_version_ 1784699644717039616
author Cui, Feifei
Li, Shuang
Zhang, Zilong
Sui, Miaomiao
Cao, Chen
El-Latif Hesham, Abd
Zou, Quan
author_facet Cui, Feifei
Li, Shuang
Zhang, Zilong
Sui, Miaomiao
Cao, Chen
El-Latif Hesham, Abd
Zou, Quan
author_sort Cui, Feifei
collection PubMed
description Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding proteins (DRBPs), and the other is a cross-predicting problem referring to DBP predictors predicting DBPs as RBPs, and vice versa. In this study, we proposed a computational predictor, called DeepMC-iNABP, with the goal of solving these difficulties by utilizing a multiclass classification strategy and deep learning approaches. DBPs, RBPs, DRBPs and non-NABPs as separate classes of data were used for training the DeepMC-iNABP model. The results on test data collected in this study and two independent test datasets showed that DeepMC-iNABP has a strong advantage in identifying the DRBPs and has the ability to alleviate the cross-prediction problem to a certain extent. The web-server of DeepMC-iNABP is freely available at http://www.deepmc-inabp.net/. The datasets used in this research can also be downloaded from the website.
format Online
Article
Text
id pubmed-9065708
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Research Network of Computational and Structural Biotechnology
record_format MEDLINE/PubMed
spelling pubmed-90657082022-05-04 DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins Cui, Feifei Li, Shuang Zhang, Zilong Sui, Miaomiao Cao, Chen El-Latif Hesham, Abd Zou, Quan Comput Struct Biotechnol J Research Article Nucleic acid-binding proteins (NABPs), including DNA-binding proteins (DBPs) and RNA-binding proteins (RBPs), play vital roles in gene expression. Accurate identification of these proteins is crucial. However, there are two existing challenges: one is the problem of ignoring DNA- and RNA-binding proteins (DRBPs), and the other is a cross-predicting problem referring to DBP predictors predicting DBPs as RBPs, and vice versa. In this study, we proposed a computational predictor, called DeepMC-iNABP, with the goal of solving these difficulties by utilizing a multiclass classification strategy and deep learning approaches. DBPs, RBPs, DRBPs and non-NABPs as separate classes of data were used for training the DeepMC-iNABP model. The results on test data collected in this study and two independent test datasets showed that DeepMC-iNABP has a strong advantage in identifying the DRBPs and has the ability to alleviate the cross-prediction problem to a certain extent. The web-server of DeepMC-iNABP is freely available at http://www.deepmc-inabp.net/. The datasets used in this research can also be downloaded from the website. Research Network of Computational and Structural Biotechnology 2022-04-26 /pmc/articles/PMC9065708/ /pubmed/35521556 http://dx.doi.org/10.1016/j.csbj.2022.04.029 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Research Article
Cui, Feifei
Li, Shuang
Zhang, Zilong
Sui, Miaomiao
Cao, Chen
El-Latif Hesham, Abd
Zou, Quan
DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
title DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
title_full DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
title_fullStr DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
title_full_unstemmed DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
title_short DeepMC-iNABP: Deep learning for multiclass identification and classification of nucleic acid-binding proteins
title_sort deepmc-inabp: deep learning for multiclass identification and classification of nucleic acid-binding proteins
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9065708/
https://www.ncbi.nlm.nih.gov/pubmed/35521556
http://dx.doi.org/10.1016/j.csbj.2022.04.029
work_keys_str_mv AT cuifeifei deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins
AT lishuang deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins
AT zhangzilong deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins
AT suimiaomiao deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins
AT caochen deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins
AT ellatifheshamabd deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins
AT zouquan deepmcinabpdeeplearningformulticlassidentificationandclassificationofnucleicacidbindingproteins