Cargando…

ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction

BACKGROUND: Although research on non-coding RNAs (ncRNAs) is a hot topic in life sciences, the functions of numerous ncRNAs remain unclear. In recent years, researchers have found that ncRNAs of the same family have similar functions, therefore, it is important to accurately predict ncRNAs families...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Kai, Zhu, Xiaodong, Wang, Jiahao, Hao, Lei, Liu, Zhen, Liu, Yuanning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9972773/
https://www.ncbi.nlm.nih.gov/pubmed/36849908
http://dx.doi.org/10.1186/s12859-023-05191-6
_version_ 1784898387956465664
author Chen, Kai
Zhu, Xiaodong
Wang, Jiahao
Hao, Lei
Liu, Zhen
Liu, Yuanning
author_facet Chen, Kai
Zhu, Xiaodong
Wang, Jiahao
Hao, Lei
Liu, Zhen
Liu, Yuanning
author_sort Chen, Kai
collection PubMed
description BACKGROUND: Although research on non-coding RNAs (ncRNAs) is a hot topic in life sciences, the functions of numerous ncRNAs remain unclear. In recent years, researchers have found that ncRNAs of the same family have similar functions, therefore, it is important to accurately predict ncRNAs families to identify their functions. There are several methods available to solve the prediction problem of ncRNAs family, whose main ideas can be divided into two categories, including prediction based on the secondary structure features of ncRNAs, and prediction according to sequence features of ncRNAs. The first type of prediction method requires a complicated process and has a low accuracy in obtaining the secondary structure of ncRNAs, while the second type of method has a simple prediction process and a high accuracy, but there is still room for improvement. The existing methods for ncRNAs family prediction are associated with problems such as complicated prediction processes and low accuracy, in this regard, it is necessary to propose a new method to predict the ncRNAs family more perfectly. RESULTS: A deep learning model-based method, ncDENSE, was proposed in this study, which predicted ncRNAs families by extracting ncRNAs sequence features. The bases in ncRNAs sequences were encoded by one-hot coding and later fed into an ensemble deep learning model, which contained the dynamic bi-directional gated recurrent unit (Bi-GRU), the dense convolutional network (DenseNet), and the Attention Mechanism (AM). To be specific, dynamic Bi-GRU was used to extract contextual feature information and capture long-term dependencies of ncRNAs sequences. AM was employed to assign different weights to features extracted by Bi-GRU and focused the attention on information with greater weights. Whereas DenseNet was adopted to extract local feature information of ncRNAs sequences and classify them by the full connection layer. According to our results, the ncDENSE method improved the Accuracy, Sensitivity, Precision, F-score, and MCC by 2.08[Formula: see text] , 2.33[Formula: see text] , 2.14[Formula: see text] , 2.16[Formula: see text] , and 2.39[Formula: see text] , respectively, compared with the suboptimal method. CONCLUSIONS: Overall, the ncDENSE method proposed in this paper extracts sequence features of ncRNAs by dynamic Bi-GRU and DenseNet and improves the accuracy in predicting ncRNAs family and other data.
format Online
Article
Text
id pubmed-9972773
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-99727732023-03-01 ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction Chen, Kai Zhu, Xiaodong Wang, Jiahao Hao, Lei Liu, Zhen Liu, Yuanning BMC Bioinformatics Research BACKGROUND: Although research on non-coding RNAs (ncRNAs) is a hot topic in life sciences, the functions of numerous ncRNAs remain unclear. In recent years, researchers have found that ncRNAs of the same family have similar functions, therefore, it is important to accurately predict ncRNAs families to identify their functions. There are several methods available to solve the prediction problem of ncRNAs family, whose main ideas can be divided into two categories, including prediction based on the secondary structure features of ncRNAs, and prediction according to sequence features of ncRNAs. The first type of prediction method requires a complicated process and has a low accuracy in obtaining the secondary structure of ncRNAs, while the second type of method has a simple prediction process and a high accuracy, but there is still room for improvement. The existing methods for ncRNAs family prediction are associated with problems such as complicated prediction processes and low accuracy, in this regard, it is necessary to propose a new method to predict the ncRNAs family more perfectly. RESULTS: A deep learning model-based method, ncDENSE, was proposed in this study, which predicted ncRNAs families by extracting ncRNAs sequence features. The bases in ncRNAs sequences were encoded by one-hot coding and later fed into an ensemble deep learning model, which contained the dynamic bi-directional gated recurrent unit (Bi-GRU), the dense convolutional network (DenseNet), and the Attention Mechanism (AM). To be specific, dynamic Bi-GRU was used to extract contextual feature information and capture long-term dependencies of ncRNAs sequences. AM was employed to assign different weights to features extracted by Bi-GRU and focused the attention on information with greater weights. Whereas DenseNet was adopted to extract local feature information of ncRNAs sequences and classify them by the full connection layer. According to our results, the ncDENSE method improved the Accuracy, Sensitivity, Precision, F-score, and MCC by 2.08[Formula: see text] , 2.33[Formula: see text] , 2.14[Formula: see text] , 2.16[Formula: see text] , and 2.39[Formula: see text] , respectively, compared with the suboptimal method. CONCLUSIONS: Overall, the ncDENSE method proposed in this paper extracts sequence features of ncRNAs by dynamic Bi-GRU and DenseNet and improves the accuracy in predicting ncRNAs family and other data. BioMed Central 2023-02-27 /pmc/articles/PMC9972773/ /pubmed/36849908 http://dx.doi.org/10.1186/s12859-023-05191-6 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Chen, Kai
Zhu, Xiaodong
Wang, Jiahao
Hao, Lei
Liu, Zhen
Liu, Yuanning
ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction
title ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction
title_full ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction
title_fullStr ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction
title_full_unstemmed ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction
title_short ncDENSE: a novel computational method based on a deep learning framework for non-coding RNAs family prediction
title_sort ncdense: a novel computational method based on a deep learning framework for non-coding rnas family prediction
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9972773/
https://www.ncbi.nlm.nih.gov/pubmed/36849908
http://dx.doi.org/10.1186/s12859-023-05191-6
work_keys_str_mv AT chenkai ncdenseanovelcomputationalmethodbasedonadeeplearningframeworkfornoncodingrnasfamilyprediction
AT zhuxiaodong ncdenseanovelcomputationalmethodbasedonadeeplearningframeworkfornoncodingrnasfamilyprediction
AT wangjiahao ncdenseanovelcomputationalmethodbasedonadeeplearningframeworkfornoncodingrnasfamilyprediction
AT haolei ncdenseanovelcomputationalmethodbasedonadeeplearningframeworkfornoncodingrnasfamilyprediction
AT liuzhen ncdenseanovelcomputationalmethodbasedonadeeplearningframeworkfornoncodingrnasfamilyprediction
AT liuyuanning ncdenseanovelcomputationalmethodbasedonadeeplearningframeworkfornoncodingrnasfamilyprediction