Cargando…

ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet

BACKGROUND: Studies have proven that the same family of non-coding RNAs (ncRNAs) have similar functions, so predicting the ncRNAs family is helpful to the research of ncRNAs functions. The existing calculation methods mainly fall into two categories: the first type is to predict ncRNAs family by lea...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Linyu, Zhong, Xiaodan, Wang, Shuo, Liu, Yuanning
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8451086/
https://www.ncbi.nlm.nih.gov/pubmed/34544356
http://dx.doi.org/10.1186/s12859-021-04365-4
_version_ 1784569768563441664
author Wang, Linyu
Zhong, Xiaodan
Wang, Shuo
Liu, Yuanning
author_facet Wang, Linyu
Zhong, Xiaodan
Wang, Shuo
Liu, Yuanning
author_sort Wang, Linyu
collection PubMed
description BACKGROUND: Studies have proven that the same family of non-coding RNAs (ncRNAs) have similar functions, so predicting the ncRNAs family is helpful to the research of ncRNAs functions. The existing calculation methods mainly fall into two categories: the first type is to predict ncRNAs family by learning the features of sequence or secondary structure, and the other type is to predict ncRNAs family by the alignment among homologs sequences. In the first type, some methods predict ncRNAs family by learning predicted secondary structure features. The inaccuracy of predicted secondary structure may cause the low accuracy of those methods. Different from that, ncRFP directly learning the features of ncRNA sequences to predict ncRNAs family. Although ncRFP simplifies the prediction process and improves the performance, there is room for improvement in ncRFP performance due to the incomplete features of its input data. In the secondary type, the homologous sequence alignment method can achieve the highest performance at present. However, due to the need for consensus secondary structure annotation of ncRNA sequences, and the helplessness for modeling pseudoknots, the use of the method is limited. RESULTS: In this paper, a novel method “ncDLRES”, which according to learning the sequence features, is proposed to predict the family of ncRNAs based on Dynamic LSTM (Long Short-term Memory) and ResNet (Residual Neural Network). CONCLUSIONS: ncDLRES extracts the features of ncRNA sequences based on Dynamic LSTM and then classifies them by ResNet. Compared with the homologous sequence alignment method, ncDLRES reduces the data requirement and expands the application scope. By comparing with the first type of methods, the performance of ncDLRES is greatly improved.
format Online
Article
Text
id pubmed-8451086
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-84510862021-09-20 ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet Wang, Linyu Zhong, Xiaodan Wang, Shuo Liu, Yuanning BMC Bioinformatics Research BACKGROUND: Studies have proven that the same family of non-coding RNAs (ncRNAs) have similar functions, so predicting the ncRNAs family is helpful to the research of ncRNAs functions. The existing calculation methods mainly fall into two categories: the first type is to predict ncRNAs family by learning the features of sequence or secondary structure, and the other type is to predict ncRNAs family by the alignment among homologs sequences. In the first type, some methods predict ncRNAs family by learning predicted secondary structure features. The inaccuracy of predicted secondary structure may cause the low accuracy of those methods. Different from that, ncRFP directly learning the features of ncRNA sequences to predict ncRNAs family. Although ncRFP simplifies the prediction process and improves the performance, there is room for improvement in ncRFP performance due to the incomplete features of its input data. In the secondary type, the homologous sequence alignment method can achieve the highest performance at present. However, due to the need for consensus secondary structure annotation of ncRNA sequences, and the helplessness for modeling pseudoknots, the use of the method is limited. RESULTS: In this paper, a novel method “ncDLRES”, which according to learning the sequence features, is proposed to predict the family of ncRNAs based on Dynamic LSTM (Long Short-term Memory) and ResNet (Residual Neural Network). CONCLUSIONS: ncDLRES extracts the features of ncRNA sequences based on Dynamic LSTM and then classifies them by ResNet. Compared with the homologous sequence alignment method, ncDLRES reduces the data requirement and expands the application scope. By comparing with the first type of methods, the performance of ncDLRES is greatly improved. BioMed Central 2021-09-20 /pmc/articles/PMC8451086/ /pubmed/34544356 http://dx.doi.org/10.1186/s12859-021-04365-4 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Wang, Linyu
Zhong, Xiaodan
Wang, Shuo
Liu, Yuanning
ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet
title ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet
title_full ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet
title_fullStr ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet
title_full_unstemmed ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet
title_short ncDLRES: a novel method for non-coding RNAs family prediction based on dynamic LSTM and ResNet
title_sort ncdlres: a novel method for non-coding rnas family prediction based on dynamic lstm and resnet
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8451086/
https://www.ncbi.nlm.nih.gov/pubmed/34544356
http://dx.doi.org/10.1186/s12859-021-04365-4
work_keys_str_mv AT wanglinyu ncdlresanovelmethodfornoncodingrnasfamilypredictionbasedondynamiclstmandresnet
AT zhongxiaodan ncdlresanovelmethodfornoncodingrnasfamilypredictionbasedondynamiclstmandresnet
AT wangshuo ncdlresanovelmethodfornoncodingrnasfamilypredictionbasedondynamiclstmandresnet
AT liuyuanning ncdlresanovelmethodfornoncodingrnasfamilypredictionbasedondynamiclstmandresnet