
A New Method of RNA Secondary Structure Prediction Based on Convolutional Neural Network and Dynamic Programming

In recent years, RNA secondary structure information has played an important role in research on RNA and gene function. Although some RNA secondary structures can be determined experimentally, in most cases efficient and accurate computational methods are still needed to predict RNA secondary s...


Bibliographic Details
Main authors: Zhang, Hao; Zhang, Chunhe; Li, Zhi; Li, Cong; Wei, Xu; Zhang, Borui; Liu, Yuanning
Format: Online Article Text
Language: English
Published: Frontiers Media S.A. 2019
Subjects: Genetics
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540740/
https://www.ncbi.nlm.nih.gov/pubmed/31191603
http://dx.doi.org/10.3389/fgene.2019.00467
author Zhang, Hao
Zhang, Chunhe
Li, Zhi
Li, Cong
Wei, Xu
Zhang, Borui
Liu, Yuanning
collection PubMed
description In recent years, RNA secondary structure information has played an important role in research on RNA and gene function. Although some RNA secondary structures can be determined experimentally, in most cases efficient and accurate computational methods are still needed to predict RNA secondary structure. Current RNA secondary structure prediction methods are mainly based on the minimum free energy algorithm, which uses an iterative method to find the optimal folding state of RNA in vivo subject to minimum energy or other constraints. However, because of the complexity of the biological environment, a real RNA structure maintains a balance of biological potential energy rather than the optimal folding state of minimum energy. For a short RNA sequence, this equilibrium energy state in the folding organism is close to the minimum free energy state, so the minimum free energy algorithm predicts its secondary structure with relatively high accuracy. In a longer RNA sequence, however, continued folding causes the biopotential energy balance to deviate far from the minimum free energy state. This deviation stems from the complexity of the structure and leads to a serious decline in the accuracy of secondary structure prediction. In this paper, we propose a novel RNA secondary structure prediction algorithm that combines a convolutional neural network model with a dynamic programming method to improve accuracy using large-scale RNA sequence and structure data. We analyze current experimental RNA sequence and structure data to construct a deep convolutional network model, and then extract implicit features that support effective classification from large-scale data to predict the pairing probability of each base in an RNA sequence. An enhanced dynamic programming method is then applied to these base-pairing probabilities to obtain the optimal RNA secondary structure. Results indicate that the proposed method is superior to common RNA secondary structure prediction algorithms on three benchmark RNA families. Based on the characteristics of deep learning algorithms, it can be inferred that the method proposed in this paper has a 30% higher prediction success rate than other algorithms, which will be needed as the amount of real RNA structure data increases in the future.
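The second stage described above, turning per-base pairing probabilities into a structure by dynamic programming, can be illustrated with a minimal sketch. This is not the authors' enhanced method: it is a plain Nussinov-style recursion, and the function name `fold_from_probabilities`, the pair score `p[i] * p[j]`, the allowed pair set, and the minimum hairpin loop length of 3 are all assumptions made for illustration. The probabilities would come from the trained network; here they are supplied by hand.

```python
from typing import List

# Assumed pairing rules: Watson-Crick pairs plus the G-U wobble pair.
CANONICAL = {("A", "U"), ("U", "A"), ("G", "C"), ("C", "G"), ("G", "U"), ("U", "G")}
MIN_LOOP = 3  # assumed minimum number of unpaired bases inside a hairpin


def fold_from_probabilities(seq: str, p: List[float]) -> str:
    """Return a dot-bracket structure maximising the summed pair scores.

    seq: RNA sequence over A, C, G, U.
    p:   hypothetical per-base pairing probabilities, one per position.
    """
    n = len(seq)
    if n == 0:
        return ""
    # score[i][j] = best total score for the subsequence seq[i..j].
    score = [[0.0] * n for _ in range(n)]
    for span in range(MIN_LOOP + 1, n):
        for i in range(n - span):
            j = i + span
            best = max(score[i + 1][j], score[i][j - 1])  # i or j unpaired
            if (seq[i], seq[j]) in CANONICAL:
                best = max(best, score[i + 1][j - 1] + p[i] * p[j])  # i pairs with j
            for k in range(i + 1, j):  # split into two independent parts
                best = max(best, score[i][k] + score[k + 1][j])
            score[i][j] = best

    # Traceback: re-derive which case produced each optimal score.
    structure = ["."] * n
    stack = [(0, n - 1)]
    while stack:
        i, j = stack.pop()
        if j - i <= MIN_LOOP:
            continue  # too short to contain a pair
        if score[i][j] == score[i + 1][j]:
            stack.append((i + 1, j))
        elif score[i][j] == score[i][j - 1]:
            stack.append((i, j - 1))
        elif (seq[i], seq[j]) in CANONICAL and score[i][j] == score[i + 1][j - 1] + p[i] * p[j]:
            structure[i], structure[j] = "(", ")"
            stack.append((i + 1, j - 1))
        else:
            for k in range(i + 1, j):
                if score[i][j] == score[i][k] + score[k + 1][j]:
                    stack.append((i, k))
                    stack.append((k + 1, j))
                    break
    return "".join(structure)


print(fold_from_probabilities("GGGAAACCC", [1.0] * 9))
```

The cubic-time recursion is the simplest choice that respects nesting; like other Nussinov-style schemes, it cannot represent pseudoknots, and a real pipeline might instead score pairs from a full probability matrix.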
format Online
Article
Text
id pubmed-6540740
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-6540740 2019-06-12 A New Method of RNA Secondary Structure Prediction Based on Convolutional Neural Network and Dynamic Programming. Zhang, Hao; Zhang, Chunhe; Li, Zhi; Li, Cong; Wei, Xu; Zhang, Borui; Liu, Yuanning. Front Genet. Genetics. Frontiers Media S.A. 2019-05-22 /pmc/articles/PMC6540740/ /pubmed/31191603 http://dx.doi.org/10.3389/fgene.2019.00467 Text en Copyright © 2019 Zhang, Zhang, Li, Li, Wei, Zhang and Liu. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
title A New Method of RNA Secondary Structure Prediction Based on Convolutional Neural Network and Dynamic Programming
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6540740/
https://www.ncbi.nlm.nih.gov/pubmed/31191603
http://dx.doi.org/10.3389/fgene.2019.00467