Cargando…

Prediction of Long Non-Coding RNAs Based on Deep Learning

With the rapid development of high-throughput sequencing technology, a large number of transcript sequences have been discovered, and how to identify long non-coding RNAs (lncRNAs) from transcripts is a challenging task. The identification and inclusion of lncRNAs not only can more clearly help us t...

Descripción completa

Detalles Bibliográficos
Autores principales:	Liu, Xiu-Qin, Li, Bing-Xiu, Zeng, Guan-Rong, Liu, Qiao-Yue, Ai, Dong-Mei
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	MDPI 2019
Materias:	Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6523782/ https://www.ncbi.nlm.nih.gov/pubmed/30987229 http://dx.doi.org/10.3390/genes10040273

_version_	1783419414112108544
author	Liu, Xiu-Qin Li, Bing-Xiu Zeng, Guan-Rong Liu, Qiao-Yue Ai, Dong-Mei
author_facet	Liu, Xiu-Qin Li, Bing-Xiu Zeng, Guan-Rong Liu, Qiao-Yue Ai, Dong-Mei
author_sort	Liu, Xiu-Qin
collection	PubMed
description	With the rapid development of high-throughput sequencing technology, a large number of transcript sequences have been discovered, and how to identify long non-coding RNAs (lncRNAs) from transcripts is a challenging task. The identification and inclusion of lncRNAs not only can more clearly help us to understand life activities themselves, but can also help humans further explore and study the disease at the molecular level. At present, the detection of lncRNAs mainly includes two forms of calculation and experiment. Due to the limitations of bio sequencing technology and ineluctable errors in sequencing processes, the detection effect of these methods is not very satisfactory. In this paper, we constructed a deep-learning model to effectively distinguish lncRNAs from mRNAs. We used k-mer embedding vectors obtained through training the GloVe algorithm as input features and set up the deep learning framework to include a bidirectional long short-term memory model (BLSTM) layer and a convolutional neural network (CNN) layer with three additional hidden layers. By testing our model, we have found that it obtained the best values of 97.9%, 96.4% and 99.0% in F1score, accuracy and auROC, respectively, which showed better classification performance than the traditional PLEK, CNCI and CPC methods for identifying lncRNAs. We hope that our model will provide effective help in distinguishing mature mRNAs from lncRNAs, and become a potential tool to help humans understand and detect the diseases associated with lncRNAs.
format	Online Article Text
id	pubmed-6523782
institution	National Center for Biotechnology Information
language	English
publishDate	2019
publisher	MDPI
record_format	MEDLINE/PubMed
spelling	pubmed-65237822019-06-03 Prediction of Long Non-Coding RNAs Based on Deep Learning Liu, Xiu-Qin Li, Bing-Xiu Zeng, Guan-Rong Liu, Qiao-Yue Ai, Dong-Mei Genes (Basel) Article With the rapid development of high-throughput sequencing technology, a large number of transcript sequences have been discovered, and how to identify long non-coding RNAs (lncRNAs) from transcripts is a challenging task. The identification and inclusion of lncRNAs not only can more clearly help us to understand life activities themselves, but can also help humans further explore and study the disease at the molecular level. At present, the detection of lncRNAs mainly includes two forms of calculation and experiment. Due to the limitations of bio sequencing technology and ineluctable errors in sequencing processes, the detection effect of these methods is not very satisfactory. In this paper, we constructed a deep-learning model to effectively distinguish lncRNAs from mRNAs. We used k-mer embedding vectors obtained through training the GloVe algorithm as input features and set up the deep learning framework to include a bidirectional long short-term memory model (BLSTM) layer and a convolutional neural network (CNN) layer with three additional hidden layers. By testing our model, we have found that it obtained the best values of 97.9%, 96.4% and 99.0% in F1score, accuracy and auROC, respectively, which showed better classification performance than the traditional PLEK, CNCI and CPC methods for identifying lncRNAs. We hope that our model will provide effective help in distinguishing mature mRNAs from lncRNAs, and become a potential tool to help humans understand and detect the diseases associated with lncRNAs. MDPI 2019-04-03 /pmc/articles/PMC6523782/ /pubmed/30987229 http://dx.doi.org/10.3390/genes10040273 Text en © 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
spellingShingle	Article Liu, Xiu-Qin Li, Bing-Xiu Zeng, Guan-Rong Liu, Qiao-Yue Ai, Dong-Mei Prediction of Long Non-Coding RNAs Based on Deep Learning
title	Prediction of Long Non-Coding RNAs Based on Deep Learning
title_full	Prediction of Long Non-Coding RNAs Based on Deep Learning
title_fullStr	Prediction of Long Non-Coding RNAs Based on Deep Learning
title_full_unstemmed	Prediction of Long Non-Coding RNAs Based on Deep Learning
title_short	Prediction of Long Non-Coding RNAs Based on Deep Learning
title_sort	prediction of long non-coding rnas based on deep learning
topic	Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6523782/ https://www.ncbi.nlm.nih.gov/pubmed/30987229 http://dx.doi.org/10.3390/genes10040273
work_keys_str_mv	AT liuxiuqin predictionoflongnoncodingrnasbasedondeeplearning AT libingxiu predictionoflongnoncodingrnasbasedondeeplearning AT zengguanrong predictionoflongnoncodingrnasbasedondeeplearning AT liuqiaoyue predictionoflongnoncodingrnasbasedondeeplearning AT aidongmei predictionoflongnoncodingrnasbasedondeeplearning

Prediction of Long Non-Coding RNAs Based on Deep Learning

Ejemplares similares