Cargando…

CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks

MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containi...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Huiqing, Ma, Yue, Dong, Chunlin, Li, Chun, Wang, Jingjing, Liu, Dan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798641/
https://www.ncbi.nlm.nih.gov/pubmed/31681416
http://dx.doi.org/10.3389/fgene.2019.00967
_version_ 1783460091074183168
author Wang, Huiqing
Ma, Yue
Dong, Chunlin
Li, Chun
Wang, Jingjing
Liu, Dan
author_facet Wang, Huiqing
Ma, Yue
Dong, Chunlin
Li, Chun
Wang, Jingjing
Liu, Dan
author_sort Wang, Huiqing
collection PubMed
description MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets.
format Online
Article
Text
id pubmed-6798641
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-67986412019-11-01 CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks Wang, Huiqing Ma, Yue Dong, Chunlin Li, Chun Wang, Jingjing Liu, Dan Front Genet Genetics MicroRNAs (miRNAs) are the major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, how to identify miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Therefore, most research focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. As the number of unidentified pre-miRNAs is far greater than that of known pre-miRNAs, there is a dataset imbalance problem, which leads to a degradation of the performance of pre-miRNA identification methods. In order to overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture time characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to improve the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets. Frontiers Media S.A. 2019-10-11 /pmc/articles/PMC6798641/ /pubmed/31681416 http://dx.doi.org/10.3389/fgene.2019.00967 Text en Copyright © 2019 Wang, Ma, Dong, Li, Wang and Liu http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Wang, Huiqing
Ma, Yue
Dong, Chunlin
Li, Chun
Wang, Jingjing
Liu, Dan
CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
title CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
title_full CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
title_fullStr CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
title_full_unstemmed CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
title_short CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
title_sort cl-pmi: a precursor microrna identification method based on convolutional and long short-term memory networks
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798641/
https://www.ncbi.nlm.nih.gov/pubmed/31681416
http://dx.doi.org/10.3389/fgene.2019.00967
work_keys_str_mv AT wanghuiqing clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks
AT mayue clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks
AT dongchunlin clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks
AT lichun clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks
AT wangjingjing clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks
AT liudan clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks