CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks
MicroRNAs (miRNAs) are a major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, identifying miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containi...
Main authors: | Wang, Huiqing; Ma, Yue; Dong, Chunlin; Li, Chun; Wang, Jingjing; Liu, Dan |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | Frontiers Media S.A. 2019 |
Subjects: | Genetics |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798641/ https://www.ncbi.nlm.nih.gov/pubmed/31681416 http://dx.doi.org/10.3389/fgene.2019.00967 |
_version_ | 1783460091074183168 |
---|---|
author | Wang, Huiqing Ma, Yue Dong, Chunlin Li, Chun Wang, Jingjing Liu, Dan |
author_facet | Wang, Huiqing Ma, Yue Dong, Chunlin Li, Chun Wang, Jingjing Liu, Dan |
author_sort | Wang, Huiqing |
collection | PubMed |
description | MicroRNAs (miRNAs) are a major class of gene-regulating molecules that bind mRNAs. They function mainly as translational repressors in mammals. Therefore, identifying miRNAs is one of the most important problems in medical treatment. Many known pre-miRNAs have a hairpin ring structure containing more structural features, and it is difficult to identify mature miRNAs because of their short length. Most research therefore focuses on the identification of pre-miRNAs. Most computational models rely on manual feature extraction to identify pre-miRNAs and do not consider the sequential and spatial characteristics of pre-miRNAs, resulting in a loss of information. Because unidentified pre-miRNAs far outnumber known pre-miRNAs, the datasets are imbalanced, which degrades the performance of pre-miRNA identification methods. To overcome the limitations of existing methods, we propose a pre-miRNA identification algorithm based on a cascaded CNN-LSTM framework, called CL-PMI. We used a convolutional neural network (CNN) to automatically extract features and obtain pre-miRNA spatial information. We also employed long short-term memory (LSTM) to capture temporal characteristics of pre-miRNAs and improve attention mechanisms for long-term dependence modeling. Focal loss was used to mitigate the dataset imbalance. Compared with existing methods, CL-PMI achieved better performance on all datasets. The results demonstrate that this method can effectively identify pre-miRNAs by simultaneously considering their spatial and sequential information, as well as dealing with imbalance in the datasets. |
format | Online Article Text |
id | pubmed-6798641 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-6798641 2019-11-01 Front Genet Genetics Frontiers Media S.A. 2019-10-11 /pmc/articles/PMC6798641/ /pubmed/31681416 http://dx.doi.org/10.3389/fgene.2019.00967 Text en Copyright © 2019 Wang, Ma, Dong, Li, Wang and Liu http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Genetics Wang, Huiqing Ma, Yue Dong, Chunlin Li, Chun Wang, Jingjing Liu, Dan CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks |
title | CL-PMI: A Precursor MicroRNA Identification Method Based on Convolutional and Long Short-Term Memory Networks |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6798641/ https://www.ncbi.nlm.nih.gov/pubmed/31681416 http://dx.doi.org/10.3389/fgene.2019.00967 |
work_keys_str_mv | AT wanghuiqing clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks AT mayue clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks AT dongchunlin clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks AT lichun clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks AT wangjingjing clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks AT liudan clpmiaprecursormicrornaidentificationmethodbasedonconvolutionalandlongshorttermmemorynetworks |
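
The description in this record states that focal loss was used to address the imbalance between known and unidentified pre-miRNAs. As a rough illustration of that idea only, the sketch below computes the standard binary focal loss, FL(p_t) = -alpha_t (1 - p_t)^gamma log(p_t). The function name `binary_focal_loss` and the defaults `gamma=2.0` and `alpha=0.25` are assumptions chosen for illustration; the record does not give the settings or the implementation used in CL-PMI.

```python
import math


def binary_focal_loss(y_true, p_pred, gamma=2.0, alpha=0.25, eps=1e-7):
    """Mean binary focal loss over a batch.

    y_true: iterable of 0/1 labels (1 = pre-miRNA, by assumption).
    p_pred: iterable of predicted probabilities for the positive class.
    gamma, alpha: illustrative defaults, not values from the CL-PMI paper.
    """
    y_true = list(y_true)
    p_pred = list(p_pred)
    if len(y_true) != len(p_pred) or not y_true:
        raise ValueError("y_true and p_pred must be non-empty and equal length")

    total = 0.0
    for y, p in zip(y_true, p_pred):
        # Clip to avoid log(0).
        p = min(max(p, eps), 1.0 - eps)
        # p_t is the probability assigned to the true class.
        p_t = p if y == 1 else 1.0 - p
        # alpha_t weights the positive and negative classes differently.
        alpha_t = alpha if y == 1 else 1.0 - alpha
        # (1 - p_t)^gamma shrinks the loss of well-classified examples.
        total += -alpha_t * (1.0 - p_t) ** gamma * math.log(p_t)
    return total / len(y_true)


if __name__ == "__main__":
    # A confidently correct prediction contributes far less than a wrong one.
    print(binary_focal_loss([1], [0.9]))
    print(binary_focal_loss([1], [0.1]))
```

With `gamma=0` and `alpha=0.5` the expression reduces to half the ordinary cross-entropy, which makes the down-weighting effect of `gamma` easy to check against a familiar baseline.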