Cargando…
A novel method to identify pre-microRNA in various species knowledge base on various species
BACKGROUND: More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has th...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763424/ https://www.ncbi.nlm.nih.gov/pubmed/29297389 http://dx.doi.org/10.1186/s13326-017-0143-z |
_version_ | 1783291884936888320 |
---|---|
author | Zhao, Tianyi Zhang, Ningyi Zhang, Ying Ren, Jun Xu, Peigang Liu, Zhiyan Cheng, Liang Hu, Yang |
author_facet | Zhao, Tianyi Zhang, Ningyi Zhang, Ying Ren, Jun Xu, Peigang Liu, Zhiyan Cheng, Liang Hu, Yang |
author_sort | Zhao, Tianyi |
collection | PubMed |
description | BACKGROUND: More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has the defects of long period, high cost, and missing the miRNAs that but also many other algorithms only exist in a specific period or low expression level. Therefore, to overcome these defects, machine learning method is applied to identify miRNAs. RESULTS: In this study, for identifying real and pseudo miRNAs and classifying different species, we extracted 98 dimensional features based on the primary and secondary structure, then we proposed the BP-Adaboost method to figure out the overfitting phenomenon of BP neural network by constructing multiple BP neural network classifiers and distributed weights to these classifiers. The novel method we proposed, from the 4 evaluation terms, have achieved greatly improvement on the effect of identifying true pre-RNA compared to other methods. And from the respect of identifying species of pre-RNA, the novel method achieved more accuracy than other algorithms. CONCLUSIONS: The BP-Adaboost method has achieved more than 98% accuracy in identifying real and pseudo miRNAs. It is much higher than not only BP but also many other algorithms. In the second experiment, restricted by the data, the algorithm could not get high accuracy in identifying 7 species, but also better than other algorithms. |
format | Online Article Text |
id | pubmed-5763424 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-57634242018-01-17 A novel method to identify pre-microRNA in various species knowledge base on various species Zhao, Tianyi Zhang, Ningyi Zhang, Ying Ren, Jun Xu, Peigang Liu, Zhiyan Cheng, Liang Hu, Yang J Biomed Semantics Research BACKGROUND: More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has the defects of long period, high cost, and missing the miRNAs that but also many other algorithms only exist in a specific period or low expression level. Therefore, to overcome these defects, machine learning method is applied to identify miRNAs. RESULTS: In this study, for identifying real and pseudo miRNAs and classifying different species, we extracted 98 dimensional features based on the primary and secondary structure, then we proposed the BP-Adaboost method to figure out the overfitting phenomenon of BP neural network by constructing multiple BP neural network classifiers and distributed weights to these classifiers. The novel method we proposed, from the 4 evaluation terms, have achieved greatly improvement on the effect of identifying true pre-RNA compared to other methods. And from the respect of identifying species of pre-RNA, the novel method achieved more accuracy than other algorithms. CONCLUSIONS: The BP-Adaboost method has achieved more than 98% accuracy in identifying real and pseudo miRNAs. It is much higher than not only BP but also many other algorithms. In the second experiment, restricted by the data, the algorithm could not get high accuracy in identifying 7 species, but also better than other algorithms. BioMed Central 2017-09-20 /pmc/articles/PMC5763424/ /pubmed/29297389 http://dx.doi.org/10.1186/s13326-017-0143-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Zhao, Tianyi Zhang, Ningyi Zhang, Ying Ren, Jun Xu, Peigang Liu, Zhiyan Cheng, Liang Hu, Yang A novel method to identify pre-microRNA in various species knowledge base on various species |
title | A novel method to identify pre-microRNA in various species knowledge base on various species |
title_full | A novel method to identify pre-microRNA in various species knowledge base on various species |
title_fullStr | A novel method to identify pre-microRNA in various species knowledge base on various species |
title_full_unstemmed | A novel method to identify pre-microRNA in various species knowledge base on various species |
title_short | A novel method to identify pre-microRNA in various species knowledge base on various species |
title_sort | novel method to identify pre-microrna in various species knowledge base on various species |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763424/ https://www.ncbi.nlm.nih.gov/pubmed/29297389 http://dx.doi.org/10.1186/s13326-017-0143-z |
work_keys_str_mv | AT zhaotianyi anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT zhangningyi anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT zhangying anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT renjun anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT xupeigang anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT liuzhiyan anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT chengliang anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT huyang anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT zhaotianyi novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT zhangningyi novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT zhangying novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT renjun novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT xupeigang novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT liuzhiyan novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT chengliang novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies AT huyang novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies |