Cargando…

A novel method to identify pre-microRNA in various species knowledge base on various species

BACKGROUND: More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has th...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Tianyi, Zhang, Ningyi, Zhang, Ying, Ren, Jun, Xu, Peigang, Liu, Zhiyan, Cheng, Liang, Hu, Yang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763424/
https://www.ncbi.nlm.nih.gov/pubmed/29297389
http://dx.doi.org/10.1186/s13326-017-0143-z
_version_ 1783291884936888320
author Zhao, Tianyi
Zhang, Ningyi
Zhang, Ying
Ren, Jun
Xu, Peigang
Liu, Zhiyan
Cheng, Liang
Hu, Yang
author_facet Zhao, Tianyi
Zhang, Ningyi
Zhang, Ying
Ren, Jun
Xu, Peigang
Liu, Zhiyan
Cheng, Liang
Hu, Yang
author_sort Zhao, Tianyi
collection PubMed
description BACKGROUND: More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has the defects of long period, high cost, and missing the miRNAs that but also many other algorithms only exist in a specific period or low expression level. Therefore, to overcome these defects, machine learning method is applied to identify miRNAs. RESULTS: In this study, for identifying real and pseudo miRNAs and classifying different species, we extracted 98 dimensional features based on the primary and secondary structure, then we proposed the BP-Adaboost method to figure out the overfitting phenomenon of BP neural network by constructing multiple BP neural network classifiers and distributed weights to these classifiers. The novel method we proposed, from the 4 evaluation terms, have achieved greatly improvement on the effect of identifying true pre-RNA compared to other methods. And from the respect of identifying species of pre-RNA, the novel method achieved more accuracy than other algorithms. CONCLUSIONS: The BP-Adaboost method has achieved more than 98% accuracy in identifying real and pseudo miRNAs. It is much higher than not only BP but also many other algorithms. In the second experiment, restricted by the data, the algorithm could not get high accuracy in identifying 7 species, but also better than other algorithms.
format Online
Article
Text
id pubmed-5763424
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-57634242018-01-17 A novel method to identify pre-microRNA in various species knowledge base on various species Zhao, Tianyi Zhang, Ningyi Zhang, Ying Ren, Jun Xu, Peigang Liu, Zhiyan Cheng, Liang Hu, Yang J Biomed Semantics Research BACKGROUND: More than 1/3 of human genes are regulated by microRNAs. The identification of microRNA (miRNA) is the precondition of discovering the regulatory mechanism of miRNA and developing the cure for genetic diseases. The traditional identification method is biological experiment, but it has the defects of long period, high cost, and missing the miRNAs that but also many other algorithms only exist in a specific period or low expression level. Therefore, to overcome these defects, machine learning method is applied to identify miRNAs. RESULTS: In this study, for identifying real and pseudo miRNAs and classifying different species, we extracted 98 dimensional features based on the primary and secondary structure, then we proposed the BP-Adaboost method to figure out the overfitting phenomenon of BP neural network by constructing multiple BP neural network classifiers and distributed weights to these classifiers. The novel method we proposed, from the 4 evaluation terms, have achieved greatly improvement on the effect of identifying true pre-RNA compared to other methods. And from the respect of identifying species of pre-RNA, the novel method achieved more accuracy than other algorithms. CONCLUSIONS: The BP-Adaboost method has achieved more than 98% accuracy in identifying real and pseudo miRNAs. It is much higher than not only BP but also many other algorithms. In the second experiment, restricted by the data, the algorithm could not get high accuracy in identifying 7 species, but also better than other algorithms. BioMed Central 2017-09-20 /pmc/articles/PMC5763424/ /pubmed/29297389 http://dx.doi.org/10.1186/s13326-017-0143-z Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Zhao, Tianyi
Zhang, Ningyi
Zhang, Ying
Ren, Jun
Xu, Peigang
Liu, Zhiyan
Cheng, Liang
Hu, Yang
A novel method to identify pre-microRNA in various species knowledge base on various species
title A novel method to identify pre-microRNA in various species knowledge base on various species
title_full A novel method to identify pre-microRNA in various species knowledge base on various species
title_fullStr A novel method to identify pre-microRNA in various species knowledge base on various species
title_full_unstemmed A novel method to identify pre-microRNA in various species knowledge base on various species
title_short A novel method to identify pre-microRNA in various species knowledge base on various species
title_sort novel method to identify pre-microrna in various species knowledge base on various species
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5763424/
https://www.ncbi.nlm.nih.gov/pubmed/29297389
http://dx.doi.org/10.1186/s13326-017-0143-z
work_keys_str_mv AT zhaotianyi anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT zhangningyi anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT zhangying anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT renjun anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT xupeigang anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT liuzhiyan anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT chengliang anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT huyang anovelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT zhaotianyi novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT zhangningyi novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT zhangying novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT renjun novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT xupeigang novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT liuzhiyan novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT chengliang novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies
AT huyang novelmethodtoidentifypremicrornainvariousspeciesknowledgebaseonvariousspecies