Cargando…

Selecting Essential MicroRNAs Using a Novel Voting Method

Among the large number of known microRNAs (miRNAs), some miRNAs play negligible roles in cell regulation. Therefore, selecting essential miRNAs is an important initial step for a deeper understanding of miRNAs and their functions. In this study, we generated 60 classification models by combining 12...

Descripción completa

Detalles Bibliográficos
Autores principales: Ru, Xiaoqing, Cao, Peigang, Li, Lihong, Zou, Quan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Society of Gene & Cell Therapy 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6727015/
https://www.ncbi.nlm.nih.gov/pubmed/31479921
http://dx.doi.org/10.1016/j.omtn.2019.07.019
_version_ 1783449185195917312
author Ru, Xiaoqing
Cao, Peigang
Li, Lihong
Zou, Quan
author_facet Ru, Xiaoqing
Cao, Peigang
Li, Lihong
Zou, Quan
author_sort Ru, Xiaoqing
collection PubMed
description Among the large number of known microRNAs (miRNAs), some miRNAs play negligible roles in cell regulation. Therefore, selecting essential miRNAs is an important initial step for a deeper understanding of miRNAs and their functions. In this study, we generated 60 classification models by combining 12 representative feature extraction methods and 5 commonly used classification algorithms. The optimal model for essential miRNA classification that we obtained is based on the Mismatch feature extraction method combined with the random forest algorithm. The F-Measure, area under the curve, and accuracy values of this model were 93.2%, 96.7%, and 93.0%, respectively. We also found that the distribution of the positive and negative examples of the first few features greatly influenced the classification results. The feature extraction methods performed best when the differences between the positive and negative examples were obvious, and this led to better classification of essential miRNAs. Because each classifier’s predictions for the same sample may be different, we employed a novel voting method to improve the accuracy of the classification of essential miRNAs. The performance results showed that the best classification results were obtained when five classification models were used in the voting. The five classification models were constructed based on the Mismatch, pseudo-distance structure status pair composition, Subsequence, Kmer, and Triplet feature extraction methods. The voting result was 95.3%. Our results suggest that the voting method can be an important tool for selecting essential miRNAs.
format Online
Article
Text
id pubmed-6727015
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher American Society of Gene & Cell Therapy
record_format MEDLINE/PubMed
spelling pubmed-67270152019-09-10 Selecting Essential MicroRNAs Using a Novel Voting Method Ru, Xiaoqing Cao, Peigang Li, Lihong Zou, Quan Mol Ther Nucleic Acids Article Among the large number of known microRNAs (miRNAs), some miRNAs play negligible roles in cell regulation. Therefore, selecting essential miRNAs is an important initial step for a deeper understanding of miRNAs and their functions. In this study, we generated 60 classification models by combining 12 representative feature extraction methods and 5 commonly used classification algorithms. The optimal model for essential miRNA classification that we obtained is based on the Mismatch feature extraction method combined with the random forest algorithm. The F-Measure, area under the curve, and accuracy values of this model were 93.2%, 96.7%, and 93.0%, respectively. We also found that the distribution of the positive and negative examples of the first few features greatly influenced the classification results. The feature extraction methods performed best when the differences between the positive and negative examples were obvious, and this led to better classification of essential miRNAs. Because each classifier’s predictions for the same sample may be different, we employed a novel voting method to improve the accuracy of the classification of essential miRNAs. The performance results showed that the best classification results were obtained when five classification models were used in the voting. The five classification models were constructed based on the Mismatch, pseudo-distance structure status pair composition, Subsequence, Kmer, and Triplet feature extraction methods. The voting result was 95.3%. Our results suggest that the voting method can be an important tool for selecting essential miRNAs. American Society of Gene & Cell Therapy 2019-08-05 /pmc/articles/PMC6727015/ /pubmed/31479921 http://dx.doi.org/10.1016/j.omtn.2019.07.019 Text en © 2019 The Author(s) http://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Article
Ru, Xiaoqing
Cao, Peigang
Li, Lihong
Zou, Quan
Selecting Essential MicroRNAs Using a Novel Voting Method
title Selecting Essential MicroRNAs Using a Novel Voting Method
title_full Selecting Essential MicroRNAs Using a Novel Voting Method
title_fullStr Selecting Essential MicroRNAs Using a Novel Voting Method
title_full_unstemmed Selecting Essential MicroRNAs Using a Novel Voting Method
title_short Selecting Essential MicroRNAs Using a Novel Voting Method
title_sort selecting essential micrornas using a novel voting method
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6727015/
https://www.ncbi.nlm.nih.gov/pubmed/31479921
http://dx.doi.org/10.1016/j.omtn.2019.07.019
work_keys_str_mv AT ruxiaoqing selectingessentialmicrornasusinganovelvotingmethod
AT caopeigang selectingessentialmicrornasusinganovelvotingmethod
AT lilihong selectingessentialmicrornasusinganovelvotingmethod
AT zouquan selectingessentialmicrornasusinganovelvotingmethod