Cargando…

Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance

MicroRNAs (miRNAs) play important roles in the diverse biological processes of animals and plants. Although the prediction methods based on machine learning can identify nonhomologous and species-specific miRNAs, they suffered from severe class imbalance on real and pseudo pre-miRNAs. We propose a p...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhong, Yingli, Xuan, Ping, Han, Ke, Zhang, Weiping, Li, Jianzhong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657081/
https://www.ncbi.nlm.nih.gov/pubmed/26640803
http://dx.doi.org/10.1155/2015/960108
_version_ 1782402332242214912
author Zhong, Yingli
Xuan, Ping
Han, Ke
Zhang, Weiping
Li, Jianzhong
author_facet Zhong, Yingli
Xuan, Ping
Han, Ke
Zhang, Weiping
Li, Jianzhong
author_sort Zhong, Yingli
collection PubMed
description MicroRNAs (miRNAs) play important roles in the diverse biological processes of animals and plants. Although the prediction methods based on machine learning can identify nonhomologous and species-specific miRNAs, they suffered from severe class imbalance on real and pseudo pre-miRNAs. We propose a pre-miRNA classification method based on cost-sensitive ensemble learning and refer to it as MiRNAClassify. Through a series of iterations, the information of all the positive and negative samples is completely exploited. In each iteration, a new classification instance is trained by the equal number of positive and negative samples. In this way, the negative effect of class imbalance is efficiently relieved. The new instance primarily focuses on those samples that are easy to be misclassified. In addition, the positive samples are assigned higher cost weight than the negative samples. MiRNAClassify is compared with several state-of-the-art methods and some well-known classification models by testing the datasets about human, animal, and plant. The result of cross validation indicates that MiRNAClassify significantly outperforms other methods and models. In addition, the newly added pre-miRNAs are used to further evaluate the ability of these methods to discover novel pre-miRNAs. MiRNAClassify still achieves consistently superior performance and can discover more pre-miRNAs.
format Online
Article
Text
id pubmed-4657081
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-46570812015-12-06 Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance Zhong, Yingli Xuan, Ping Han, Ke Zhang, Weiping Li, Jianzhong Biomed Res Int Research Article MicroRNAs (miRNAs) play important roles in the diverse biological processes of animals and plants. Although the prediction methods based on machine learning can identify nonhomologous and species-specific miRNAs, they suffered from severe class imbalance on real and pseudo pre-miRNAs. We propose a pre-miRNA classification method based on cost-sensitive ensemble learning and refer to it as MiRNAClassify. Through a series of iterations, the information of all the positive and negative samples is completely exploited. In each iteration, a new classification instance is trained by the equal number of positive and negative samples. In this way, the negative effect of class imbalance is efficiently relieved. The new instance primarily focuses on those samples that are easy to be misclassified. In addition, the positive samples are assigned higher cost weight than the negative samples. MiRNAClassify is compared with several state-of-the-art methods and some well-known classification models by testing the datasets about human, animal, and plant. The result of cross validation indicates that MiRNAClassify significantly outperforms other methods and models. In addition, the newly added pre-miRNAs are used to further evaluate the ability of these methods to discover novel pre-miRNAs. MiRNAClassify still achieves consistently superior performance and can discover more pre-miRNAs. Hindawi Publishing Corporation 2015 2015-11-10 /pmc/articles/PMC4657081/ /pubmed/26640803 http://dx.doi.org/10.1155/2015/960108 Text en Copyright © 2015 Yingli Zhong et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhong, Yingli
Xuan, Ping
Han, Ke
Zhang, Weiping
Li, Jianzhong
Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
title Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
title_full Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
title_fullStr Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
title_full_unstemmed Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
title_short Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
title_sort improved pre-mirna classification by reducing the effect of class imbalance
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657081/
https://www.ncbi.nlm.nih.gov/pubmed/26640803
http://dx.doi.org/10.1155/2015/960108
work_keys_str_mv AT zhongyingli improvedpremirnaclassificationbyreducingtheeffectofclassimbalance
AT xuanping improvedpremirnaclassificationbyreducingtheeffectofclassimbalance
AT hanke improvedpremirnaclassificationbyreducingtheeffectofclassimbalance
AT zhangweiping improvedpremirnaclassificationbyreducingtheeffectofclassimbalance
AT lijianzhong improvedpremirnaclassificationbyreducingtheeffectofclassimbalance