Cargando…
Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance
MicroRNAs (miRNAs) play important roles in the diverse biological processes of animals and plants. Although the prediction methods based on machine learning can identify nonhomologous and species-specific miRNAs, they suffered from severe class imbalance on real and pseudo pre-miRNAs. We propose a p...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi Publishing Corporation
2015
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657081/ https://www.ncbi.nlm.nih.gov/pubmed/26640803 http://dx.doi.org/10.1155/2015/960108 |
_version_ | 1782402332242214912 |
---|---|
author | Zhong, Yingli Xuan, Ping Han, Ke Zhang, Weiping Li, Jianzhong |
author_facet | Zhong, Yingli Xuan, Ping Han, Ke Zhang, Weiping Li, Jianzhong |
author_sort | Zhong, Yingli |
collection | PubMed |
description | MicroRNAs (miRNAs) play important roles in the diverse biological processes of animals and plants. Although the prediction methods based on machine learning can identify nonhomologous and species-specific miRNAs, they suffered from severe class imbalance on real and pseudo pre-miRNAs. We propose a pre-miRNA classification method based on cost-sensitive ensemble learning and refer to it as MiRNAClassify. Through a series of iterations, the information of all the positive and negative samples is completely exploited. In each iteration, a new classification instance is trained by the equal number of positive and negative samples. In this way, the negative effect of class imbalance is efficiently relieved. The new instance primarily focuses on those samples that are easy to be misclassified. In addition, the positive samples are assigned higher cost weight than the negative samples. MiRNAClassify is compared with several state-of-the-art methods and some well-known classification models by testing the datasets about human, animal, and plant. The result of cross validation indicates that MiRNAClassify significantly outperforms other methods and models. In addition, the newly added pre-miRNAs are used to further evaluate the ability of these methods to discover novel pre-miRNAs. MiRNAClassify still achieves consistently superior performance and can discover more pre-miRNAs. |
format | Online Article Text |
id | pubmed-4657081 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2015 |
publisher | Hindawi Publishing Corporation |
record_format | MEDLINE/PubMed |
spelling | pubmed-46570812015-12-06 Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance Zhong, Yingli Xuan, Ping Han, Ke Zhang, Weiping Li, Jianzhong Biomed Res Int Research Article MicroRNAs (miRNAs) play important roles in the diverse biological processes of animals and plants. Although the prediction methods based on machine learning can identify nonhomologous and species-specific miRNAs, they suffered from severe class imbalance on real and pseudo pre-miRNAs. We propose a pre-miRNA classification method based on cost-sensitive ensemble learning and refer to it as MiRNAClassify. Through a series of iterations, the information of all the positive and negative samples is completely exploited. In each iteration, a new classification instance is trained by the equal number of positive and negative samples. In this way, the negative effect of class imbalance is efficiently relieved. The new instance primarily focuses on those samples that are easy to be misclassified. In addition, the positive samples are assigned higher cost weight than the negative samples. MiRNAClassify is compared with several state-of-the-art methods and some well-known classification models by testing the datasets about human, animal, and plant. The result of cross validation indicates that MiRNAClassify significantly outperforms other methods and models. In addition, the newly added pre-miRNAs are used to further evaluate the ability of these methods to discover novel pre-miRNAs. MiRNAClassify still achieves consistently superior performance and can discover more pre-miRNAs. Hindawi Publishing Corporation 2015 2015-11-10 /pmc/articles/PMC4657081/ /pubmed/26640803 http://dx.doi.org/10.1155/2015/960108 Text en Copyright © 2015 Yingli Zhong et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhong, Yingli Xuan, Ping Han, Ke Zhang, Weiping Li, Jianzhong Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance |
title | Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance |
title_full | Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance |
title_fullStr | Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance |
title_full_unstemmed | Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance |
title_short | Improved Pre-miRNA Classification by Reducing the Effect of Class Imbalance |
title_sort | improved pre-mirna classification by reducing the effect of class imbalance |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4657081/ https://www.ncbi.nlm.nih.gov/pubmed/26640803 http://dx.doi.org/10.1155/2015/960108 |
work_keys_str_mv | AT zhongyingli improvedpremirnaclassificationbyreducingtheeffectofclassimbalance AT xuanping improvedpremirnaclassificationbyreducingtheeffectofclassimbalance AT hanke improvedpremirnaclassificationbyreducingtheeffectofclassimbalance AT zhangweiping improvedpremirnaclassificationbyreducingtheeffectofclassimbalance AT lijianzhong improvedpremirnaclassificationbyreducingtheeffectofclassimbalance |