Cargando…

Genetic Programming Based Ensemble System for Microarray Data Classification

Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP) based new ensemble system (named GPES), which can be used to effectively classify different types of cancers. Decision trees are deployed...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Kun-Hong, Tong, Muchenxuan, Xie, Shu-Tong, Yee Ng, Vincent To
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355811/
https://www.ncbi.nlm.nih.gov/pubmed/25810748
http://dx.doi.org/10.1155/2015/193406
_version_ 1782360909219364864
author Liu, Kun-Hong
Tong, Muchenxuan
Xie, Shu-Tong
Yee Ng, Vincent To
author_facet Liu, Kun-Hong
Tong, Muchenxuan
Xie, Shu-Tong
Yee Ng, Vincent To
author_sort Liu, Kun-Hong
collection PubMed
description Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP) based new ensemble system (named GPES), which can be used to effectively classify different types of cancers. Decision trees are deployed as base classifiers in this ensemble framework with three operators: Min, Max, and Average. Each individual of the GP is an ensemble system, and they become more and more accurate in the evolutionary process. The feature selection technique and balanced subsampling technique are applied to increase the diversity in each ensemble system. The final ensemble committee is selected by a forward search algorithm, which is shown to be capable of fitting data automatically. The performance of GPES is evaluated using five binary class and six multiclass microarray datasets, and results show that the algorithm can achieve better results in most cases compared with some other ensemble systems. By using elaborate base classifiers or applying other sampling techniques, the performance of GPES may be further improved.
format Online
Article
Text
id pubmed-4355811
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-43558112015-03-25 Genetic Programming Based Ensemble System for Microarray Data Classification Liu, Kun-Hong Tong, Muchenxuan Xie, Shu-Tong Yee Ng, Vincent To Comput Math Methods Med Research Article Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP) based new ensemble system (named GPES), which can be used to effectively classify different types of cancers. Decision trees are deployed as base classifiers in this ensemble framework with three operators: Min, Max, and Average. Each individual of the GP is an ensemble system, and they become more and more accurate in the evolutionary process. The feature selection technique and balanced subsampling technique are applied to increase the diversity in each ensemble system. The final ensemble committee is selected by a forward search algorithm, which is shown to be capable of fitting data automatically. The performance of GPES is evaluated using five binary class and six multiclass microarray datasets, and results show that the algorithm can achieve better results in most cases compared with some other ensemble systems. By using elaborate base classifiers or applying other sampling techniques, the performance of GPES may be further improved. Hindawi Publishing Corporation 2015 2015-02-25 /pmc/articles/PMC4355811/ /pubmed/25810748 http://dx.doi.org/10.1155/2015/193406 Text en Copyright © 2015 Kun-Hong Liu et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Liu, Kun-Hong
Tong, Muchenxuan
Xie, Shu-Tong
Yee Ng, Vincent To
Genetic Programming Based Ensemble System for Microarray Data Classification
title Genetic Programming Based Ensemble System for Microarray Data Classification
title_full Genetic Programming Based Ensemble System for Microarray Data Classification
title_fullStr Genetic Programming Based Ensemble System for Microarray Data Classification
title_full_unstemmed Genetic Programming Based Ensemble System for Microarray Data Classification
title_short Genetic Programming Based Ensemble System for Microarray Data Classification
title_sort genetic programming based ensemble system for microarray data classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355811/
https://www.ncbi.nlm.nih.gov/pubmed/25810748
http://dx.doi.org/10.1155/2015/193406
work_keys_str_mv AT liukunhong geneticprogrammingbasedensemblesystemformicroarraydataclassification
AT tongmuchenxuan geneticprogrammingbasedensemblesystemformicroarraydataclassification
AT xieshutong geneticprogrammingbasedensemblesystemformicroarraydataclassification
AT yeengvincentto geneticprogrammingbasedensemblesystemformicroarraydataclassification