Cargando…

Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm

BACKGROUND: In the application of microarray data, how to select a small number of informative genes from thousands of genes that may contribute to the occurrence of cancers is an important issue. Many researchers use various computational intelligence methods to analyzed gene expression data. RESUL...

Descripción completa

Detalles Bibliográficos
Autores principales: Chen, Kun-Huang, Wang, Kung-Jeng, Tsai, Min-Lung, Wang, Kung-Min, Adrian, Angelia Melani, Cheng, Wei-Chung, Yang, Tzu-Sen, Teng, Nai-Chia, Tan, Kuo-Pin, Chang, Ku-Shang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3944936/
https://www.ncbi.nlm.nih.gov/pubmed/24555567
http://dx.doi.org/10.1186/1471-2105-15-49
_version_ 1782306459479965696
author Chen, Kun-Huang
Wang, Kung-Jeng
Tsai, Min-Lung
Wang, Kung-Min
Adrian, Angelia Melani
Cheng, Wei-Chung
Yang, Tzu-Sen
Teng, Nai-Chia
Tan, Kuo-Pin
Chang, Ku-Shang
author_facet Chen, Kun-Huang
Wang, Kung-Jeng
Tsai, Min-Lung
Wang, Kung-Min
Adrian, Angelia Melani
Cheng, Wei-Chung
Yang, Tzu-Sen
Teng, Nai-Chia
Tan, Kuo-Pin
Chang, Ku-Shang
author_sort Chen, Kun-Huang
collection PubMed
description BACKGROUND: In the application of microarray data, how to select a small number of informative genes from thousands of genes that may contribute to the occurrence of cancers is an important issue. Many researchers use various computational intelligence methods to analyzed gene expression data. RESULTS: To achieve efficient gene selection from thousands of candidate genes that can contribute in identifying cancers, this study aims at developing a novel method utilizing particle swarm optimization combined with a decision tree as the classifier. This study also compares the performance of our proposed method with other well-known benchmark classification methods (support vector machine, self-organizing map, back propagation neural network, C4.5 decision tree, Naive Bayes, CART decision tree, and artificial immune recognition system) and conducts experiments on 11 gene expression cancer datasets. CONCLUSION: Based on statistical analysis, our proposed method outperforms other popular classifiers for all test datasets, and is compatible to SVM for certain specific datasets. Further, the housekeeping genes with various expression patterns and tissue-specific genes are identified. These genes provide a high discrimination power on cancer classification.
format Online
Article
Text
id pubmed-3944936
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-39449362014-03-17 Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm Chen, Kun-Huang Wang, Kung-Jeng Tsai, Min-Lung Wang, Kung-Min Adrian, Angelia Melani Cheng, Wei-Chung Yang, Tzu-Sen Teng, Nai-Chia Tan, Kuo-Pin Chang, Ku-Shang BMC Bioinformatics Methodology Article BACKGROUND: In the application of microarray data, how to select a small number of informative genes from thousands of genes that may contribute to the occurrence of cancers is an important issue. Many researchers use various computational intelligence methods to analyzed gene expression data. RESULTS: To achieve efficient gene selection from thousands of candidate genes that can contribute in identifying cancers, this study aims at developing a novel method utilizing particle swarm optimization combined with a decision tree as the classifier. This study also compares the performance of our proposed method with other well-known benchmark classification methods (support vector machine, self-organizing map, back propagation neural network, C4.5 decision tree, Naive Bayes, CART decision tree, and artificial immune recognition system) and conducts experiments on 11 gene expression cancer datasets. CONCLUSION: Based on statistical analysis, our proposed method outperforms other popular classifiers for all test datasets, and is compatible to SVM for certain specific datasets. Further, the housekeeping genes with various expression patterns and tissue-specific genes are identified. These genes provide a high discrimination power on cancer classification. BioMed Central 2014-02-20 /pmc/articles/PMC3944936/ /pubmed/24555567 http://dx.doi.org/10.1186/1471-2105-15-49 Text en Copyright © 2014 Chen et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited.
spellingShingle Methodology Article
Chen, Kun-Huang
Wang, Kung-Jeng
Tsai, Min-Lung
Wang, Kung-Min
Adrian, Angelia Melani
Cheng, Wei-Chung
Yang, Tzu-Sen
Teng, Nai-Chia
Tan, Kuo-Pin
Chang, Ku-Shang
Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
title Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
title_full Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
title_fullStr Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
title_full_unstemmed Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
title_short Gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
title_sort gene selection for cancer identification: a decision tree model empowered by particle swarm optimization algorithm
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3944936/
https://www.ncbi.nlm.nih.gov/pubmed/24555567
http://dx.doi.org/10.1186/1471-2105-15-49
work_keys_str_mv AT chenkunhuang geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT wangkungjeng geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT tsaiminlung geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT wangkungmin geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT adrianangeliamelani geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT chengweichung geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT yangtzusen geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT tengnaichia geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT tankuopin geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm
AT changkushang geneselectionforcanceridentificationadecisiontreemodelempoweredbyparticleswarmoptimizationalgorithm