Cargando…
A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification
For the DNA microarray datasets, tumor classification based on gene expression profiles has drawn great attention, and gene selection plays a significant role in improving the classification performance of microarray data. In this study, an effective hybrid gene selection method based on ReliefF and...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586811/ https://www.ncbi.nlm.nih.gov/pubmed/31222027 http://dx.doi.org/10.1038/s41598-019-45223-x |
_version_ | 1783428948354400256 |
---|---|
author | Sun, Lin Kong, Xianglin Xu, Jiucheng Xue, Zhan’ao Zhai, Ruibing Zhang, Shiguang |
author_facet | Sun, Lin Kong, Xianglin Xu, Jiucheng Xue, Zhan’ao Zhai, Ruibing Zhang, Shiguang |
author_sort | Sun, Lin |
collection | PubMed |
description | For the DNA microarray datasets, tumor classification based on gene expression profiles has drawn great attention, and gene selection plays a significant role in improving the classification performance of microarray data. In this study, an effective hybrid gene selection method based on ReliefF and Ant colony optimization (ACO) algorithm for tumor classification is proposed. First, for the ReliefF algorithm, the average distance among k nearest or k non-nearest neighbor samples are introduced to estimate the difference among samples, based on which the distances between the samples in the same class or the different classes are defined, and then it can more effectively evaluate the weight values of genes for samples. To obtain the stable results in emergencies, a distance coefficient is developed to construct a new formula of updating weight coefficient of genes to further reduce the instability during calculations. When decreasing the distance between the same samples and increasing the distance between the different samples, the weight division is more obvious. Thus, the ReliefF algorithm can be improved to reduce the initial dimensionality of gene expression datasets and obtain a candidate gene subset. Second, a new pruning rule is designed to reduce dimensionality and obtain a new candidate subset with the smaller number of genes. The probability formula of the next point in the path selected by the ants is presented to highlight the closeness of the correlation relationship between the reaction variables. To increase the pheromone concentration of important genes, a new phenotype updating formula of the ACO algorithm is adopted to prevent the pheromone left by the ants that are overwhelmed with time, and then the weight coefficients of the genes are applied here to eliminate the interference of difference data as much as possible. It follows that the improved ACO algorithm has the ability of the strong positive feedback, which quickly converges to an optimal solution through the accumulation and the updating of pheromone. Finally, by combining the improved ReliefF algorithm and the improved ACO method, a hybrid filter-wrapper-based gene selection algorithm called as RFACO-GS is proposed. The experimental results under several public gene expression datasets demonstrate that the proposed method is very effective, which can significantly reduce the dimensionality of gene expression datasets, and select the most relevant genes with high classification accuracy. |
format | Online Article Text |
id | pubmed-6586811 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-65868112019-06-27 A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification Sun, Lin Kong, Xianglin Xu, Jiucheng Xue, Zhan’ao Zhai, Ruibing Zhang, Shiguang Sci Rep Article For the DNA microarray datasets, tumor classification based on gene expression profiles has drawn great attention, and gene selection plays a significant role in improving the classification performance of microarray data. In this study, an effective hybrid gene selection method based on ReliefF and Ant colony optimization (ACO) algorithm for tumor classification is proposed. First, for the ReliefF algorithm, the average distance among k nearest or k non-nearest neighbor samples are introduced to estimate the difference among samples, based on which the distances between the samples in the same class or the different classes are defined, and then it can more effectively evaluate the weight values of genes for samples. To obtain the stable results in emergencies, a distance coefficient is developed to construct a new formula of updating weight coefficient of genes to further reduce the instability during calculations. When decreasing the distance between the same samples and increasing the distance between the different samples, the weight division is more obvious. Thus, the ReliefF algorithm can be improved to reduce the initial dimensionality of gene expression datasets and obtain a candidate gene subset. Second, a new pruning rule is designed to reduce dimensionality and obtain a new candidate subset with the smaller number of genes. The probability formula of the next point in the path selected by the ants is presented to highlight the closeness of the correlation relationship between the reaction variables. To increase the pheromone concentration of important genes, a new phenotype updating formula of the ACO algorithm is adopted to prevent the pheromone left by the ants that are overwhelmed with time, and then the weight coefficients of the genes are applied here to eliminate the interference of difference data as much as possible. It follows that the improved ACO algorithm has the ability of the strong positive feedback, which quickly converges to an optimal solution through the accumulation and the updating of pheromone. Finally, by combining the improved ReliefF algorithm and the improved ACO method, a hybrid filter-wrapper-based gene selection algorithm called as RFACO-GS is proposed. The experimental results under several public gene expression datasets demonstrate that the proposed method is very effective, which can significantly reduce the dimensionality of gene expression datasets, and select the most relevant genes with high classification accuracy. Nature Publishing Group UK 2019-06-20 /pmc/articles/PMC6586811/ /pubmed/31222027 http://dx.doi.org/10.1038/s41598-019-45223-x Text en © The Author(s) 2019 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Sun, Lin Kong, Xianglin Xu, Jiucheng Xue, Zhan’ao Zhai, Ruibing Zhang, Shiguang A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification |
title | A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification |
title_full | A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification |
title_fullStr | A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification |
title_full_unstemmed | A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification |
title_short | A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification |
title_sort | hybrid gene selection method based on relieff and ant colony optimization algorithm for tumor classification |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6586811/ https://www.ncbi.nlm.nih.gov/pubmed/31222027 http://dx.doi.org/10.1038/s41598-019-45223-x |
work_keys_str_mv | AT sunlin ahybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT kongxianglin ahybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT xujiucheng ahybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT xuezhanao ahybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT zhairuibing ahybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT zhangshiguang ahybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT sunlin hybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT kongxianglin hybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT xujiucheng hybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT xuezhanao hybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT zhairuibing hybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification AT zhangshiguang hybridgeneselectionmethodbasedonrelieffandantcolonyoptimizationalgorithmfortumorclassification |