Cargando…

A novel feature extraction approach for microarray data based on multi-algorithm fusion

Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes...

Descripción completa

Detalles Bibliográficos
Autores principales: Jiang, Zhu, Xu, Rong
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Biomedical Informatics 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349936/
https://www.ncbi.nlm.nih.gov/pubmed/25780277
http://dx.doi.org/10.6026/97320630011027
_version_ 1782360110246395904
author Jiang, Zhu
Xu, Rong
author_facet Jiang, Zhu
Xu, Rong
author_sort Jiang, Zhu
collection PubMed
description Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions.
format Online
Article
Text
id pubmed-4349936
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Biomedical Informatics
record_format MEDLINE/PubMed
spelling pubmed-43499362015-03-16 A novel feature extraction approach for microarray data based on multi-algorithm fusion Jiang, Zhu Xu, Rong Bioinformation Hypothesis Feature extraction is one of the most important and effective method to reduce dimension in data mining, with emerging of high dimensional data such as microarray gene expression data. Feature extraction for gene selection, mainly serves two purposes. One is to identify certain disease-related genes. The other is to find a compact set of discriminative genes to build a pattern classifier with reduced complexity and improved generalization capabilities. Depending on the purpose of gene selection, two types of feature extraction algorithms including ranking-based feature extraction and set-based feature extraction are employed in microarray gene expression data analysis. In ranking-based feature extraction, features are evaluated on an individual basis, without considering inter-relationship between features in general, while set-based feature extraction evaluates features based on their role in a feature set by taking into account dependency between features. Just as learning methods, feature extraction has a problem in its generalization ability, which is robustness. However, the issue of robustness is often overlooked in feature extraction. In order to improve the accuracy and robustness of feature extraction for microarray data, a novel approach based on multi-algorithm fusion is proposed. By fusing different types of feature extraction algorithms to select the feature from the samples set, the proposed approach is able to improve feature extraction performance. The new approach is tested against gene expression dataset including Colon cancer data, CNS data, DLBCL data, and Leukemia data. The testing results show that the performance of this algorithm is better than existing solutions. Biomedical Informatics 2015-01-30 /pmc/articles/PMC4349936/ /pubmed/25780277 http://dx.doi.org/10.6026/97320630011027 Text en © 2015 Biomedical Informatics This is an open-access article, which permits unrestricted use, distribution, and reproduction in any medium, for non-commercial purposes, provided the original author and source are credited.
spellingShingle Hypothesis
Jiang, Zhu
Xu, Rong
A novel feature extraction approach for microarray data based on multi-algorithm fusion
title A novel feature extraction approach for microarray data based on multi-algorithm fusion
title_full A novel feature extraction approach for microarray data based on multi-algorithm fusion
title_fullStr A novel feature extraction approach for microarray data based on multi-algorithm fusion
title_full_unstemmed A novel feature extraction approach for microarray data based on multi-algorithm fusion
title_short A novel feature extraction approach for microarray data based on multi-algorithm fusion
title_sort novel feature extraction approach for microarray data based on multi-algorithm fusion
topic Hypothesis
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4349936/
https://www.ncbi.nlm.nih.gov/pubmed/25780277
http://dx.doi.org/10.6026/97320630011027
work_keys_str_mv AT jiangzhu anovelfeatureextractionapproachformicroarraydatabasedonmultialgorithmfusion
AT xurong anovelfeatureextractionapproachformicroarraydatabasedonmultialgorithmfusion
AT jiangzhu novelfeatureextractionapproachformicroarraydatabasedonmultialgorithmfusion
AT xurong novelfeatureextractionapproachformicroarraydatabasedonmultialgorithmfusion