ANMM4CBR: a case-based reasoning method for gene expression data classification
BACKGROUND: Accurate classification of microarray data is critical for successful clinical diagnosis and treatment. The "curse of dimensionality" problem and noise in the data, however, undermine the performance of many algorithms. METHOD: In order to obtain a robust classifier, a novel A...
Main authors: | Yao, Bangpeng; Li, Shao |
---|---|
Format: | Text |
Language: | English |
Published: | BioMed Central 2010 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2843690/ https://www.ncbi.nlm.nih.gov/pubmed/20051140 http://dx.doi.org/10.1186/1748-7188-5-14 |
_version_ | 1782179253957165056 |
---|---|
author | Yao, Bangpeng Li, Shao |
author_facet | Yao, Bangpeng Li, Shao |
author_sort | Yao, Bangpeng |
collection | PubMed |
description | BACKGROUND: Accurate classification of microarray data is critical for successful clinical diagnosis and treatment. The "curse of dimensionality" problem and noise in the data, however, undermine the performance of many algorithms. METHOD: In order to obtain a robust classifier, a novel Additive Nonparametric Margin Maximum for Case-Based Reasoning (ANMM4CBR) method is proposed in this article. ANMM4CBR employs a case-based reasoning (CBR) method for classification. CBR is a suitable paradigm for microarray analysis, where the rules that define the domain knowledge are difficult to obtain because usually only a small number of training samples are available. Moreover, in order to select the most informative genes, we propose to perform feature selection via additively optimizing a nonparametric margin maximum criterion, which is defined based on gene pre-selection and sample clustering. Our feature selection method is very robust to noise in the data. RESULTS: The effectiveness of our method is demonstrated on both simulated and real data sets. We show that the ANMM4CBR method performs better than some state-of-the-art methods such as support vector machine (SVM) and k nearest neighbor (kNN), especially when the data contains a high level of noise. AVAILABILITY: The source code is attached as an additional file of this paper. |
format | Text |
id | pubmed-2843690 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-2843690 2010-03-23 ANMM4CBR: a case-based reasoning method for gene expression data classification Yao, Bangpeng Li, Shao Algorithms Mol Biol Research BACKGROUND: Accurate classification of microarray data is critical for successful clinical diagnosis and treatment. The "curse of dimensionality" problem and noise in the data, however, undermine the performance of many algorithms. METHOD: In order to obtain a robust classifier, a novel Additive Nonparametric Margin Maximum for Case-Based Reasoning (ANMM4CBR) method is proposed in this article. ANMM4CBR employs a case-based reasoning (CBR) method for classification. CBR is a suitable paradigm for microarray analysis, where the rules that define the domain knowledge are difficult to obtain because usually only a small number of training samples are available. Moreover, in order to select the most informative genes, we propose to perform feature selection via additively optimizing a nonparametric margin maximum criterion, which is defined based on gene pre-selection and sample clustering. Our feature selection method is very robust to noise in the data. RESULTS: The effectiveness of our method is demonstrated on both simulated and real data sets. We show that the ANMM4CBR method performs better than some state-of-the-art methods such as support vector machine (SVM) and k nearest neighbor (kNN), especially when the data contains a high level of noise. AVAILABILITY: The source code is attached as an additional file of this paper. BioMed Central 2010-01-06 /pmc/articles/PMC2843690/ /pubmed/20051140 http://dx.doi.org/10.1186/1748-7188-5-14 Text en Copyright ©2010 Yao and Li; licensee BioMed Central Ltd.
http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Yao, Bangpeng Li, Shao ANMM4CBR: a case-based reasoning method for gene expression data classification |
title | ANMM4CBR: a case-based reasoning method for gene expression data classification |
title_sort | anmm4cbr: a case-based reasoning method for gene expression data classification |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2843690/ https://www.ncbi.nlm.nih.gov/pubmed/20051140 http://dx.doi.org/10.1186/1748-7188-5-14 |
work_keys_str_mv | AT yaobangpeng anmm4cbracasebasedreasoningmethodforgeneexpressiondataclassification AT lishao anmm4cbracasebasedreasoningmethodforgeneexpressiondataclassification |
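
The abstract describes two ideas: choosing informative genes by additively optimizing a margin criterion, and classifying a new sample by reasoning from stored cases. The sketch below illustrates that general pattern only. It is not the authors' ANMM4CBR implementation: the margin used here (distance to the nearest other-class sample minus distance to the nearest same-class sample) is a simplified stand-in for the paper's criterion, gene pre-selection and sample clustering are omitted, and every function name and the toy data are hypothetical.

```python
import math
from collections import Counter


def distance(a, b, genes):
    """Euclidean distance between two samples, restricted to the given gene indices."""
    return math.sqrt(sum((a[g] - b[g]) ** 2 for g in genes))


def margin(samples, labels, genes):
    """Simplified nonparametric margin (an assumption, not the paper's definition):
    for each sample, nearest other-class distance minus nearest same-class distance."""
    total = 0.0
    for i, (x, y) in enumerate(zip(samples, labels)):
        hits = [distance(x, s, genes)
                for j, (s, l) in enumerate(zip(samples, labels)) if j != i and l == y]
        misses = [distance(x, s, genes)
                  for s, l in zip(samples, labels) if l != y]
        if hits and misses:
            total += min(misses) - min(hits)
    return total


def select_genes(samples, labels, n_genes):
    """Greedy additive selection: add one gene at a time, keeping the gene
    that gives the largest margin together with those already chosen."""
    selected = []
    remaining = list(range(len(samples[0])))
    for _ in range(min(n_genes, len(remaining))):
        best = max(remaining, key=lambda g: margin(samples, labels, selected + [g]))
        selected.append(best)
        remaining.remove(best)
    return selected


def classify(query, samples, labels, genes, k=3):
    """Case-based step: retrieve the k most similar stored cases and reuse
    their majority label for the query."""
    ranked = sorted(zip(samples, labels), key=lambda c: distance(query, c[0], genes))
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy data (hypothetical): gene 0 separates the classes, genes 1 and 2 are noise.
samples = [
    [0.1, 5.0, 2.0],
    [0.2, 1.0, 7.0],
    [0.0, 9.0, 4.0],
    [1.0, 4.0, 3.0],
    [0.9, 8.0, 6.0],
    [1.1, 2.0, 1.0],
]
labels = ["normal", "normal", "normal", "tumor", "tumor", "tumor"]

genes = select_genes(samples, labels, 1)
prediction = classify([0.95, 1.0, 7.0], samples, labels, genes, k=3)
```

On this toy data the selection step picks the one informative gene, and the query is then assigned by vote among its three nearest stored cases. Using all three genes with `k=1` instead retrieves a "normal" case that matches the query only on the noisy genes, which is the kind of failure that feature selection is meant to avoid.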