Cargando…
Kernel-based distance metric learning for microarray data classification
BACKGROUND: The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionalit...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2006
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513256/ https://www.ncbi.nlm.nih.gov/pubmed/16774678 http://dx.doi.org/10.1186/1471-2105-7-299 |
_version_ | 1782128469458550784 |
---|---|
author | Xiong, Huilin Chen, Xue-wen |
author_facet | Xiong, Huilin Chen, Xue-wen |
author_sort | Xiong, Huilin |
collection | PubMed |
description | BACKGROUND: The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionality and small sample size, which make the task quite challenging. RESULTS: In this paper, we present a modified K-nearest-neighbor (KNN) scheme, which is based on learning an adaptive distance metric in the data space, for cancer classification using microarray data. The distance metric, derived from the procedure of a data-dependent kernel optimization, can substantially increase the class separability of the data and, consequently, lead to a significant improvement in the performance of the KNN classifier. Intensive experiments show that the performance of the proposed kernel-based KNN scheme is competitive to those of some sophisticated classifiers such as support vector machines (SVMs) and the uncorrelated linear discriminant analysis (ULDA) in classifying the gene expression data. CONCLUSION: A novel distance metric is developed and incorporated into the KNN scheme for cancer classification. This metric can substantially increase the class separability of the data in the feature space and, hence, lead to a significant improvement in the performance of the KNN classifier. |
format | Text |
id | pubmed-1513256 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2006 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-15132562006-07-20 Kernel-based distance metric learning for microarray data classification Xiong, Huilin Chen, Xue-wen BMC Bioinformatics Research Article BACKGROUND: The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionality and small sample size, which make the task quite challenging. RESULTS: In this paper, we present a modified K-nearest-neighbor (KNN) scheme, which is based on learning an adaptive distance metric in the data space, for cancer classification using microarray data. The distance metric, derived from the procedure of a data-dependent kernel optimization, can substantially increase the class separability of the data and, consequently, lead to a significant improvement in the performance of the KNN classifier. Intensive experiments show that the performance of the proposed kernel-based KNN scheme is competitive to those of some sophisticated classifiers such as support vector machines (SVMs) and the uncorrelated linear discriminant analysis (ULDA) in classifying the gene expression data. CONCLUSION: A novel distance metric is developed and incorporated into the KNN scheme for cancer classification. This metric can substantially increase the class separability of the data in the feature space and, hence, lead to a significant improvement in the performance of the KNN classifier. BioMed Central 2006-06-14 /pmc/articles/PMC1513256/ /pubmed/16774678 http://dx.doi.org/10.1186/1471-2105-7-299 Text en Copyright © 2006 Xiong and Chen; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Xiong, Huilin Chen, Xue-wen Kernel-based distance metric learning for microarray data classification |
title | Kernel-based distance metric learning for microarray data classification |
title_full | Kernel-based distance metric learning for microarray data classification |
title_fullStr | Kernel-based distance metric learning for microarray data classification |
title_full_unstemmed | Kernel-based distance metric learning for microarray data classification |
title_short | Kernel-based distance metric learning for microarray data classification |
title_sort | kernel-based distance metric learning for microarray data classification |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513256/ https://www.ncbi.nlm.nih.gov/pubmed/16774678 http://dx.doi.org/10.1186/1471-2105-7-299 |
work_keys_str_mv | AT xionghuilin kernelbaseddistancemetriclearningformicroarraydataclassification AT chenxuewen kernelbaseddistancemetriclearningformicroarraydataclassification |