Cargando…

Kernel-based distance metric learning for microarray data classification

BACKGROUND: The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionalit...

Descripción completa

Detalles Bibliográficos
Autores principales: Xiong, Huilin, Chen, Xue-wen
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2006
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513256/
https://www.ncbi.nlm.nih.gov/pubmed/16774678
http://dx.doi.org/10.1186/1471-2105-7-299
_version_ 1782128469458550784
author Xiong, Huilin
Chen, Xue-wen
author_facet Xiong, Huilin
Chen, Xue-wen
author_sort Xiong, Huilin
collection PubMed
description BACKGROUND: The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionality and small sample size, which make the task quite challenging. RESULTS: In this paper, we present a modified K-nearest-neighbor (KNN) scheme, which is based on learning an adaptive distance metric in the data space, for cancer classification using microarray data. The distance metric, derived from the procedure of a data-dependent kernel optimization, can substantially increase the class separability of the data and, consequently, lead to a significant improvement in the performance of the KNN classifier. Intensive experiments show that the performance of the proposed kernel-based KNN scheme is competitive to those of some sophisticated classifiers such as support vector machines (SVMs) and the uncorrelated linear discriminant analysis (ULDA) in classifying the gene expression data. CONCLUSION: A novel distance metric is developed and incorporated into the KNN scheme for cancer classification. This metric can substantially increase the class separability of the data in the feature space and, hence, lead to a significant improvement in the performance of the KNN classifier.
format Text
id pubmed-1513256
institution National Center for Biotechnology Information
language English
publishDate 2006
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-15132562006-07-20 Kernel-based distance metric learning for microarray data classification Xiong, Huilin Chen, Xue-wen BMC Bioinformatics Research Article BACKGROUND: The most fundamental task using gene expression data in clinical oncology is to classify tissue samples according to their gene expression levels. Compared with traditional pattern classifications, gene expression-based data classification is typically characterized by high dimensionality and small sample size, which make the task quite challenging. RESULTS: In this paper, we present a modified K-nearest-neighbor (KNN) scheme, which is based on learning an adaptive distance metric in the data space, for cancer classification using microarray data. The distance metric, derived from the procedure of a data-dependent kernel optimization, can substantially increase the class separability of the data and, consequently, lead to a significant improvement in the performance of the KNN classifier. Intensive experiments show that the performance of the proposed kernel-based KNN scheme is competitive to those of some sophisticated classifiers such as support vector machines (SVMs) and the uncorrelated linear discriminant analysis (ULDA) in classifying the gene expression data. CONCLUSION: A novel distance metric is developed and incorporated into the KNN scheme for cancer classification. This metric can substantially increase the class separability of the data in the feature space and, hence, lead to a significant improvement in the performance of the KNN classifier. BioMed Central 2006-06-14 /pmc/articles/PMC1513256/ /pubmed/16774678 http://dx.doi.org/10.1186/1471-2105-7-299 Text en Copyright © 2006 Xiong and Chen; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xiong, Huilin
Chen, Xue-wen
Kernel-based distance metric learning for microarray data classification
title Kernel-based distance metric learning for microarray data classification
title_full Kernel-based distance metric learning for microarray data classification
title_fullStr Kernel-based distance metric learning for microarray data classification
title_full_unstemmed Kernel-based distance metric learning for microarray data classification
title_short Kernel-based distance metric learning for microarray data classification
title_sort kernel-based distance metric learning for microarray data classification
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1513256/
https://www.ncbi.nlm.nih.gov/pubmed/16774678
http://dx.doi.org/10.1186/1471-2105-7-299
work_keys_str_mv AT xionghuilin kernelbaseddistancemetriclearningformicroarraydataclassification
AT chenxuewen kernelbaseddistancemetriclearningformicroarraydataclassification