Cargando…

Sparse Representation for Classification of Tumors Using Gene Expression Data

Personalized drug design requires the classification of cancer patients as accurate as possible. With advances in genome sequencing and microarray technology, a large amount of gene expression data has been and will continuously be produced from various cancerous patients. Such cancer-alerted gene e...

Descripción completa

Detalles Bibliográficos
Autores principales: Hang, Xiyi, Wu, Fang-Xiang
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2655631/
https://www.ncbi.nlm.nih.gov/pubmed/19300522
http://dx.doi.org/10.1155/2009/403689
_version_ 1782165452554764288
author Hang, Xiyi
Wu, Fang-Xiang
author_facet Hang, Xiyi
Wu, Fang-Xiang
author_sort Hang, Xiyi
collection PubMed
description Personalized drug design requires the classification of cancer patients as accurate as possible. With advances in genome sequencing and microarray technology, a large amount of gene expression data has been and will continuously be produced from various cancerous patients. Such cancer-alerted gene expression data allows us to classify tumors at the genomewide level. However, cancer-alerted gene expression datasets typically have much more number of genes (features) than that of samples (patients), which imposes a challenge for classification of tumors. In this paper, a new method is proposed for cancer diagnosis using gene expression data by casting the classification problem as finding sparse representations of test samples with respect to training samples. The sparse representation is computed by the l(1)-regularized least square method. To investigate its performance, the proposed method is applied to six tumor gene expression datasets and compared with various support vector machine (SVM) methods. The experimental results have shown that the performance of the proposed method is comparable with or better than those of SVMs. In addition, the proposed method is more efficient than SVMs as it has no need of model selection.
format Text
id pubmed-2655631
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-26556312009-03-19 Sparse Representation for Classification of Tumors Using Gene Expression Data Hang, Xiyi Wu, Fang-Xiang J Biomed Biotechnol Research Article Personalized drug design requires the classification of cancer patients as accurate as possible. With advances in genome sequencing and microarray technology, a large amount of gene expression data has been and will continuously be produced from various cancerous patients. Such cancer-alerted gene expression data allows us to classify tumors at the genomewide level. However, cancer-alerted gene expression datasets typically have much more number of genes (features) than that of samples (patients), which imposes a challenge for classification of tumors. In this paper, a new method is proposed for cancer diagnosis using gene expression data by casting the classification problem as finding sparse representations of test samples with respect to training samples. The sparse representation is computed by the l(1)-regularized least square method. To investigate its performance, the proposed method is applied to six tumor gene expression datasets and compared with various support vector machine (SVM) methods. The experimental results have shown that the performance of the proposed method is comparable with or better than those of SVMs. In addition, the proposed method is more efficient than SVMs as it has no need of model selection. Hindawi Publishing Corporation 2009 2009-03-15 /pmc/articles/PMC2655631/ /pubmed/19300522 http://dx.doi.org/10.1155/2009/403689 Text en Copyright © 2009 X. Hang and F.-X. Wu. This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hang, Xiyi
Wu, Fang-Xiang
Sparse Representation for Classification of Tumors Using Gene Expression Data
title Sparse Representation for Classification of Tumors Using Gene Expression Data
title_full Sparse Representation for Classification of Tumors Using Gene Expression Data
title_fullStr Sparse Representation for Classification of Tumors Using Gene Expression Data
title_full_unstemmed Sparse Representation for Classification of Tumors Using Gene Expression Data
title_short Sparse Representation for Classification of Tumors Using Gene Expression Data
title_sort sparse representation for classification of tumors using gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2655631/
https://www.ncbi.nlm.nih.gov/pubmed/19300522
http://dx.doi.org/10.1155/2009/403689
work_keys_str_mv AT hangxiyi sparserepresentationforclassificationoftumorsusinggeneexpressiondata
AT wufangxiang sparserepresentationforclassificationoftumorsusinggeneexpressiondata