Cargando…

Classification and Selection of Biomarkers in Genomic Data Using LASSO

High-throughput gene expression technologies such as microarrays have been utilized in a variety of scientific applications. Most of the work has been done on assessing univariate associations between gene expression profiles with clinical outcome (variable selection) or on developing classification...

Descripción completa

Detalles Bibliográficos
Autores principales: Ghosh, Debashis, Chinnaiyan, Arul M.
Formato: Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184048/
https://www.ncbi.nlm.nih.gov/pubmed/16046820
http://dx.doi.org/10.1155/JBB.2005.147
_version_ 1782124709600559104
author Ghosh, Debashis
Chinnaiyan, Arul M.
author_facet Ghosh, Debashis
Chinnaiyan, Arul M.
author_sort Ghosh, Debashis
collection PubMed
description High-throughput gene expression technologies such as microarrays have been utilized in a variety of scientific applications. Most of the work has been done on assessing univariate associations between gene expression profiles with clinical outcome (variable selection) or on developing classification procedures with gene expression data (supervised learning). We consider a hybrid variable selection/classification approach that is based on linear combinations of the gene expression profiles that maximize an accuracy measure summarized using the receiver operating characteristic curve. Under a specific probability model, this leads to the consideration of linear discriminant functions. We incorporate an automated variable selection approach using LASSO. An equivalence between LASSO estimation with support vector machines allows for model fitting using standard software. We apply the proposed method to simulated data as well as data from a recently published prostate cancer study.
format Text
id pubmed-1184048
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-11840482005-09-07 Classification and Selection of Biomarkers in Genomic Data Using LASSO Ghosh, Debashis Chinnaiyan, Arul M. J Biomed Biotechnol Research Article High-throughput gene expression technologies such as microarrays have been utilized in a variety of scientific applications. Most of the work has been done on assessing univariate associations between gene expression profiles with clinical outcome (variable selection) or on developing classification procedures with gene expression data (supervised learning). We consider a hybrid variable selection/classification approach that is based on linear combinations of the gene expression profiles that maximize an accuracy measure summarized using the receiver operating characteristic curve. Under a specific probability model, this leads to the consideration of linear discriminant functions. We incorporate an automated variable selection approach using LASSO. An equivalence between LASSO estimation with support vector machines allows for model fitting using standard software. We apply the proposed method to simulated data as well as data from a recently published prostate cancer study. Hindawi Publishing Corporation 2005 /pmc/articles/PMC1184048/ /pubmed/16046820 http://dx.doi.org/10.1155/JBB.2005.147 Text en Hindawi Publishing Corporation
spellingShingle Research Article
Ghosh, Debashis
Chinnaiyan, Arul M.
Classification and Selection of Biomarkers in Genomic Data Using LASSO
title Classification and Selection of Biomarkers in Genomic Data Using LASSO
title_full Classification and Selection of Biomarkers in Genomic Data Using LASSO
title_fullStr Classification and Selection of Biomarkers in Genomic Data Using LASSO
title_full_unstemmed Classification and Selection of Biomarkers in Genomic Data Using LASSO
title_short Classification and Selection of Biomarkers in Genomic Data Using LASSO
title_sort classification and selection of biomarkers in genomic data using lasso
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1184048/
https://www.ncbi.nlm.nih.gov/pubmed/16046820
http://dx.doi.org/10.1155/JBB.2005.147
work_keys_str_mv AT ghoshdebashis classificationandselectionofbiomarkersingenomicdatausinglasso
AT chinnaiyanarulm classificationandselectionofbiomarkersingenomicdatausinglasso