Cargando…

Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies

Mining of gene expression data to identify genes associated with patient survival is an ongoing problem in cancer prognostic studies using microarrays in order to use such genes to achieve more accurate prognoses. The least absolute shrinkage and selection operator (lasso) is often used for gene sel...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaneko, Shuhei, Hirakawa, Akihiro, Hamada, Chikuma
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Libertas Academica 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3298378/
https://www.ncbi.nlm.nih.gov/pubmed/22442625
http://dx.doi.org/10.4137/CIN.S9048
_version_ 1782225990521454592
author Kaneko, Shuhei
Hirakawa, Akihiro
Hamada, Chikuma
author_facet Kaneko, Shuhei
Hirakawa, Akihiro
Hamada, Chikuma
author_sort Kaneko, Shuhei
collection PubMed
description Mining of gene expression data to identify genes associated with patient survival is an ongoing problem in cancer prognostic studies using microarrays in order to use such genes to achieve more accurate prognoses. The least absolute shrinkage and selection operator (lasso) is often used for gene selection and parameter estimation in high-dimensional microarray data. The lasso shrinks some of the coefficients to zero, and the amount of shrinkage is determined by the tuning parameter, often determined by cross validation. The model determined by this cross validation contains many false positives whose coefficients are actually zero. We propose a method for estimating the false positive rate (FPR) for lasso estimates in a high-dimensional Cox model. We performed a simulation study to examine the precision of the FPR estimate by the proposed method. We applied the proposed method to real data and illustrated the identification of false positive genes.
format Online
Article
Text
id pubmed-3298378
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-32983782012-03-22 Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies Kaneko, Shuhei Hirakawa, Akihiro Hamada, Chikuma Cancer Inform Original Research Mining of gene expression data to identify genes associated with patient survival is an ongoing problem in cancer prognostic studies using microarrays in order to use such genes to achieve more accurate prognoses. The least absolute shrinkage and selection operator (lasso) is often used for gene selection and parameter estimation in high-dimensional microarray data. The lasso shrinks some of the coefficients to zero, and the amount of shrinkage is determined by the tuning parameter, often determined by cross validation. The model determined by this cross validation contains many false positives whose coefficients are actually zero. We propose a method for estimating the false positive rate (FPR) for lasso estimates in a high-dimensional Cox model. We performed a simulation study to examine the precision of the FPR estimate by the proposed method. We applied the proposed method to real data and illustrated the identification of false positive genes. Libertas Academica 2012-02-27 /pmc/articles/PMC3298378/ /pubmed/22442625 http://dx.doi.org/10.4137/CIN.S9048 Text en © 2012 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Original Research
Kaneko, Shuhei
Hirakawa, Akihiro
Hamada, Chikuma
Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
title Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
title_full Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
title_fullStr Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
title_full_unstemmed Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
title_short Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
title_sort gene selection using a high-dimensional regression model with microarrays in cancer prognostic studies
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3298378/
https://www.ncbi.nlm.nih.gov/pubmed/22442625
http://dx.doi.org/10.4137/CIN.S9048
work_keys_str_mv AT kanekoshuhei geneselectionusingahighdimensionalregressionmodelwithmicroarraysincancerprognosticstudies
AT hirakawaakihiro geneselectionusingahighdimensionalregressionmodelwithmicroarraysincancerprognosticstudies
AT hamadachikuma geneselectionusingahighdimensionalregressionmodelwithmicroarraysincancerprognosticstudies