Cargando…
Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies
Mining of gene expression data to identify genes associated with patient survival is an ongoing problem in cancer prognostic studies using microarrays in order to use such genes to achieve more accurate prognoses. The least absolute shrinkage and selection operator (lasso) is often used for gene sel...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Libertas Academica
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3298378/ https://www.ncbi.nlm.nih.gov/pubmed/22442625 http://dx.doi.org/10.4137/CIN.S9048 |
_version_ | 1782225990521454592 |
---|---|
author | Kaneko, Shuhei Hirakawa, Akihiro Hamada, Chikuma |
author_facet | Kaneko, Shuhei Hirakawa, Akihiro Hamada, Chikuma |
author_sort | Kaneko, Shuhei |
collection | PubMed |
description | Mining of gene expression data to identify genes associated with patient survival is an ongoing problem in cancer prognostic studies using microarrays in order to use such genes to achieve more accurate prognoses. The least absolute shrinkage and selection operator (lasso) is often used for gene selection and parameter estimation in high-dimensional microarray data. The lasso shrinks some of the coefficients to zero, and the amount of shrinkage is determined by the tuning parameter, often determined by cross validation. The model determined by this cross validation contains many false positives whose coefficients are actually zero. We propose a method for estimating the false positive rate (FPR) for lasso estimates in a high-dimensional Cox model. We performed a simulation study to examine the precision of the FPR estimate by the proposed method. We applied the proposed method to real data and illustrated the identification of false positive genes. |
format | Online Article Text |
id | pubmed-3298378 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Libertas Academica |
record_format | MEDLINE/PubMed |
spelling | pubmed-32983782012-03-22 Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies Kaneko, Shuhei Hirakawa, Akihiro Hamada, Chikuma Cancer Inform Original Research Mining of gene expression data to identify genes associated with patient survival is an ongoing problem in cancer prognostic studies using microarrays in order to use such genes to achieve more accurate prognoses. The least absolute shrinkage and selection operator (lasso) is often used for gene selection and parameter estimation in high-dimensional microarray data. The lasso shrinks some of the coefficients to zero, and the amount of shrinkage is determined by the tuning parameter, often determined by cross validation. The model determined by this cross validation contains many false positives whose coefficients are actually zero. We propose a method for estimating the false positive rate (FPR) for lasso estimates in a high-dimensional Cox model. We performed a simulation study to examine the precision of the FPR estimate by the proposed method. We applied the proposed method to real data and illustrated the identification of false positive genes. Libertas Academica 2012-02-27 /pmc/articles/PMC3298378/ /pubmed/22442625 http://dx.doi.org/10.4137/CIN.S9048 Text en © 2012 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited. |
spellingShingle | Original Research Kaneko, Shuhei Hirakawa, Akihiro Hamada, Chikuma Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies |
title | Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies |
title_full | Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies |
title_fullStr | Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies |
title_full_unstemmed | Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies |
title_short | Gene Selection using a High-Dimensional Regression Model with Microarrays in Cancer Prognostic Studies |
title_sort | gene selection using a high-dimensional regression model with microarrays in cancer prognostic studies |
topic | Original Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3298378/ https://www.ncbi.nlm.nih.gov/pubmed/22442625 http://dx.doi.org/10.4137/CIN.S9048 |
work_keys_str_mv | AT kanekoshuhei geneselectionusingahighdimensionalregressionmodelwithmicroarraysincancerprognosticstudies AT hirakawaakihiro geneselectionusingahighdimensionalregressionmodelwithmicroarraysincancerprognosticstudies AT hamadachikuma geneselectionusingahighdimensionalregressionmodelwithmicroarraysincancerprognosticstudies |