Cargando…

Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data

In the past decade, researchers in oncology have sought to develop survival prediction models using gene expression data. The least absolute shrinkage and selection operator (lasso) has been widely used to select genes that truly correlated with a patient's survival. The lasso selects genes for...

Descripción completa

Detalles Bibliográficos
Autores principales: Kaneko, Shuhei, Hirakawa, Akihiro, Hamada, Chikuma
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi Publishing Corporation 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4469838/
https://www.ncbi.nlm.nih.gov/pubmed/26146513
http://dx.doi.org/10.1155/2015/259474
_version_ 1782376669848272896
author Kaneko, Shuhei
Hirakawa, Akihiro
Hamada, Chikuma
author_facet Kaneko, Shuhei
Hirakawa, Akihiro
Hamada, Chikuma
author_sort Kaneko, Shuhei
collection PubMed
description In the past decade, researchers in oncology have sought to develop survival prediction models using gene expression data. The least absolute shrinkage and selection operator (lasso) has been widely used to select genes that truly correlated with a patient's survival. The lasso selects genes for prediction by shrinking a large number of coefficients of the candidate genes towards zero based on a tuning parameter that is often determined by a cross-validation (CV). However, this method can pass over (or fail to identify) true positive genes (i.e., it identifies false negatives) in certain instances, because the lasso tends to favor the development of a simple prediction model. Here, we attempt to monitor the identification of false negatives by developing a method for estimating the number of true positive (TP) genes for a series of values of a tuning parameter that assumes a mixture distribution for the lasso estimates. Using our developed method, we performed a simulation study to examine its precision in estimating the number of TP genes. Additionally, we applied our method to a real gene expression dataset and found that it was able to identify genes correlated with survival that a CV method was unable to detect.
format Online
Article
Text
id pubmed-4469838
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Hindawi Publishing Corporation
record_format MEDLINE/PubMed
spelling pubmed-44698382015-07-05 Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data Kaneko, Shuhei Hirakawa, Akihiro Hamada, Chikuma Comput Math Methods Med Research Article In the past decade, researchers in oncology have sought to develop survival prediction models using gene expression data. The least absolute shrinkage and selection operator (lasso) has been widely used to select genes that truly correlated with a patient's survival. The lasso selects genes for prediction by shrinking a large number of coefficients of the candidate genes towards zero based on a tuning parameter that is often determined by a cross-validation (CV). However, this method can pass over (or fail to identify) true positive genes (i.e., it identifies false negatives) in certain instances, because the lasso tends to favor the development of a simple prediction model. Here, we attempt to monitor the identification of false negatives by developing a method for estimating the number of true positive (TP) genes for a series of values of a tuning parameter that assumes a mixture distribution for the lasso estimates. Using our developed method, we performed a simulation study to examine its precision in estimating the number of TP genes. Additionally, we applied our method to a real gene expression dataset and found that it was able to identify genes correlated with survival that a CV method was unable to detect. Hindawi Publishing Corporation 2015 2015-06-03 /pmc/articles/PMC4469838/ /pubmed/26146513 http://dx.doi.org/10.1155/2015/259474 Text en Copyright © 2015 Shuhei Kaneko et al. https://creativecommons.org/licenses/by/3.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Kaneko, Shuhei
Hirakawa, Akihiro
Hamada, Chikuma
Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data
title Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data
title_full Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data
title_fullStr Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data
title_full_unstemmed Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data
title_short Enhancing the Lasso Approach for Developing a Survival Prediction Model Based on Gene Expression Data
title_sort enhancing the lasso approach for developing a survival prediction model based on gene expression data
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4469838/
https://www.ncbi.nlm.nih.gov/pubmed/26146513
http://dx.doi.org/10.1155/2015/259474
work_keys_str_mv AT kanekoshuhei enhancingthelassoapproachfordevelopingasurvivalpredictionmodelbasedongeneexpressiondata
AT hirakawaakihiro enhancingthelassoapproachfordevelopingasurvivalpredictionmodelbasedongeneexpressiondata
AT hamadachikuma enhancingthelassoapproachfordevelopingasurvivalpredictionmodelbasedongeneexpressiondata