Cargando…

Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles

There have been relatively few publications using linear regression models to predict a continuous response based on microarray expression profiles. Standard linear regression methods are problematic when the number of predictor variables exceeds the number of cases. We have evaluated three linear r...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yingdong, Simon, Richard
Formato: Texto
Lenguaje:English
Publicado: Libertas Academica 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879606/
https://www.ncbi.nlm.nih.gov/pubmed/20523915
_version_ 1782181950369300480
author Zhao, Yingdong
Simon, Richard
author_facet Zhao, Yingdong
Simon, Richard
author_sort Zhao, Yingdong
collection PubMed
description There have been relatively few publications using linear regression models to predict a continuous response based on microarray expression profiles. Standard linear regression methods are problematic when the number of predictor variables exceeds the number of cases. We have evaluated three linear regression algorithms that can be used for the prediction of a continuous response based on high dimensional gene expression data. The three algorithms are the least angle regression (LAR), the least absolute shrinkage and selection operator (LASSO), and the averaged linear regression method (ALM). All methods are tested using simulations based on a real gene expression dataset and analyses of two sets of real gene expression data and using an unbiased complete cross validation approach. Our results show that the LASSO algorithm often provides a model with somewhat lower prediction error than the LAR method, but both of them perform more efficiently than the ALM predictor. We have developed a plug-in for BRB-ArrayTools that implements the LAR and the LASSO algorithms with complete cross-validation.
format Text
id pubmed-2879606
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher Libertas Academica
record_format MEDLINE/PubMed
spelling pubmed-28796062010-06-03 Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles Zhao, Yingdong Simon, Richard Cancer Inform Original Research There have been relatively few publications using linear regression models to predict a continuous response based on microarray expression profiles. Standard linear regression methods are problematic when the number of predictor variables exceeds the number of cases. We have evaluated three linear regression algorithms that can be used for the prediction of a continuous response based on high dimensional gene expression data. The three algorithms are the least angle regression (LAR), the least absolute shrinkage and selection operator (LASSO), and the averaged linear regression method (ALM). All methods are tested using simulations based on a real gene expression dataset and analyses of two sets of real gene expression data and using an unbiased complete cross validation approach. Our results show that the LASSO algorithm often provides a model with somewhat lower prediction error than the LAR method, but both of them perform more efficiently than the ALM predictor. We have developed a plug-in for BRB-ArrayTools that implements the LAR and the LASSO algorithms with complete cross-validation. Libertas Academica 2010-05-07 /pmc/articles/PMC2879606/ /pubmed/20523915 Text en © 2010 the author(s), publisher and licensee Libertas Academica Ltd. This is an open access article. Unrestricted non-commercial use is permitted provided the original work is properly cited.
spellingShingle Original Research
Zhao, Yingdong
Simon, Richard
Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles
title Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles
title_full Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles
title_fullStr Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles
title_full_unstemmed Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles
title_short Development and Validation of Predictive Indices for a Continuous Outcome Using Gene Expression Profiles
title_sort development and validation of predictive indices for a continuous outcome using gene expression profiles
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2879606/
https://www.ncbi.nlm.nih.gov/pubmed/20523915
work_keys_str_mv AT zhaoyingdong developmentandvalidationofpredictiveindicesforacontinuousoutcomeusinggeneexpressionprofiles
AT simonrichard developmentandvalidationofpredictiveindicesforacontinuousoutcomeusinggeneexpressionprofiles