Cargando…

Survival prediction from clinico-genomic models - a comparative study

BACKGROUND: Survival prediction from high-dimensional genomic data is an active field in today's medical research. Most of the proposed prediction methods make use of genomic data alone without considering established clinical covariates that often are available and known to have predictive val...

Descripción completa

Detalles Bibliográficos
Autores principales: Bøvelstad, Hege M, Nygård, Ståle, Borgan, Ørnulf
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811121/
https://www.ncbi.nlm.nih.gov/pubmed/20003386
http://dx.doi.org/10.1186/1471-2105-10-413
_version_ 1782176730696384512
author Bøvelstad, Hege M
Nygård, Ståle
Borgan, Ørnulf
author_facet Bøvelstad, Hege M
Nygård, Ståle
Borgan, Ørnulf
author_sort Bøvelstad, Hege M
collection PubMed
description BACKGROUND: Survival prediction from high-dimensional genomic data is an active field in today's medical research. Most of the proposed prediction methods make use of genomic data alone without considering established clinical covariates that often are available and known to have predictive value. Recent studies suggest that combining clinical and genomic information may improve predictions, but there is a lack of systematic studies on the topic. Also, for the widely used Cox regression model, it is not obvious how to handle such combined models. RESULTS: We propose a way to combine classical clinical covariates with genomic data in a clinico-genomic prediction model based on the Cox regression model. The prediction model is obtained by a simultaneous use of both types of covariates, but applying dimension reduction only to the high-dimensional genomic variables. We describe how this can be done for seven well-known prediction methods: variable selection, unsupervised and supervised principal components regression and partial least squares regression, ridge regression, and the lasso. We further perform a systematic comparison of the performance of prediction models using clinical covariates only, genomic data only, or a combination of the two. The comparison is done using three survival data sets containing both clinical information and microarray gene expression data. Matlab code for the clinico-genomic prediction methods is available at http://www.med.uio.no/imb/stat/bmms/software/clinico-genomic/. CONCLUSIONS: Based on our three data sets, the comparison shows that established clinical covariates will often lead to better predictions than what can be obtained from genomic data alone. In the cases where the genomic models are better than the clinical, ridge regression is used for dimension reduction. We also find that the clinico-genomic models tend to outperform the models based on only genomic data. Further, clinico-genomic models and the use of ridge regression gives for all three data sets better predictions than models based on the clinical covariates alone.
format Text
id pubmed-2811121
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28111212010-01-26 Survival prediction from clinico-genomic models - a comparative study Bøvelstad, Hege M Nygård, Ståle Borgan, Ørnulf BMC Bioinformatics Research article BACKGROUND: Survival prediction from high-dimensional genomic data is an active field in today's medical research. Most of the proposed prediction methods make use of genomic data alone without considering established clinical covariates that often are available and known to have predictive value. Recent studies suggest that combining clinical and genomic information may improve predictions, but there is a lack of systematic studies on the topic. Also, for the widely used Cox regression model, it is not obvious how to handle such combined models. RESULTS: We propose a way to combine classical clinical covariates with genomic data in a clinico-genomic prediction model based on the Cox regression model. The prediction model is obtained by a simultaneous use of both types of covariates, but applying dimension reduction only to the high-dimensional genomic variables. We describe how this can be done for seven well-known prediction methods: variable selection, unsupervised and supervised principal components regression and partial least squares regression, ridge regression, and the lasso. We further perform a systematic comparison of the performance of prediction models using clinical covariates only, genomic data only, or a combination of the two. The comparison is done using three survival data sets containing both clinical information and microarray gene expression data. Matlab code for the clinico-genomic prediction methods is available at http://www.med.uio.no/imb/stat/bmms/software/clinico-genomic/. CONCLUSIONS: Based on our three data sets, the comparison shows that established clinical covariates will often lead to better predictions than what can be obtained from genomic data alone. In the cases where the genomic models are better than the clinical, ridge regression is used for dimension reduction. We also find that the clinico-genomic models tend to outperform the models based on only genomic data. Further, clinico-genomic models and the use of ridge regression gives for all three data sets better predictions than models based on the clinical covariates alone. BioMed Central 2009-12-13 /pmc/articles/PMC2811121/ /pubmed/20003386 http://dx.doi.org/10.1186/1471-2105-10-413 Text en Copyright ©2009 Bøvelstad et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Bøvelstad, Hege M
Nygård, Ståle
Borgan, Ørnulf
Survival prediction from clinico-genomic models - a comparative study
title Survival prediction from clinico-genomic models - a comparative study
title_full Survival prediction from clinico-genomic models - a comparative study
title_fullStr Survival prediction from clinico-genomic models - a comparative study
title_full_unstemmed Survival prediction from clinico-genomic models - a comparative study
title_short Survival prediction from clinico-genomic models - a comparative study
title_sort survival prediction from clinico-genomic models - a comparative study
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2811121/
https://www.ncbi.nlm.nih.gov/pubmed/20003386
http://dx.doi.org/10.1186/1471-2105-10-413
work_keys_str_mv AT bøvelstadhegem survivalpredictionfromclinicogenomicmodelsacomparativestudy
AT nygardstale survivalpredictionfromclinicogenomicmodelsacomparativestudy
AT borganørnulf survivalpredictionfromclinicogenomicmodelsacomparativestudy