Cargando…

Combining Gene Signatures Improves Prediction of Breast Cancer Survival

BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specif...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Xi, Rødland, Einar Andreas, Sørlie, Therese, Naume, Bjørn, Langerød, Anita, Frigessi, Arnoldo, Kristensen, Vessela N., Børresen-Dale, Anne-Lise, Lingjærde, Ole Christian
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3053398/
https://www.ncbi.nlm.nih.gov/pubmed/21423775
http://dx.doi.org/10.1371/journal.pone.0017845
_version_ 1782199740079800320
author Zhao, Xi
Rødland, Einar Andreas
Sørlie, Therese
Naume, Bjørn
Langerød, Anita
Frigessi, Arnoldo
Kristensen, Vessela N.
Børresen-Dale, Anne-Lise
Lingjærde, Ole Christian
author_facet Zhao, Xi
Rødland, Einar Andreas
Sørlie, Therese
Naume, Bjørn
Langerød, Anita
Frigessi, Arnoldo
Kristensen, Vessela N.
Børresen-Dale, Anne-Lise
Lingjærde, Ole Christian
author_sort Zhao, Xi
collection PubMed
description BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. PRINCIPAL FINDINGS: To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. CONCLUSION: Combining the predictive strength of multiple gene signatures improves prediction of breast cancer survival. The presented methodology is broadly applicable to breast cancer risk assessment using any new identified gene set.
format Text
id pubmed-3053398
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-30533982011-03-18 Combining Gene Signatures Improves Prediction of Breast Cancer Survival Zhao, Xi Rødland, Einar Andreas Sørlie, Therese Naume, Bjørn Langerød, Anita Frigessi, Arnoldo Kristensen, Vessela N. Børresen-Dale, Anne-Lise Lingjærde, Ole Christian PLoS One Research Article BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. PRINCIPAL FINDINGS: To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. CONCLUSION: Combining the predictive strength of multiple gene signatures improves prediction of breast cancer survival. The presented methodology is broadly applicable to breast cancer risk assessment using any new identified gene set. Public Library of Science 2011-03-10 /pmc/articles/PMC3053398/ /pubmed/21423775 http://dx.doi.org/10.1371/journal.pone.0017845 Text en Zhao et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Zhao, Xi
Rødland, Einar Andreas
Sørlie, Therese
Naume, Bjørn
Langerød, Anita
Frigessi, Arnoldo
Kristensen, Vessela N.
Børresen-Dale, Anne-Lise
Lingjærde, Ole Christian
Combining Gene Signatures Improves Prediction of Breast Cancer Survival
title Combining Gene Signatures Improves Prediction of Breast Cancer Survival
title_full Combining Gene Signatures Improves Prediction of Breast Cancer Survival
title_fullStr Combining Gene Signatures Improves Prediction of Breast Cancer Survival
title_full_unstemmed Combining Gene Signatures Improves Prediction of Breast Cancer Survival
title_short Combining Gene Signatures Improves Prediction of Breast Cancer Survival
title_sort combining gene signatures improves prediction of breast cancer survival
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3053398/
https://www.ncbi.nlm.nih.gov/pubmed/21423775
http://dx.doi.org/10.1371/journal.pone.0017845
work_keys_str_mv AT zhaoxi combininggenesignaturesimprovespredictionofbreastcancersurvival
AT rødlandeinarandreas combininggenesignaturesimprovespredictionofbreastcancersurvival
AT sørlietherese combininggenesignaturesimprovespredictionofbreastcancersurvival
AT naumebjørn combininggenesignaturesimprovespredictionofbreastcancersurvival
AT langerødanita combininggenesignaturesimprovespredictionofbreastcancersurvival
AT frigessiarnoldo combininggenesignaturesimprovespredictionofbreastcancersurvival
AT kristensenvesselan combininggenesignaturesimprovespredictionofbreastcancersurvival
AT børresendaleannelise combininggenesignaturesimprovespredictionofbreastcancersurvival
AT lingjærdeolechristian combininggenesignaturesimprovespredictionofbreastcancersurvival