Cargando…
Combining Gene Signatures Improves Prediction of Breast Cancer Survival
BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specif...
Autores principales: | , , , , , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2011
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3053398/ https://www.ncbi.nlm.nih.gov/pubmed/21423775 http://dx.doi.org/10.1371/journal.pone.0017845 |
_version_ | 1782199740079800320 |
---|---|
author | Zhao, Xi Rødland, Einar Andreas Sørlie, Therese Naume, Bjørn Langerød, Anita Frigessi, Arnoldo Kristensen, Vessela N. Børresen-Dale, Anne-Lise Lingjærde, Ole Christian |
author_facet | Zhao, Xi Rødland, Einar Andreas Sørlie, Therese Naume, Bjørn Langerød, Anita Frigessi, Arnoldo Kristensen, Vessela N. Børresen-Dale, Anne-Lise Lingjærde, Ole Christian |
author_sort | Zhao, Xi |
collection | PubMed |
description | BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. PRINCIPAL FINDINGS: To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. CONCLUSION: Combining the predictive strength of multiple gene signatures improves prediction of breast cancer survival. The presented methodology is broadly applicable to breast cancer risk assessment using any new identified gene set. |
format | Text |
id | pubmed-3053398 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2011 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-30533982011-03-18 Combining Gene Signatures Improves Prediction of Breast Cancer Survival Zhao, Xi Rødland, Einar Andreas Sørlie, Therese Naume, Bjørn Langerød, Anita Frigessi, Arnoldo Kristensen, Vessela N. Børresen-Dale, Anne-Lise Lingjærde, Ole Christian PLoS One Research Article BACKGROUND: Several gene sets for prediction of breast cancer survival have been derived from whole-genome mRNA expression profiles. Here, we develop a statistical framework to explore whether combination of the information from such sets may improve prediction of recurrence and breast cancer specific death in early-stage breast cancers. Microarray data from two clinically similar cohorts of breast cancer patients are used as training (n = 123) and test set (n = 81), respectively. Gene sets from eleven previously published gene signatures are included in the study. PRINCIPAL FINDINGS: To investigate the relationship between breast cancer survival and gene expression on a particular gene set, a Cox proportional hazards model is applied using partial likelihood regression with an L2 penalty to avoid overfitting and using cross-validation to determine the penalty weight. The fitted models are applied to an independent test set to obtain a predicted risk for each individual and each gene set. Hierarchical clustering of the test individuals on the basis of the vector of predicted risks results in two clusters with distinct clinical characteristics in terms of the distribution of molecular subtypes, ER, PR status, TP53 mutation status and histological grade category, and associated with significantly different survival probabilities (recurrence: p = 0.005; breast cancer death: p = 0.014). Finally, principal components analysis of the gene signatures is used to derive combined predictors used to fit a new Cox model. This model classifies test individuals into two risk groups with distinct survival characteristics (recurrence: p = 0.003; breast cancer death: p = 0.001). The latter classifier outperforms all the individual gene signatures, as well as Cox models based on traditional clinical parameters and the Adjuvant! Online for survival prediction. CONCLUSION: Combining the predictive strength of multiple gene signatures improves prediction of breast cancer survival. The presented methodology is broadly applicable to breast cancer risk assessment using any new identified gene set. Public Library of Science 2011-03-10 /pmc/articles/PMC3053398/ /pubmed/21423775 http://dx.doi.org/10.1371/journal.pone.0017845 Text en Zhao et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Zhao, Xi Rødland, Einar Andreas Sørlie, Therese Naume, Bjørn Langerød, Anita Frigessi, Arnoldo Kristensen, Vessela N. Børresen-Dale, Anne-Lise Lingjærde, Ole Christian Combining Gene Signatures Improves Prediction of Breast Cancer Survival |
title | Combining Gene Signatures Improves Prediction of Breast Cancer Survival |
title_full | Combining Gene Signatures Improves Prediction of Breast Cancer Survival |
title_fullStr | Combining Gene Signatures Improves Prediction of Breast Cancer Survival |
title_full_unstemmed | Combining Gene Signatures Improves Prediction of Breast Cancer Survival |
title_short | Combining Gene Signatures Improves Prediction of Breast Cancer Survival |
title_sort | combining gene signatures improves prediction of breast cancer survival |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3053398/ https://www.ncbi.nlm.nih.gov/pubmed/21423775 http://dx.doi.org/10.1371/journal.pone.0017845 |
work_keys_str_mv | AT zhaoxi combininggenesignaturesimprovespredictionofbreastcancersurvival AT rødlandeinarandreas combininggenesignaturesimprovespredictionofbreastcancersurvival AT sørlietherese combininggenesignaturesimprovespredictionofbreastcancersurvival AT naumebjørn combininggenesignaturesimprovespredictionofbreastcancersurvival AT langerødanita combininggenesignaturesimprovespredictionofbreastcancersurvival AT frigessiarnoldo combininggenesignaturesimprovespredictionofbreastcancersurvival AT kristensenvesselan combininggenesignaturesimprovespredictionofbreastcancersurvival AT børresendaleannelise combininggenesignaturesimprovespredictionofbreastcancersurvival AT lingjærdeolechristian combininggenesignaturesimprovespredictionofbreastcancersurvival |