Cargando…

Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context

BACKGROUND: Different microarray studies have compiled gene lists for predicting outcomes of a range of treatments and diseases. These have produced gene lists that have little overlap, indicating that the results from any one study are unstable. It has been suggested that the underlying pathways ar...

Descripción completa

Detalles Bibliográficos
Autores principales: Abraham, Gad, Kowalczyk, Adam, Loi, Sherene, Haviv, Izhak, Zobel, Justin
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2010
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2895626/
https://www.ncbi.nlm.nih.gov/pubmed/20500821
http://dx.doi.org/10.1186/1471-2105-11-277
_version_ 1782183271455522816
author Abraham, Gad
Kowalczyk, Adam
Loi, Sherene
Haviv, Izhak
Zobel, Justin
author_facet Abraham, Gad
Kowalczyk, Adam
Loi, Sherene
Haviv, Izhak
Zobel, Justin
author_sort Abraham, Gad
collection PubMed
description BACKGROUND: Different microarray studies have compiled gene lists for predicting outcomes of a range of treatments and diseases. These have produced gene lists that have little overlap, indicating that the results from any one study are unstable. It has been suggested that the underlying pathways are essentially identical, and that the expression of gene sets, rather than that of individual genes, may be more informative with respect to prognosis and understanding of the underlying biological process. RESULTS: We sought to examine the stability of prognostic signatures based on gene sets rather than individual genes. We classified breast cancer cases from five microarray studies according to the risk of metastasis, using features derived from predefined gene sets. The expression levels of genes in the sets are aggregated, using what we call a set statistic. The resulting prognostic gene sets were as predictive as the lists of individual genes, but displayed more consistent rankings via bootstrap replications within datasets, produced more stable classifiers across different datasets, and are potentially more interpretable in the biological context since they examine gene expression in the context of their neighbouring genes in the pathway. In addition, we performed this analysis in each breast cancer molecular subtype, based on ER/HER2 status. The prognostic gene sets found in each subtype were consistent with the biology based on previous analysis of individual genes. CONCLUSIONS: To date, most analyses of gene expression data have focused at the level of the individual genes. We show that a complementary approach of examining the data using predefined gene sets can reduce the noise and could provide increased insight into the underlying biological pathways.
format Text
id pubmed-2895626
institution National Center for Biotechnology Information
language English
publishDate 2010
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-28956262010-07-02 Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context Abraham, Gad Kowalczyk, Adam Loi, Sherene Haviv, Izhak Zobel, Justin BMC Bioinformatics Research article BACKGROUND: Different microarray studies have compiled gene lists for predicting outcomes of a range of treatments and diseases. These have produced gene lists that have little overlap, indicating that the results from any one study are unstable. It has been suggested that the underlying pathways are essentially identical, and that the expression of gene sets, rather than that of individual genes, may be more informative with respect to prognosis and understanding of the underlying biological process. RESULTS: We sought to examine the stability of prognostic signatures based on gene sets rather than individual genes. We classified breast cancer cases from five microarray studies according to the risk of metastasis, using features derived from predefined gene sets. The expression levels of genes in the sets are aggregated, using what we call a set statistic. The resulting prognostic gene sets were as predictive as the lists of individual genes, but displayed more consistent rankings via bootstrap replications within datasets, produced more stable classifiers across different datasets, and are potentially more interpretable in the biological context since they examine gene expression in the context of their neighbouring genes in the pathway. In addition, we performed this analysis in each breast cancer molecular subtype, based on ER/HER2 status. The prognostic gene sets found in each subtype were consistent with the biology based on previous analysis of individual genes. CONCLUSIONS: To date, most analyses of gene expression data have focused at the level of the individual genes. We show that a complementary approach of examining the data using predefined gene sets can reduce the noise and could provide increased insight into the underlying biological pathways. BioMed Central 2010-05-25 /pmc/articles/PMC2895626/ /pubmed/20500821 http://dx.doi.org/10.1186/1471-2105-11-277 Text en Copyright ©2010 Abraham et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research article
Abraham, Gad
Kowalczyk, Adam
Loi, Sherene
Haviv, Izhak
Zobel, Justin
Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
title Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
title_full Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
title_fullStr Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
title_full_unstemmed Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
title_short Prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
title_sort prediction of breast cancer prognosis using gene set statistics provides signature stability and biological context
topic Research article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2895626/
https://www.ncbi.nlm.nih.gov/pubmed/20500821
http://dx.doi.org/10.1186/1471-2105-11-277
work_keys_str_mv AT abrahamgad predictionofbreastcancerprognosisusinggenesetstatisticsprovidessignaturestabilityandbiologicalcontext
AT kowalczykadam predictionofbreastcancerprognosisusinggenesetstatisticsprovidessignaturestabilityandbiologicalcontext
AT loisherene predictionofbreastcancerprognosisusinggenesetstatisticsprovidessignaturestabilityandbiologicalcontext
AT havivizhak predictionofbreastcancerprognosisusinggenesetstatisticsprovidessignaturestabilityandbiologicalcontext
AT zobeljustin predictionofbreastcancerprognosisusinggenesetstatisticsprovidessignaturestabilityandbiologicalcontext