Cargando…

Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data

BACKGROUND: Microarray technology is increasingly used to identify potential biomarkers for cancer prognostics and diagnostics. Previously, we have developed the iterative Bayesian Model Averaging (BMA) algorithm for use in classification. Here, we extend the iterative BMA algorithm for application...

Descripción completa

Detalles Bibliográficos
Autores principales: Annest, Amalia, Bumgarner, Roger E, Raftery, Adrian E, Yeung, Ka Yee
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657791/
https://www.ncbi.nlm.nih.gov/pubmed/19245714
http://dx.doi.org/10.1186/1471-2105-10-72
_version_ 1782165615934439424
author Annest, Amalia
Bumgarner, Roger E
Raftery, Adrian E
Yeung, Ka Yee
author_facet Annest, Amalia
Bumgarner, Roger E
Raftery, Adrian E
Yeung, Ka Yee
author_sort Annest, Amalia
collection PubMed
description BACKGROUND: Microarray technology is increasingly used to identify potential biomarkers for cancer prognostics and diagnostics. Previously, we have developed the iterative Bayesian Model Averaging (BMA) algorithm for use in classification. Here, we extend the iterative BMA algorithm for application to survival analysis on high-dimensional microarray data. The main goal in applying survival analysis to microarray data is to determine a highly predictive model of patients' time to event (such as death, relapse, or metastasis) using a small number of selected genes. Our multivariate procedure combines the effectiveness of multiple contending models by calculating the weighted average of their posterior probability distributions. Our results demonstrate that our iterative BMA algorithm for survival analysis achieves high prediction accuracy while consistently selecting a small and cost-effective number of predictor genes. RESULTS: We applied the iterative BMA algorithm to two cancer datasets: breast cancer and diffuse large B-cell lymphoma (DLBCL) data. On the breast cancer data, the algorithm selected a total of 15 predictor genes across 84 contending models from the training data. The maximum likelihood estimates of the selected genes and the posterior probabilities of the selected models from the training data were used to divide patients in the test (or validation) dataset into high- and low-risk categories. Using the genes and models determined from the training data, we assigned patients from the test data into highly distinct risk groups (as indicated by a p-value of 7.26e-05 from the log-rank test). Moreover, we achieved comparable results using only the 5 top selected genes with 100% posterior probabilities. On the DLBCL data, our iterative BMA procedure selected a total of 25 genes across 3 contending models from the training data. Once again, we assigned the patients in the validation set to significantly distinct risk groups (p-value = 0.00139). CONCLUSION: The strength of the iterative BMA algorithm for survival analysis lies in its ability to account for model uncertainty. The results from this study demonstrate that our procedure selects a small number of genes while eclipsing other methods in predictive performance, making it a highly accurate and cost-effective prognostic tool in the clinical setting.
format Text
id pubmed-2657791
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26577912009-03-19 Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data Annest, Amalia Bumgarner, Roger E Raftery, Adrian E Yeung, Ka Yee BMC Bioinformatics Methodology Article BACKGROUND: Microarray technology is increasingly used to identify potential biomarkers for cancer prognostics and diagnostics. Previously, we have developed the iterative Bayesian Model Averaging (BMA) algorithm for use in classification. Here, we extend the iterative BMA algorithm for application to survival analysis on high-dimensional microarray data. The main goal in applying survival analysis to microarray data is to determine a highly predictive model of patients' time to event (such as death, relapse, or metastasis) using a small number of selected genes. Our multivariate procedure combines the effectiveness of multiple contending models by calculating the weighted average of their posterior probability distributions. Our results demonstrate that our iterative BMA algorithm for survival analysis achieves high prediction accuracy while consistently selecting a small and cost-effective number of predictor genes. RESULTS: We applied the iterative BMA algorithm to two cancer datasets: breast cancer and diffuse large B-cell lymphoma (DLBCL) data. On the breast cancer data, the algorithm selected a total of 15 predictor genes across 84 contending models from the training data. The maximum likelihood estimates of the selected genes and the posterior probabilities of the selected models from the training data were used to divide patients in the test (or validation) dataset into high- and low-risk categories. Using the genes and models determined from the training data, we assigned patients from the test data into highly distinct risk groups (as indicated by a p-value of 7.26e-05 from the log-rank test). Moreover, we achieved comparable results using only the 5 top selected genes with 100% posterior probabilities. On the DLBCL data, our iterative BMA procedure selected a total of 25 genes across 3 contending models from the training data. Once again, we assigned the patients in the validation set to significantly distinct risk groups (p-value = 0.00139). CONCLUSION: The strength of the iterative BMA algorithm for survival analysis lies in its ability to account for model uncertainty. The results from this study demonstrate that our procedure selects a small number of genes while eclipsing other methods in predictive performance, making it a highly accurate and cost-effective prognostic tool in the clinical setting. BioMed Central 2009-02-26 /pmc/articles/PMC2657791/ /pubmed/19245714 http://dx.doi.org/10.1186/1471-2105-10-72 Text en Copyright © 2009 Annest et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Annest, Amalia
Bumgarner, Roger E
Raftery, Adrian E
Yeung, Ka Yee
Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
title Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
title_full Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
title_fullStr Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
title_full_unstemmed Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
title_short Iterative Bayesian Model Averaging: a method for the application of survival analysis to high-dimensional microarray data
title_sort iterative bayesian model averaging: a method for the application of survival analysis to high-dimensional microarray data
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2657791/
https://www.ncbi.nlm.nih.gov/pubmed/19245714
http://dx.doi.org/10.1186/1471-2105-10-72
work_keys_str_mv AT annestamalia iterativebayesianmodelaveragingamethodfortheapplicationofsurvivalanalysistohighdimensionalmicroarraydata
AT bumgarnerrogere iterativebayesianmodelaveragingamethodfortheapplicationofsurvivalanalysistohighdimensionalmicroarraydata
AT rafteryadriane iterativebayesianmodelaveragingamethodfortheapplicationofsurvivalanalysistohighdimensionalmicroarraydata
AT yeungkayee iterativebayesianmodelaveragingamethodfortheapplicationofsurvivalanalysistohighdimensionalmicroarraydata