Cargando…
Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines
BACKGROUND: Recent advances in genome technologies and the subsequent collection of genomic information at various molecular resolutions hold promise to accelerate the discovery of new therapeutic targets. A critical step in achieving these goals is to develop efficient clinical prediction models th...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3726335/ https://www.ncbi.nlm.nih.gov/pubmed/23809014 http://dx.doi.org/10.1186/1687-4153-2013-9 |
_version_ | 1782278619531313152 |
---|---|
author | Srivastava, Sanvesh Wang, Wenyi Manyam, Ganiraju Ordonez, Carlos Baladandayuthapani, Veerabhadran |
author_facet | Srivastava, Sanvesh Wang, Wenyi Manyam, Ganiraju Ordonez, Carlos Baladandayuthapani, Veerabhadran |
author_sort | Srivastava, Sanvesh |
collection | PubMed |
description | BACKGROUND: Recent advances in genome technologies and the subsequent collection of genomic information at various molecular resolutions hold promise to accelerate the discovery of new therapeutic targets. A critical step in achieving these goals is to develop efficient clinical prediction models that integrate these diverse sources of high-throughput data. This step is challenging due to the presence of high-dimensionality and complex interactions in the data. For predicting relevant clinical outcomes, we propose a flexible statistical machine learning approach that acknowledges and models the interaction between platform-specific measurements through nonlinear kernel machines and borrows information within and between platforms through a hierarchical Bayesian framework. Our model has parameters with direct interpretations in terms of the effects of platforms and data interactions within and across platforms. The parameter estimation algorithm in our model uses a computationally efficient variational Bayes approach that scales well to large high-throughput datasets. RESULTS: We apply our methods of integrating gene/mRNA expression and microRNA profiles for predicting patient survival times to The Cancer Genome Atlas (TCGA) based glioblastoma multiforme (GBM) dataset. In terms of prediction accuracy, we show that our non-linear and interaction-based integrative methods perform better than linear alternatives and non-integrative methods that do not account for interactions between the platforms. We also find several prognostic mRNAs and microRNAs that are related to tumor invasion and are known to drive tumor metastasis and severe inflammatory response in GBM. In addition, our analysis reveals several interesting mRNA and microRNA interactions that have known implications in the etiology of GBM. CONCLUSIONS: Our approach gains its flexibility and power by modeling the non-linear interaction structures between and within the platforms. Our framework is a useful tool for biomedical researchers, since clinical prediction using multi-platform genomic information is an important step towards personalized treatment of many cancers. We have a freely available software at: http://odin.mdacc.tmc.edu/~vbaladan. |
format | Online Article Text |
id | pubmed-3726335 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-37263352013-07-31 Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines Srivastava, Sanvesh Wang, Wenyi Manyam, Ganiraju Ordonez, Carlos Baladandayuthapani, Veerabhadran EURASIP J Bioinform Syst Biol Research BACKGROUND: Recent advances in genome technologies and the subsequent collection of genomic information at various molecular resolutions hold promise to accelerate the discovery of new therapeutic targets. A critical step in achieving these goals is to develop efficient clinical prediction models that integrate these diverse sources of high-throughput data. This step is challenging due to the presence of high-dimensionality and complex interactions in the data. For predicting relevant clinical outcomes, we propose a flexible statistical machine learning approach that acknowledges and models the interaction between platform-specific measurements through nonlinear kernel machines and borrows information within and between platforms through a hierarchical Bayesian framework. Our model has parameters with direct interpretations in terms of the effects of platforms and data interactions within and across platforms. The parameter estimation algorithm in our model uses a computationally efficient variational Bayes approach that scales well to large high-throughput datasets. RESULTS: We apply our methods of integrating gene/mRNA expression and microRNA profiles for predicting patient survival times to The Cancer Genome Atlas (TCGA) based glioblastoma multiforme (GBM) dataset. In terms of prediction accuracy, we show that our non-linear and interaction-based integrative methods perform better than linear alternatives and non-integrative methods that do not account for interactions between the platforms. We also find several prognostic mRNAs and microRNAs that are related to tumor invasion and are known to drive tumor metastasis and severe inflammatory response in GBM. In addition, our analysis reveals several interesting mRNA and microRNA interactions that have known implications in the etiology of GBM. CONCLUSIONS: Our approach gains its flexibility and power by modeling the non-linear interaction structures between and within the platforms. Our framework is a useful tool for biomedical researchers, since clinical prediction using multi-platform genomic information is an important step towards personalized treatment of many cancers. We have a freely available software at: http://odin.mdacc.tmc.edu/~vbaladan. BioMed Central 2013 2013-06-28 /pmc/articles/PMC3726335/ /pubmed/23809014 http://dx.doi.org/10.1186/1687-4153-2013-9 Text en Copyright © 2013 Srivastava et al.; licensee Springer. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Srivastava, Sanvesh Wang, Wenyi Manyam, Ganiraju Ordonez, Carlos Baladandayuthapani, Veerabhadran Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines |
title | Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines |
title_full | Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines |
title_fullStr | Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines |
title_full_unstemmed | Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines |
title_short | Integrating multi-platform genomic data using hierarchical Bayesian relevance vector machines |
title_sort | integrating multi-platform genomic data using hierarchical bayesian relevance vector machines |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3726335/ https://www.ncbi.nlm.nih.gov/pubmed/23809014 http://dx.doi.org/10.1186/1687-4153-2013-9 |
work_keys_str_mv | AT srivastavasanvesh integratingmultiplatformgenomicdatausinghierarchicalbayesianrelevancevectormachines AT wangwenyi integratingmultiplatformgenomicdatausinghierarchicalbayesianrelevancevectormachines AT manyamganiraju integratingmultiplatformgenomicdatausinghierarchicalbayesianrelevancevectormachines AT ordonezcarlos integratingmultiplatformgenomicdatausinghierarchicalbayesianrelevancevectormachines AT baladandayuthapaniveerabhadran integratingmultiplatformgenomicdatausinghierarchicalbayesianrelevancevectormachines |