Cargando…
An eScience-Bayes strategy for analyzing omics data
BACKGROUND: The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of mo...
Autores principales: | , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2010
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2887810/ https://www.ncbi.nlm.nih.gov/pubmed/20504364 http://dx.doi.org/10.1186/1471-2105-11-282 |
_version_ | 1782182588487565312 |
---|---|
author | Eklund, Martin Spjuth, Ola Wikberg, Jarl ES |
author_facet | Eklund, Martin Spjuth, Ola Wikberg, Jarl ES |
author_sort | Eklund, Martin |
collection | PubMed |
description | BACKGROUND: The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems. RESULTS: We present a general approach to omics data analysis that alleviates these problems. By combining eScience and Bayesian methods, we retrieve scientific information and data from multiple sources and coherently incorporate them into large models. These models improve the accuracy of predictions and offer new insights into the underlying mechanisms. This "eScience-Bayes" approach is demonstrated in two proof-of-principle applications, one for breast cancer prognosis prediction from transcriptomic data and one for protein-protein interaction studies based on proteomic data. CONCLUSIONS: Bayesian statistics provide the flexibility to tailor statistical models to the complex data structures in omics biology as well as permitting coherent integration of multiple data sources. However, Bayesian methods are in general computationally demanding and require specification of possibly thousands of prior distributions. eScience can help us overcome these difficulties. The eScience-Bayes thus approach permits us to fully leverage on the advantages of Bayesian methods, resulting in models with improved predictive performance that gives more information about the underlying biological system. |
format | Text |
id | pubmed-2887810 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2010 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-28878102010-06-19 An eScience-Bayes strategy for analyzing omics data Eklund, Martin Spjuth, Ola Wikberg, Jarl ES BMC Bioinformatics Methodology article BACKGROUND: The omics fields promise to revolutionize our understanding of biology and biomedicine. However, their potential is compromised by the challenge to analyze the huge datasets produced. Analysis of omics data is plagued by the curse of dimensionality, resulting in imprecise estimates of model parameters and performance. Moreover, the integration of omics data with other data sources is difficult to shoehorn into classical statistical models. This has resulted in ad hoc approaches to address specific problems. RESULTS: We present a general approach to omics data analysis that alleviates these problems. By combining eScience and Bayesian methods, we retrieve scientific information and data from multiple sources and coherently incorporate them into large models. These models improve the accuracy of predictions and offer new insights into the underlying mechanisms. This "eScience-Bayes" approach is demonstrated in two proof-of-principle applications, one for breast cancer prognosis prediction from transcriptomic data and one for protein-protein interaction studies based on proteomic data. CONCLUSIONS: Bayesian statistics provide the flexibility to tailor statistical models to the complex data structures in omics biology as well as permitting coherent integration of multiple data sources. However, Bayesian methods are in general computationally demanding and require specification of possibly thousands of prior distributions. eScience can help us overcome these difficulties. The eScience-Bayes thus approach permits us to fully leverage on the advantages of Bayesian methods, resulting in models with improved predictive performance that gives more information about the underlying biological system. BioMed Central 2010-05-26 /pmc/articles/PMC2887810/ /pubmed/20504364 http://dx.doi.org/10.1186/1471-2105-11-282 Text en Copyright ©2010 Eklund et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology article Eklund, Martin Spjuth, Ola Wikberg, Jarl ES An eScience-Bayes strategy for analyzing omics data |
title | An eScience-Bayes strategy for analyzing omics data |
title_full | An eScience-Bayes strategy for analyzing omics data |
title_fullStr | An eScience-Bayes strategy for analyzing omics data |
title_full_unstemmed | An eScience-Bayes strategy for analyzing omics data |
title_short | An eScience-Bayes strategy for analyzing omics data |
title_sort | escience-bayes strategy for analyzing omics data |
topic | Methodology article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2887810/ https://www.ncbi.nlm.nih.gov/pubmed/20504364 http://dx.doi.org/10.1186/1471-2105-11-282 |
work_keys_str_mv | AT eklundmartin anesciencebayesstrategyforanalyzingomicsdata AT spjuthola anesciencebayesstrategyforanalyzingomicsdata AT wikbergjarles anesciencebayesstrategyforanalyzingomicsdata AT eklundmartin esciencebayesstrategyforanalyzingomicsdata AT spjuthola esciencebayesstrategyforanalyzingomicsdata AT wikbergjarles esciencebayesstrategyforanalyzingomicsdata |