Cargando…
Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147908/ https://www.ncbi.nlm.nih.gov/pubmed/25161234 http://dx.doi.org/10.1093/bioinformatics/btu455 |
_version_ | 1782332535181672448 |
---|---|
author | Suvitaival, Tommi Rogers, Simon Kaski, Samuel |
author_facet | Suvitaival, Tommi Rogers, Simon Kaski, Samuel |
author_sort | Suvitaival, Tommi |
collection | PubMed |
description | Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples, which in many cases renders them unreliable as biomarkers. However, the levels of similar compounds are typically highly correlated, which is a phenomenon that we model in this work. Results: We propose a hierarchical Bayesian model for inferring differences between groups of samples more accurately in metabolomic studies, where the observed compounds are collinear. We discover that the method decreases the error of weak and non-existent covariate effects, and thereby reduces false-positive findings. To achieve this, the method makes use of the mass spectral peak data by clustering similar peaks into latent compounds, and by further clustering latent compounds into groups that respond in a coherent way to the experimental covariates. We demonstrate the method with three simulated studies and validate it with a metabolomic benchmark dataset. Availability and implementation: An implementation in R is available at http://research.ics.aalto.fi/mi/software/peakANOVA/. Contact: samuel.kaski@aalto.fi. |
format | Online Article Text |
id | pubmed-4147908 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-41479082014-09-02 Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations Suvitaival, Tommi Rogers, Simon Kaski, Samuel Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples, which in many cases renders them unreliable as biomarkers. However, the levels of similar compounds are typically highly correlated, which is a phenomenon that we model in this work. Results: We propose a hierarchical Bayesian model for inferring differences between groups of samples more accurately in metabolomic studies, where the observed compounds are collinear. We discover that the method decreases the error of weak and non-existent covariate effects, and thereby reduces false-positive findings. To achieve this, the method makes use of the mass spectral peak data by clustering similar peaks into latent compounds, and by further clustering latent compounds into groups that respond in a coherent way to the experimental covariates. We demonstrate the method with three simulated studies and validate it with a metabolomic benchmark dataset. Availability and implementation: An implementation in R is available at http://research.ics.aalto.fi/mi/software/peakANOVA/. Contact: samuel.kaski@aalto.fi. Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147908/ /pubmed/25161234 http://dx.doi.org/10.1093/bioinformatics/btu455 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Eccb 2014 Proceedings Papers Committee Suvitaival, Tommi Rogers, Simon Kaski, Samuel Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations |
title | Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations |
title_full | Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations |
title_fullStr | Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations |
title_full_unstemmed | Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations |
title_short | Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations |
title_sort | stronger findings for metabolomics through bayesian modeling of multiple peaks and compound correlations |
topic | Eccb 2014 Proceedings Papers Committee |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147908/ https://www.ncbi.nlm.nih.gov/pubmed/25161234 http://dx.doi.org/10.1093/bioinformatics/btu455 |
work_keys_str_mv | AT suvitaivaltommi strongerfindingsformetabolomicsthroughbayesianmodelingofmultiplepeaksandcompoundcorrelations AT rogerssimon strongerfindingsformetabolomicsthroughbayesianmodelingofmultiplepeaksandcompoundcorrelations AT kaskisamuel strongerfindingsformetabolomicsthroughbayesianmodelingofmultiplepeaksandcompoundcorrelations |