Cargando…

Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations

Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples...

Descripción completa

Detalles Bibliográficos
Autores principales: Suvitaival, Tommi, Rogers, Simon, Kaski, Samuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147908/
https://www.ncbi.nlm.nih.gov/pubmed/25161234
http://dx.doi.org/10.1093/bioinformatics/btu455
Descripción
Sumario:Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples, which in many cases renders them unreliable as biomarkers. However, the levels of similar compounds are typically highly correlated, which is a phenomenon that we model in this work. Results: We propose a hierarchical Bayesian model for inferring differences between groups of samples more accurately in metabolomic studies, where the observed compounds are collinear. We discover that the method decreases the error of weak and non-existent covariate effects, and thereby reduces false-positive findings. To achieve this, the method makes use of the mass spectral peak data by clustering similar peaks into latent compounds, and by further clustering latent compounds into groups that respond in a coherent way to the experimental covariates. We demonstrate the method with three simulated studies and validate it with a metabolomic benchmark dataset. Availability and implementation: An implementation in R is available at http://research.ics.aalto.fi/mi/software/peakANOVA/. Contact: samuel.kaski@aalto.fi.