Cargando…

Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations

Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples...

Descripción completa

Detalles Bibliográficos
Autores principales: Suvitaival, Tommi, Rogers, Simon, Kaski, Samuel
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147908/
https://www.ncbi.nlm.nih.gov/pubmed/25161234
http://dx.doi.org/10.1093/bioinformatics/btu455
_version_ 1782332535181672448
author Suvitaival, Tommi
Rogers, Simon
Kaski, Samuel
author_facet Suvitaival, Tommi
Rogers, Simon
Kaski, Samuel
author_sort Suvitaival, Tommi
collection PubMed
description Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples, which in many cases renders them unreliable as biomarkers. However, the levels of similar compounds are typically highly correlated, which is a phenomenon that we model in this work. Results: We propose a hierarchical Bayesian model for inferring differences between groups of samples more accurately in metabolomic studies, where the observed compounds are collinear. We discover that the method decreases the error of weak and non-existent covariate effects, and thereby reduces false-positive findings. To achieve this, the method makes use of the mass spectral peak data by clustering similar peaks into latent compounds, and by further clustering latent compounds into groups that respond in a coherent way to the experimental covariates. We demonstrate the method with three simulated studies and validate it with a metabolomic benchmark dataset. Availability and implementation: An implementation in R is available at http://research.ics.aalto.fi/mi/software/peakANOVA/. Contact: samuel.kaski@aalto.fi.
format Online
Article
Text
id pubmed-4147908
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-41479082014-09-02 Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations Suvitaival, Tommi Rogers, Simon Kaski, Samuel Bioinformatics Eccb 2014 Proceedings Papers Committee Motivation: Data analysis for metabolomics suffers from uncertainty because of the noisy measurement technology and the small sample size of experiments. Noise and the small sample size lead to a high probability of false findings. Further, individual compounds have natural variation between samples, which in many cases renders them unreliable as biomarkers. However, the levels of similar compounds are typically highly correlated, which is a phenomenon that we model in this work. Results: We propose a hierarchical Bayesian model for inferring differences between groups of samples more accurately in metabolomic studies, where the observed compounds are collinear. We discover that the method decreases the error of weak and non-existent covariate effects, and thereby reduces false-positive findings. To achieve this, the method makes use of the mass spectral peak data by clustering similar peaks into latent compounds, and by further clustering latent compounds into groups that respond in a coherent way to the experimental covariates. We demonstrate the method with three simulated studies and validate it with a metabolomic benchmark dataset. Availability and implementation: An implementation in R is available at http://research.ics.aalto.fi/mi/software/peakANOVA/. Contact: samuel.kaski@aalto.fi. Oxford University Press 2014-09-01 2014-08-22 /pmc/articles/PMC4147908/ /pubmed/25161234 http://dx.doi.org/10.1093/bioinformatics/btu455 Text en © The Author 2014. Published by Oxford University Press. http://creativecommons.org/licenses/by-nc/4.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle Eccb 2014 Proceedings Papers Committee
Suvitaival, Tommi
Rogers, Simon
Kaski, Samuel
Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
title Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
title_full Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
title_fullStr Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
title_full_unstemmed Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
title_short Stronger findings for metabolomics through Bayesian modeling of multiple peaks and compound correlations
title_sort stronger findings for metabolomics through bayesian modeling of multiple peaks and compound correlations
topic Eccb 2014 Proceedings Papers Committee
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4147908/
https://www.ncbi.nlm.nih.gov/pubmed/25161234
http://dx.doi.org/10.1093/bioinformatics/btu455
work_keys_str_mv AT suvitaivaltommi strongerfindingsformetabolomicsthroughbayesianmodelingofmultiplepeaksandcompoundcorrelations
AT rogerssimon strongerfindingsformetabolomicsthroughbayesianmodelingofmultiplepeaksandcompoundcorrelations
AT kaskisamuel strongerfindingsformetabolomicsthroughbayesianmodelingofmultiplepeaksandcompoundcorrelations