Cargando…
Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts
Univariate analyses of metabolomics data currently follow a frequentist approach, using p-values to reject a null hypothesis. We here propose the use of Bayesian statistics to quantify evidence supporting different hypotheses and discriminate between the null hypothesis versus the lack of statistica...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
MDPI
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10535181/ https://www.ncbi.nlm.nih.gov/pubmed/37755264 http://dx.doi.org/10.3390/metabo13090984 |
_version_ | 1785112569652969472 |
---|---|
author | Brydges, Christopher Che, Xiaoyu Lipkin, Walter Ian Fiehn, Oliver |
author_facet | Brydges, Christopher Che, Xiaoyu Lipkin, Walter Ian Fiehn, Oliver |
author_sort | Brydges, Christopher |
collection | PubMed |
description | Univariate analyses of metabolomics data currently follow a frequentist approach, using p-values to reject a null hypothesis. We here propose the use of Bayesian statistics to quantify evidence supporting different hypotheses and discriminate between the null hypothesis versus the lack of statistical power. We used metabolomics data from three independent human cohorts that studied the plasma signatures of subjects with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS). The data are publicly available, covering 84–197 subjects in each study with 562–888 identified metabolites of which 777 were common between the two studies and 93 were compounds reported in all three studies. We show how Bayesian statistics incorporates results from one study as “prior information” into the next study, thereby improving the overall assessment of the likelihood of finding specific differences between plasma metabolite levels. Using classic statistics and Benjamini–Hochberg FDR-corrections, Study 1 detected 18 metabolic differences and Study 2 detected no differences. Using Bayesian statistics on the same data, we found a high likelihood that 97 compounds were altered in concentration in Study 2, after using the results of Study 1 as the prior distributions. These findings included lower levels of peroxisome-produced ether-lipids, higher levels of long-chain unsaturated triacylglycerides, and the presence of exposome compounds that are explained by the difference in diet and medication between healthy subjects and ME/CFS patients. Although Study 3 reported only 92 compounds in common with the other two studies, these major differences were confirmed. We also found that prostaglandin F2alpha, a lipid mediator of physiological relevance, was reduced in ME/CFS patients across all three studies. The use of Bayesian statistics led to biological conclusions from metabolomic data that were not found through frequentist approaches. We propose that Bayesian statistics is highly useful for studies with similar research designs if similar metabolomic assays are used. |
format | Online Article Text |
id | pubmed-10535181 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | MDPI |
record_format | MEDLINE/PubMed |
spelling | pubmed-105351812023-09-29 Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts Brydges, Christopher Che, Xiaoyu Lipkin, Walter Ian Fiehn, Oliver Metabolites Article Univariate analyses of metabolomics data currently follow a frequentist approach, using p-values to reject a null hypothesis. We here propose the use of Bayesian statistics to quantify evidence supporting different hypotheses and discriminate between the null hypothesis versus the lack of statistical power. We used metabolomics data from three independent human cohorts that studied the plasma signatures of subjects with myalgic encephalomyelitis/chronic fatigue syndrome (ME/CFS). The data are publicly available, covering 84–197 subjects in each study with 562–888 identified metabolites of which 777 were common between the two studies and 93 were compounds reported in all three studies. We show how Bayesian statistics incorporates results from one study as “prior information” into the next study, thereby improving the overall assessment of the likelihood of finding specific differences between plasma metabolite levels. Using classic statistics and Benjamini–Hochberg FDR-corrections, Study 1 detected 18 metabolic differences and Study 2 detected no differences. Using Bayesian statistics on the same data, we found a high likelihood that 97 compounds were altered in concentration in Study 2, after using the results of Study 1 as the prior distributions. These findings included lower levels of peroxisome-produced ether-lipids, higher levels of long-chain unsaturated triacylglycerides, and the presence of exposome compounds that are explained by the difference in diet and medication between healthy subjects and ME/CFS patients. Although Study 3 reported only 92 compounds in common with the other two studies, these major differences were confirmed. We also found that prostaglandin F2alpha, a lipid mediator of physiological relevance, was reduced in ME/CFS patients across all three studies. The use of Bayesian statistics led to biological conclusions from metabolomic data that were not found through frequentist approaches. We propose that Bayesian statistics is highly useful for studies with similar research designs if similar metabolomic assays are used. MDPI 2023-08-31 /pmc/articles/PMC10535181/ /pubmed/37755264 http://dx.doi.org/10.3390/metabo13090984 Text en © 2023 by the authors. https://creativecommons.org/licenses/by/4.0/Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Article Brydges, Christopher Che, Xiaoyu Lipkin, Walter Ian Fiehn, Oliver Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts |
title | Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts |
title_full | Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts |
title_fullStr | Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts |
title_full_unstemmed | Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts |
title_short | Bayesian Statistics Improves Biological Interpretability of Metabolomics Data from Human Cohorts |
title_sort | bayesian statistics improves biological interpretability of metabolomics data from human cohorts |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10535181/ https://www.ncbi.nlm.nih.gov/pubmed/37755264 http://dx.doi.org/10.3390/metabo13090984 |
work_keys_str_mv | AT brydgeschristopher bayesianstatisticsimprovesbiologicalinterpretabilityofmetabolomicsdatafromhumancohorts AT chexiaoyu bayesianstatisticsimprovesbiologicalinterpretabilityofmetabolomicsdatafromhumancohorts AT lipkinwalterian bayesianstatisticsimprovesbiologicalinterpretabilityofmetabolomicsdatafromhumancohorts AT fiehnoliver bayesianstatisticsimprovesbiologicalinterpretabilityofmetabolomicsdatafromhumancohorts |