Cargando…
Challenges and Opportunities for Bayesian Statistics in Proteomics
[Image: see text] Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approach...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
American Chemical Society
2022
|
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982455/ https://www.ncbi.nlm.nih.gov/pubmed/35258980 http://dx.doi.org/10.1021/acs.jproteome.1c00859 |
_version_ | 1784681815821254656 |
---|---|
author | Crook, Oliver M. Chung, Chun-wa Deane, Charlotte M. |
author_facet | Crook, Oliver M. Chung, Chun-wa Deane, Charlotte M. |
author_sort | Crook, Oliver M. |
collection | PubMed |
description | [Image: see text] Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approaches only produce a point estimate, such as a mean, leaving little room for more nuanced interpretations. By contrast, Bayesian statistics allows quantification of uncertainty through the use of probability distributions. These probability distributions enable scientists to ask complex questions of their proteomics data. Bayesian statistics also offers a modular framework for data analysis by making dependencies between data and parameters explicit. Hence, specifying complex hierarchies of parameter dependencies is straightforward in the Bayesian framework. This allows us to use a statistical methodology which equals, rather than neglects, the sophistication of experimental design and instrumentation present in proteomics. Here, we review Bayesian methods applied to proteomics, demonstrating their potential power, alongside the challenges posed by adopting this new statistical framework. To illustrate our review, we give a walk-through of the development of a Bayesian model for dynamic organic orthogonal phase-separation (OOPS) data. |
format | Online Article Text |
id | pubmed-8982455 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | American Chemical Society |
record_format | MEDLINE/PubMed |
spelling | pubmed-89824552023-03-08 Challenges and Opportunities for Bayesian Statistics in Proteomics Crook, Oliver M. Chung, Chun-wa Deane, Charlotte M. J Proteome Res [Image: see text] Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approaches only produce a point estimate, such as a mean, leaving little room for more nuanced interpretations. By contrast, Bayesian statistics allows quantification of uncertainty through the use of probability distributions. These probability distributions enable scientists to ask complex questions of their proteomics data. Bayesian statistics also offers a modular framework for data analysis by making dependencies between data and parameters explicit. Hence, specifying complex hierarchies of parameter dependencies is straightforward in the Bayesian framework. This allows us to use a statistical methodology which equals, rather than neglects, the sophistication of experimental design and instrumentation present in proteomics. Here, we review Bayesian methods applied to proteomics, demonstrating their potential power, alongside the challenges posed by adopting this new statistical framework. To illustrate our review, we give a walk-through of the development of a Bayesian model for dynamic organic orthogonal phase-separation (OOPS) data. American Chemical Society 2022-03-08 2022-04-01 /pmc/articles/PMC8982455/ /pubmed/35258980 http://dx.doi.org/10.1021/acs.jproteome.1c00859 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/). |
spellingShingle | Crook, Oliver M. Chung, Chun-wa Deane, Charlotte M. Challenges and Opportunities for Bayesian Statistics in Proteomics |
title | Challenges and
Opportunities for Bayesian Statistics
in Proteomics |
title_full | Challenges and
Opportunities for Bayesian Statistics
in Proteomics |
title_fullStr | Challenges and
Opportunities for Bayesian Statistics
in Proteomics |
title_full_unstemmed | Challenges and
Opportunities for Bayesian Statistics
in Proteomics |
title_short | Challenges and
Opportunities for Bayesian Statistics
in Proteomics |
title_sort | challenges and
opportunities for bayesian statistics
in proteomics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982455/ https://www.ncbi.nlm.nih.gov/pubmed/35258980 http://dx.doi.org/10.1021/acs.jproteome.1c00859 |
work_keys_str_mv | AT crookoliverm challengesandopportunitiesforbayesianstatisticsinproteomics AT chungchunwa challengesandopportunitiesforbayesianstatisticsinproteomics AT deanecharlottem challengesandopportunitiesforbayesianstatisticsinproteomics |