Cargando…

Challenges and Opportunities for Bayesian Statistics in Proteomics

[Image: see text] Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approach...

Descripción completa

Detalles Bibliográficos
Autores principales: Crook, Oliver M., Chung, Chun-wa, Deane, Charlotte M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: American Chemical Society 2022
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982455/
https://www.ncbi.nlm.nih.gov/pubmed/35258980
http://dx.doi.org/10.1021/acs.jproteome.1c00859
_version_ 1784681815821254656
author Crook, Oliver M.
Chung, Chun-wa
Deane, Charlotte M.
author_facet Crook, Oliver M.
Chung, Chun-wa
Deane, Charlotte M.
author_sort Crook, Oliver M.
collection PubMed
description [Image: see text] Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approaches only produce a point estimate, such as a mean, leaving little room for more nuanced interpretations. By contrast, Bayesian statistics allows quantification of uncertainty through the use of probability distributions. These probability distributions enable scientists to ask complex questions of their proteomics data. Bayesian statistics also offers a modular framework for data analysis by making dependencies between data and parameters explicit. Hence, specifying complex hierarchies of parameter dependencies is straightforward in the Bayesian framework. This allows us to use a statistical methodology which equals, rather than neglects, the sophistication of experimental design and instrumentation present in proteomics. Here, we review Bayesian methods applied to proteomics, demonstrating their potential power, alongside the challenges posed by adopting this new statistical framework. To illustrate our review, we give a walk-through of the development of a Bayesian model for dynamic organic orthogonal phase-separation (OOPS) data.
format Online
Article
Text
id pubmed-8982455
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher American Chemical Society
record_format MEDLINE/PubMed
spelling pubmed-89824552023-03-08 Challenges and Opportunities for Bayesian Statistics in Proteomics Crook, Oliver M. Chung, Chun-wa Deane, Charlotte M. J Proteome Res [Image: see text] Proteomics is a data-rich science with complex experimental designs and an intricate measurement process. To obtain insights from the large data sets produced, statistical methods, including machine learning, are routinely applied. For a quantity of interest, many of these approaches only produce a point estimate, such as a mean, leaving little room for more nuanced interpretations. By contrast, Bayesian statistics allows quantification of uncertainty through the use of probability distributions. These probability distributions enable scientists to ask complex questions of their proteomics data. Bayesian statistics also offers a modular framework for data analysis by making dependencies between data and parameters explicit. Hence, specifying complex hierarchies of parameter dependencies is straightforward in the Bayesian framework. This allows us to use a statistical methodology which equals, rather than neglects, the sophistication of experimental design and instrumentation present in proteomics. Here, we review Bayesian methods applied to proteomics, demonstrating their potential power, alongside the challenges posed by adopting this new statistical framework. To illustrate our review, we give a walk-through of the development of a Bayesian model for dynamic organic orthogonal phase-separation (OOPS) data. American Chemical Society 2022-03-08 2022-04-01 /pmc/articles/PMC8982455/ /pubmed/35258980 http://dx.doi.org/10.1021/acs.jproteome.1c00859 Text en © 2022 The Authors. Published by American Chemical Society https://creativecommons.org/licenses/by/4.0/Permits the broadest form of re-use including for commercial purposes, provided that author attribution and integrity are maintained (https://creativecommons.org/licenses/by/4.0/).
spellingShingle Crook, Oliver M.
Chung, Chun-wa
Deane, Charlotte M.
Challenges and Opportunities for Bayesian Statistics in Proteomics
title Challenges and Opportunities for Bayesian Statistics in Proteomics
title_full Challenges and Opportunities for Bayesian Statistics in Proteomics
title_fullStr Challenges and Opportunities for Bayesian Statistics in Proteomics
title_full_unstemmed Challenges and Opportunities for Bayesian Statistics in Proteomics
title_short Challenges and Opportunities for Bayesian Statistics in Proteomics
title_sort challenges and opportunities for bayesian statistics in proteomics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8982455/
https://www.ncbi.nlm.nih.gov/pubmed/35258980
http://dx.doi.org/10.1021/acs.jproteome.1c00859
work_keys_str_mv AT crookoliverm challengesandopportunitiesforbayesianstatisticsinproteomics
AT chungchunwa challengesandopportunitiesforbayesianstatisticsinproteomics
AT deanecharlottem challengesandopportunitiesforbayesianstatisticsinproteomics