Cargando…

A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions

Testing the equality of two proportions is a common procedure in science, especially in medicine and public health. In these domains, it is crucial to be able to quantify evidence for the absence of a treatment effect. Bayesian hypothesis testing by means of the Bayes factor provides one avenue to d...

Descripción completa

Detalles Bibliográficos
Autores principales: Dablander, Fabian, Huth, Karoline, Gronau, Quentin F., Etz, Alexander, Wagenmakers, Eric‐Jan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9299731/
https://www.ncbi.nlm.nih.gov/pubmed/34897784
http://dx.doi.org/10.1002/sim.9278
_version_ 1784751043514466304
author Dablander, Fabian
Huth, Karoline
Gronau, Quentin F.
Etz, Alexander
Wagenmakers, Eric‐Jan
author_facet Dablander, Fabian
Huth, Karoline
Gronau, Quentin F.
Etz, Alexander
Wagenmakers, Eric‐Jan
author_sort Dablander, Fabian
collection PubMed
description Testing the equality of two proportions is a common procedure in science, especially in medicine and public health. In these domains, it is crucial to be able to quantify evidence for the absence of a treatment effect. Bayesian hypothesis testing by means of the Bayes factor provides one avenue to do so, requiring the specification of prior distributions for parameters. The most popular analysis approach views the comparison of proportions from a contingency table perspective, assigning prior distributions directly to the two proportions. Another, less popular approach views the problem from a logistic regression perspective, assigning prior distributions to logit‐transformed parameters. Reanalyzing 39 null results from the New England Journal of Medicine with both approaches, we find that they can lead to markedly different conclusions, especially when the observed proportions are at the extremes (ie, very low or very high). We explain these stark differences and provide recommendations for researchers interested in testing the equality of two proportions and users of Bayes factors more generally. The test that assigns prior distributions to logit‐transformed parameters creates prior dependence between the two proportions and yields weaker evidence when the observations are at the extremes. When comparing two proportions, we argue that this test should become the new default.
format Online
Article
Text
id pubmed-9299731
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-92997312022-07-21 A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions Dablander, Fabian Huth, Karoline Gronau, Quentin F. Etz, Alexander Wagenmakers, Eric‐Jan Stat Med Research Articles Testing the equality of two proportions is a common procedure in science, especially in medicine and public health. In these domains, it is crucial to be able to quantify evidence for the absence of a treatment effect. Bayesian hypothesis testing by means of the Bayes factor provides one avenue to do so, requiring the specification of prior distributions for parameters. The most popular analysis approach views the comparison of proportions from a contingency table perspective, assigning prior distributions directly to the two proportions. Another, less popular approach views the problem from a logistic regression perspective, assigning prior distributions to logit‐transformed parameters. Reanalyzing 39 null results from the New England Journal of Medicine with both approaches, we find that they can lead to markedly different conclusions, especially when the observed proportions are at the extremes (ie, very low or very high). We explain these stark differences and provide recommendations for researchers interested in testing the equality of two proportions and users of Bayes factors more generally. The test that assigns prior distributions to logit‐transformed parameters creates prior dependence between the two proportions and yields weaker evidence when the observations are at the extremes. When comparing two proportions, we argue that this test should become the new default. John Wiley and Sons Inc. 2021-12-12 2022-04-15 /pmc/articles/PMC9299731/ /pubmed/34897784 http://dx.doi.org/10.1002/sim.9278 Text en © 2021 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. https://creativecommons.org/licenses/by-nc/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc/4.0/ (https://creativecommons.org/licenses/by-nc/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.
spellingShingle Research Articles
Dablander, Fabian
Huth, Karoline
Gronau, Quentin F.
Etz, Alexander
Wagenmakers, Eric‐Jan
A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions
title A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions
title_full A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions
title_fullStr A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions
title_full_unstemmed A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions
title_short A puzzle of proportions: Two popular Bayesian tests can yield dramatically different conclusions
title_sort puzzle of proportions: two popular bayesian tests can yield dramatically different conclusions
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9299731/
https://www.ncbi.nlm.nih.gov/pubmed/34897784
http://dx.doi.org/10.1002/sim.9278
work_keys_str_mv AT dablanderfabian apuzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT huthkaroline apuzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT gronauquentinf apuzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT etzalexander apuzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT wagenmakersericjan apuzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT dablanderfabian puzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT huthkaroline puzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT gronauquentinf puzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT etzalexander puzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions
AT wagenmakersericjan puzzleofproportionstwopopularbayesiantestscanyielddramaticallydifferentconclusions