Cargando…

The P Value Line Dance: When Does the Music Stop?

When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not sp...

Descripción completa

Detalles Bibliográficos
Autor principal: Bendtsen, Marcus
Formato: Online Artículo Texto
Lenguaje:English
Publicado: JMIR Publications 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7484773/
https://www.ncbi.nlm.nih.gov/pubmed/32852275
http://dx.doi.org/10.2196/21345
_version_ 1783581040591241216
author Bendtsen, Marcus
author_facet Bendtsen, Marcus
author_sort Bendtsen, Marcus
collection PubMed
description When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not spend P values on interim analyses. However, the endeavor to dichotomize evidence into significant and nonsignificant has led to the basic driving force of science, namely uncertainty, to take a back seat. In this viewpoint we discuss that if testing the null hypothesis is the ultimate goal of science, then we need not worry about writing protocols, consider ethics, apply for funding, or run any experiments at all—all null hypotheses will be rejected at some point—everything has an effect. The job of science should be to unearth the uncertainties of the effects of treatments, not to test their difference from zero. We also show the fickleness of P values, how they may one day point to statistically significant results; and after a few more participants have been recruited, the once statistically significant effect suddenly disappears. We show plots which we hope would intuitively highlight that all assessments of evidence will fluctuate over time. Finally, we discuss the remedy in the form of Bayesian methods, where uncertainty leads; and which allows for continuous decision making to stop or continue recruitment, as new data from a trial is accumulated.
format Online
Article
Text
id pubmed-7484773
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher JMIR Publications
record_format MEDLINE/PubMed
spelling pubmed-74847732020-09-21 The P Value Line Dance: When Does the Music Stop? Bendtsen, Marcus J Med Internet Res Viewpoint When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not spend P values on interim analyses. However, the endeavor to dichotomize evidence into significant and nonsignificant has led to the basic driving force of science, namely uncertainty, to take a back seat. In this viewpoint we discuss that if testing the null hypothesis is the ultimate goal of science, then we need not worry about writing protocols, consider ethics, apply for funding, or run any experiments at all—all null hypotheses will be rejected at some point—everything has an effect. The job of science should be to unearth the uncertainties of the effects of treatments, not to test their difference from zero. We also show the fickleness of P values, how they may one day point to statistically significant results; and after a few more participants have been recruited, the once statistically significant effect suddenly disappears. We show plots which we hope would intuitively highlight that all assessments of evidence will fluctuate over time. Finally, we discuss the remedy in the form of Bayesian methods, where uncertainty leads; and which allows for continuous decision making to stop or continue recruitment, as new data from a trial is accumulated. JMIR Publications 2020-08-27 /pmc/articles/PMC7484773/ /pubmed/32852275 http://dx.doi.org/10.2196/21345 Text en ©Marcus Bendtsen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 27.08.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included.
spellingShingle Viewpoint
Bendtsen, Marcus
The P Value Line Dance: When Does the Music Stop?
title The P Value Line Dance: When Does the Music Stop?
title_full The P Value Line Dance: When Does the Music Stop?
title_fullStr The P Value Line Dance: When Does the Music Stop?
title_full_unstemmed The P Value Line Dance: When Does the Music Stop?
title_short The P Value Line Dance: When Does the Music Stop?
title_sort p value line dance: when does the music stop?
topic Viewpoint
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7484773/
https://www.ncbi.nlm.nih.gov/pubmed/32852275
http://dx.doi.org/10.2196/21345
work_keys_str_mv AT bendtsenmarcus thepvaluelinedancewhendoesthemusicstop
AT bendtsenmarcus pvaluelinedancewhendoesthemusicstop