Cargando…
The P Value Line Dance: When Does the Music Stop?
When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not sp...
Autor principal: | |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
JMIR Publications
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7484773/ https://www.ncbi.nlm.nih.gov/pubmed/32852275 http://dx.doi.org/10.2196/21345 |
_version_ | 1783581040591241216 |
---|---|
author | Bendtsen, Marcus |
author_facet | Bendtsen, Marcus |
author_sort | Bendtsen, Marcus |
collection | PubMed |
description | When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not spend P values on interim analyses. However, the endeavor to dichotomize evidence into significant and nonsignificant has led to the basic driving force of science, namely uncertainty, to take a back seat. In this viewpoint we discuss that if testing the null hypothesis is the ultimate goal of science, then we need not worry about writing protocols, consider ethics, apply for funding, or run any experiments at all—all null hypotheses will be rejected at some point—everything has an effect. The job of science should be to unearth the uncertainties of the effects of treatments, not to test their difference from zero. We also show the fickleness of P values, how they may one day point to statistically significant results; and after a few more participants have been recruited, the once statistically significant effect suddenly disappears. We show plots which we hope would intuitively highlight that all assessments of evidence will fluctuate over time. Finally, we discuss the remedy in the form of Bayesian methods, where uncertainty leads; and which allows for continuous decision making to stop or continue recruitment, as new data from a trial is accumulated. |
format | Online Article Text |
id | pubmed-7484773 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | JMIR Publications |
record_format | MEDLINE/PubMed |
spelling | pubmed-74847732020-09-21 The P Value Line Dance: When Does the Music Stop? Bendtsen, Marcus J Med Internet Res Viewpoint When should a trial stop? Such a seemingly innocent question evokes concerns of type I and II errors among those who believe that certainty can be the product of uncertainty and among researchers who have been told that they need to carefully calculate sample sizes, consider multiplicity, and not spend P values on interim analyses. However, the endeavor to dichotomize evidence into significant and nonsignificant has led to the basic driving force of science, namely uncertainty, to take a back seat. In this viewpoint we discuss that if testing the null hypothesis is the ultimate goal of science, then we need not worry about writing protocols, consider ethics, apply for funding, or run any experiments at all—all null hypotheses will be rejected at some point—everything has an effect. The job of science should be to unearth the uncertainties of the effects of treatments, not to test their difference from zero. We also show the fickleness of P values, how they may one day point to statistically significant results; and after a few more participants have been recruited, the once statistically significant effect suddenly disappears. We show plots which we hope would intuitively highlight that all assessments of evidence will fluctuate over time. Finally, we discuss the remedy in the form of Bayesian methods, where uncertainty leads; and which allows for continuous decision making to stop or continue recruitment, as new data from a trial is accumulated. JMIR Publications 2020-08-27 /pmc/articles/PMC7484773/ /pubmed/32852275 http://dx.doi.org/10.2196/21345 Text en ©Marcus Bendtsen. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 27.08.2020. https://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (https://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work, first published in the Journal of Medical Internet Research, is properly cited. The complete bibliographic information, a link to the original publication on http://www.jmir.org/, as well as this copyright and license information must be included. |
spellingShingle | Viewpoint Bendtsen, Marcus The P Value Line Dance: When Does the Music Stop? |
title | The P Value Line Dance: When Does the Music Stop? |
title_full | The P Value Line Dance: When Does the Music Stop? |
title_fullStr | The P Value Line Dance: When Does the Music Stop? |
title_full_unstemmed | The P Value Line Dance: When Does the Music Stop? |
title_short | The P Value Line Dance: When Does the Music Stop? |
title_sort | p value line dance: when does the music stop? |
topic | Viewpoint |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7484773/ https://www.ncbi.nlm.nih.gov/pubmed/32852275 http://dx.doi.org/10.2196/21345 |
work_keys_str_mv | AT bendtsenmarcus thepvaluelinedancewhendoesthemusicstop AT bendtsenmarcus pvaluelinedancewhendoesthemusicstop |