Cargando…

Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials

BACKGROUND: Misinterpretations of the p-value in null-hypothesis statistical testing are common. We aimed to determine the implications of observed p-values in critical care randomized controlled trials (RCTs). METHODS: We included three cohorts of published RCTs: Adult-RCTs reporting a mortality ou...

Descripción completa

Detalles Bibliográficos
Autores principales: Nostedt, Sarah, Joffe, Ari R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: SAGE Publications 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9149268/
https://www.ncbi.nlm.nih.gov/pubmed/34841950
http://dx.doi.org/10.1177/08850666211053793
_version_ 1784717172954628096
author Nostedt, Sarah
Joffe, Ari R.
author_facet Nostedt, Sarah
Joffe, Ari R.
author_sort Nostedt, Sarah
collection PubMed
description BACKGROUND: Misinterpretations of the p-value in null-hypothesis statistical testing are common. We aimed to determine the implications of observed p-values in critical care randomized controlled trials (RCTs). METHODS: We included three cohorts of published RCTs: Adult-RCTs reporting a mortality outcome, Pediatric-RCTs reporting a mortality outcome, and recent Consecutive-RCTs reporting p-value ≤.10 in six higher-impact journals. We recorded descriptive information from RCTs. Reverse Bayesian implications of obtained p-values were calculated, reported as percentages with inter-quartile ranges. RESULTS: Obtained p-value was ≤.005 in 11/216 (5.1%) Adult-RCTs, 2/120 (1.7%) Pediatric-RCTs, and 37/90 (41.1%) Consecutive-RCTs. An obtained p-value .05–.0051 had high False Positive Rates; in Adult-RCTs, minimum (assuming prior probability of the alternative hypothesis was 50%) and realistic (assuming prior probability of the alternative hypothesis was 10%) False Positive Rates were 16.7% [11.2, 21.8] and 64.3% [53.2, 71.4]. An obtained p-value ≤.005 had lower False Positive Rates; in Adult-RCTs the realistic False Positive Rate was 7.7% [7.7, 16.0]. The realistic probability of the alternative hypothesis for obtained p-value .05–.0051 (ie, Positive Predictive Value) was 28.0% [24.1, 34.8], 30.6% [27.7, 48.5], 29.3% [24.3, 41.0], and 32.7% [24.1, 43.5] for Adult-RCTs, Pediatric-RCTs, Consecutive-RCTs primary and secondary outcome, respectively. The maximum Positive Predictive Value for p-value category .05–.0051 was median 77.8%, 79.8%, 78.8%, and 81.4% respectively. To have maximum or realistic Positive Predictive Value >90% or >80%, RCTs needed to have obtained p-value ≤.005. The credibility of p-value .05–.0051 findings were easy to challenge, and the credibility to rule-out an effect with p-value >.05 to .10 was low. The probability that a replication study would obtain p-value ≤.05 did not approach 90% unless the obtained p-value was ≤.005. CONCLUSIONS: Unless the obtained p-value was ≤.005, the False Positive Rate was high, and the Positive Predictive Value and probability of replication of “statistically significant” findings were low.
format Online
Article
Text
id pubmed-9149268
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher SAGE Publications
record_format MEDLINE/PubMed
spelling pubmed-91492682022-05-31 Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials Nostedt, Sarah Joffe, Ari R. J Intensive Care Med Original Research BACKGROUND: Misinterpretations of the p-value in null-hypothesis statistical testing are common. We aimed to determine the implications of observed p-values in critical care randomized controlled trials (RCTs). METHODS: We included three cohorts of published RCTs: Adult-RCTs reporting a mortality outcome, Pediatric-RCTs reporting a mortality outcome, and recent Consecutive-RCTs reporting p-value ≤.10 in six higher-impact journals. We recorded descriptive information from RCTs. Reverse Bayesian implications of obtained p-values were calculated, reported as percentages with inter-quartile ranges. RESULTS: Obtained p-value was ≤.005 in 11/216 (5.1%) Adult-RCTs, 2/120 (1.7%) Pediatric-RCTs, and 37/90 (41.1%) Consecutive-RCTs. An obtained p-value .05–.0051 had high False Positive Rates; in Adult-RCTs, minimum (assuming prior probability of the alternative hypothesis was 50%) and realistic (assuming prior probability of the alternative hypothesis was 10%) False Positive Rates were 16.7% [11.2, 21.8] and 64.3% [53.2, 71.4]. An obtained p-value ≤.005 had lower False Positive Rates; in Adult-RCTs the realistic False Positive Rate was 7.7% [7.7, 16.0]. The realistic probability of the alternative hypothesis for obtained p-value .05–.0051 (ie, Positive Predictive Value) was 28.0% [24.1, 34.8], 30.6% [27.7, 48.5], 29.3% [24.3, 41.0], and 32.7% [24.1, 43.5] for Adult-RCTs, Pediatric-RCTs, Consecutive-RCTs primary and secondary outcome, respectively. The maximum Positive Predictive Value for p-value category .05–.0051 was median 77.8%, 79.8%, 78.8%, and 81.4% respectively. To have maximum or realistic Positive Predictive Value >90% or >80%, RCTs needed to have obtained p-value ≤.005. The credibility of p-value .05–.0051 findings were easy to challenge, and the credibility to rule-out an effect with p-value >.05 to .10 was low. The probability that a replication study would obtain p-value ≤.05 did not approach 90% unless the obtained p-value was ≤.005. CONCLUSIONS: Unless the obtained p-value was ≤.005, the False Positive Rate was high, and the Positive Predictive Value and probability of replication of “statistically significant” findings were low. SAGE Publications 2021-11-29 2022-07 /pmc/articles/PMC9149268/ /pubmed/34841950 http://dx.doi.org/10.1177/08850666211053793 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by-nc/4.0/This article is distributed under the terms of the Creative Commons Attribution-NonCommercial 4.0 License (https://creativecommons.org/licenses/by-nc/4.0/) which permits non-commercial use, reproduction and distribution of the work without further permission provided the original work is attributed as specified on the SAGE and Open Access page (https://us.sagepub.com/en-us/nam/open-access-at-sage).
spellingShingle Original Research
Nostedt, Sarah
Joffe, Ari R.
Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials
title Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials
title_full Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials
title_fullStr Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials
title_full_unstemmed Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials
title_short Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials
title_sort reverse bayesian implications of p-values reported in critical care randomized trials
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9149268/
https://www.ncbi.nlm.nih.gov/pubmed/34841950
http://dx.doi.org/10.1177/08850666211053793
work_keys_str_mv AT nostedtsarah reversebayesianimplicationsofpvaluesreportedincriticalcarerandomizedtrials
AT joffearir reversebayesianimplicationsofpvaluesreportedincriticalcarerandomizedtrials