Cargando…

False-positive and false-negative risks for individual multicentre trials in critical care

BACKGROUND: In medical research, null hypothesis significance testing (NHST) is the dominant framework for statistical inference. NHST involves calculating P-values and confidence intervals to quantify the evidence against the null hypothesis of no effect. However, P-values and confidence intervals...

Descripción completa

Detalles Bibliográficos
Autores principales: Sidebotham, David, Barlow, C. Jake
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Elsevier 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10430847/
https://www.ncbi.nlm.nih.gov/pubmed/37588693
http://dx.doi.org/10.1016/j.bjao.2022.100003
_version_ 1785091060135886848
author Sidebotham, David
Barlow, C. Jake
author_facet Sidebotham, David
Barlow, C. Jake
author_sort Sidebotham, David
collection PubMed
description BACKGROUND: In medical research, null hypothesis significance testing (NHST) is the dominant framework for statistical inference. NHST involves calculating P-values and confidence intervals to quantify the evidence against the null hypothesis of no effect. However, P-values and confidence intervals cannot tell us the probability that the hypothesis is true. In contrast, false-positive risk (FPR) and false-negative risk (FNR) are post-test probabilities concerning the truth of the hypothesis, that is to say, the probability a real effect exists. METHODS: We calculated the FPR or FNR for 53 individual multicentre trials in critical care based on a pretest probability of 0.5 that the hypothesis was true. RESULTS: For trials reporting statistical significance, the FPR varied between 0.1% and 57.6%. For trials reporting non-significance, the FNR varied between 1.7% and 36.9%. Twenty-six of 47 trials (55.3%) reporting non-significance provided strong or very strong evidence in favour of the null hypothesis; the remaining trials provided limited evidence. There was no obvious relationship between the P-value and the FNR. CONCLUSIONS: The FPR and FNR showed marked variability, indicating that the probability of a real or absent treatment effect differed substantially between trials. Only one trial reporting statistical significance provided convincing evidence of a real treatment effect, and nearly half of all trials reporting non-significance provided limited evidence for the absence of a treatment effect. Our findings suggest that the quality of evidence from multicentre trials in critical care is highly variable.
format Online
Article
Text
id pubmed-10430847
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Elsevier
record_format MEDLINE/PubMed
spelling pubmed-104308472023-08-16 False-positive and false-negative risks for individual multicentre trials in critical care Sidebotham, David Barlow, C. Jake BJA Open Original Research Article BACKGROUND: In medical research, null hypothesis significance testing (NHST) is the dominant framework for statistical inference. NHST involves calculating P-values and confidence intervals to quantify the evidence against the null hypothesis of no effect. However, P-values and confidence intervals cannot tell us the probability that the hypothesis is true. In contrast, false-positive risk (FPR) and false-negative risk (FNR) are post-test probabilities concerning the truth of the hypothesis, that is to say, the probability a real effect exists. METHODS: We calculated the FPR or FNR for 53 individual multicentre trials in critical care based on a pretest probability of 0.5 that the hypothesis was true. RESULTS: For trials reporting statistical significance, the FPR varied between 0.1% and 57.6%. For trials reporting non-significance, the FNR varied between 1.7% and 36.9%. Twenty-six of 47 trials (55.3%) reporting non-significance provided strong or very strong evidence in favour of the null hypothesis; the remaining trials provided limited evidence. There was no obvious relationship between the P-value and the FNR. CONCLUSIONS: The FPR and FNR showed marked variability, indicating that the probability of a real or absent treatment effect differed substantially between trials. Only one trial reporting statistical significance provided convincing evidence of a real treatment effect, and nearly half of all trials reporting non-significance provided limited evidence for the absence of a treatment effect. Our findings suggest that the quality of evidence from multicentre trials in critical care is highly variable. Elsevier 2022-03-01 /pmc/articles/PMC10430847/ /pubmed/37588693 http://dx.doi.org/10.1016/j.bjao.2022.100003 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
spellingShingle Original Research Article
Sidebotham, David
Barlow, C. Jake
False-positive and false-negative risks for individual multicentre trials in critical care
title False-positive and false-negative risks for individual multicentre trials in critical care
title_full False-positive and false-negative risks for individual multicentre trials in critical care
title_fullStr False-positive and false-negative risks for individual multicentre trials in critical care
title_full_unstemmed False-positive and false-negative risks for individual multicentre trials in critical care
title_short False-positive and false-negative risks for individual multicentre trials in critical care
title_sort false-positive and false-negative risks for individual multicentre trials in critical care
topic Original Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10430847/
https://www.ncbi.nlm.nih.gov/pubmed/37588693
http://dx.doi.org/10.1016/j.bjao.2022.100003
work_keys_str_mv AT sidebothamdavid falsepositiveandfalsenegativerisksforindividualmulticentretrialsincriticalcare
AT barlowcjake falsepositiveandfalsenegativerisksforindividualmulticentretrialsincriticalcare