False-positive and false-negative risks for individual multicentre trials in critical care

BACKGROUND: In medical research, null hypothesis significance testing (NHST) is the dominant framework for statistical inference. NHST involves calculating P-values and confidence intervals to quantify the evidence against the null hypothesis of no effect. However, P-values and confidence intervals...


Bibliographic Details
Main authors:	Sidebotham, David; Barlow, C. Jake
Format:	Online Article Text
Language:	English
Published:	Elsevier 2022
Subjects:	Original Research Article
Online access:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10430847/ https://www.ncbi.nlm.nih.gov/pubmed/37588693 http://dx.doi.org/10.1016/j.bjao.2022.100003

author	Sidebotham, David; Barlow, C. Jake
collection	PubMed
description	BACKGROUND: In medical research, null hypothesis significance testing (NHST) is the dominant framework for statistical inference. NHST involves calculating P-values and confidence intervals to quantify the evidence against the null hypothesis of no effect. However, P-values and confidence intervals cannot tell us the probability that the hypothesis is true. In contrast, false-positive risk (FPR) and false-negative risk (FNR) are post-test probabilities concerning the truth of the hypothesis, that is to say, the probability a real effect exists. METHODS: We calculated the FPR or FNR for 53 individual multicentre trials in critical care based on a pretest probability of 0.5 that the hypothesis was true. RESULTS: For trials reporting statistical significance, the FPR varied between 0.1% and 57.6%. For trials reporting non-significance, the FNR varied between 1.7% and 36.9%. Twenty-six of 47 trials (55.3%) reporting non-significance provided strong or very strong evidence in favour of the null hypothesis; the remaining trials provided limited evidence. There was no obvious relationship between the P-value and the FNR. CONCLUSIONS: The FPR and FNR showed marked variability, indicating that the probability of a real or absent treatment effect differed substantially between trials. Only one trial reporting statistical significance provided convincing evidence of a real treatment effect, and nearly half of all trials reporting non-significance provided limited evidence for the absence of a treatment effect. Our findings suggest that the quality of evidence from multicentre trials in critical care is highly variable.
format	Online Article Text
id	pubmed-10430847
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	Elsevier
record_format	MEDLINE/PubMed
spelling	pubmed-10430847 2023-08-16 False-positive and false-negative risks for individual multicentre trials in critical care Sidebotham, David; Barlow, C. Jake BJA Open Original Research Article
Elsevier 2022-03-01 /pmc/articles/PMC10430847/ /pubmed/37588693 http://dx.doi.org/10.1016/j.bjao.2022.100003 Text en © 2022 The Author(s) https://creativecommons.org/licenses/by-nc-nd/4.0/ This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/).
title	False-positive and false-negative risks for individual multicentre trials in critical care
topic	Original Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10430847/ https://www.ncbi.nlm.nih.gov/pubmed/37588693 http://dx.doi.org/10.1016/j.bjao.2022.100003
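
The METHODS statement in the description above gives only the pretest probability (0.5); it does not say how each trial's evidence was quantified. As a rough illustration, and assuming the evidence is summarised as a Bayes factor in favour of a real effect, the post-test probabilities could be sketched in Python as follows. The function names and the `bayes_factor_10` input are hypothetical and are not taken from the article.

```python
def posterior_probability_of_effect(bayes_factor_10: float,
                                    pretest_probability: float = 0.5) -> float:
    """Post-test probability that a real effect exists.

    Uses Bayes' theorem in odds form: posterior odds = Bayes factor * prior odds.
    `bayes_factor_10` (evidence for an effect relative to no effect) is an
    assumed input; the record does not state how the authors derived it.
    """
    if not 0.0 < pretest_probability < 1.0:
        raise ValueError("pretest_probability must lie strictly between 0 and 1")
    if bayes_factor_10 <= 0.0:
        raise ValueError("bayes_factor_10 must be positive")
    prior_odds = pretest_probability / (1.0 - pretest_probability)
    posterior_odds = bayes_factor_10 * prior_odds
    return posterior_odds / (1.0 + posterior_odds)


def false_positive_risk(bayes_factor_10: float,
                        pretest_probability: float = 0.5) -> float:
    """Probability that no real effect exists despite a significant result."""
    return 1.0 - posterior_probability_of_effect(bayes_factor_10,
                                                 pretest_probability)


def false_negative_risk(bayes_factor_10: float,
                        pretest_probability: float = 0.5) -> float:
    """Probability that a real effect exists despite a non-significant result."""
    return posterior_probability_of_effect(bayes_factor_10, pretest_probability)


if __name__ == "__main__":
    # Illustrative inputs only; these are not values from any trial in the study.
    print(false_positive_risk(9.0))
    print(false_negative_risk(0.25))
```

With a pretest probability of 0.5 the prior odds equal 1, so under these assumptions the FPR reduces to 1 / (1 + BF10) and the FNR to BF10 / (1 + BF10).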
