Cargando…

A logical analysis of null hypothesis significance testing using popular terminology

BACKGROUND: Null Hypothesis Significance Testing (NHST) has been well criticised over the years yet remains a pillar of statistical inference. Although NHST is well described in terms of statistical models, most textbooks for non-statisticians present the null and alternative hypotheses (H(0) and H(...

Descripción completa

Detalles Bibliográficos
Autor principal:	McNulty, Richard
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2022
Materias:	Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487069/ https://www.ncbi.nlm.nih.gov/pubmed/36123631 http://dx.doi.org/10.1186/s12874-022-01696-5

_version_	1784792414793236480
author	McNulty, Richard
author_facet	McNulty, Richard
author_sort	McNulty, Richard
collection	PubMed
description	BACKGROUND: Null Hypothesis Significance Testing (NHST) has been well criticised over the years yet remains a pillar of statistical inference. Although NHST is well described in terms of statistical models, most textbooks for non-statisticians present the null and alternative hypotheses (H(0) and H(A), respectively) in terms of differences between groups such as (μ(1) = μ(2)) and (μ(1) ≠ μ(2)) and H(A) is often stated to be the research hypothesis. Here we use propositional calculus to analyse the internal logic of NHST when couched in this popular terminology. The testable H(0) is determined by analysing the scope and limits of the P-value and the test statistic’s probability distribution curve. RESULTS: We propose a minimum axiom set NHST in which it is taken as axiomatic that H(0) is rejected if P-value< α. Using the common scenario of the comparison of the means of two sample groups as an example, the testable H(0) is {(μ(1) = μ(2)) and [([Formula: see text] (1) ≠ [Formula: see text] (2)) due to chance alone]}. The H(0) and H(A) pair should be exhaustive to avoid false dichotomies. This entails that H(A) is ¬{(μ(1) = μ(2)) and [([Formula: see text] (1) ≠ [Formula: see text] (2)) due to chance alone]}, rather than the research hypothesis (H(T)). To see the relationship between H(A) and H(T), H(A) can be rewritten as the disjunction H(A): ({(μ(1) = μ(2)) ∧ [([Formula: see text] (1) ≠ [Formula: see text] (2)) not due to chance alone]} ∨ {(μ(1) ≠ μ(2)) ∧ [[Formula: see text] (1) ≠ [Formula: see text] (2)) not due to (μ(1) ≠ μ(2)) alone]} ∨ {(μ(1) ≠ μ(2)) ∧ [([Formula: see text] (1) ≠ [Formula: see text] (2)) due to (μ(1) ≠ μ(2)) alone]}). This reveals that H(T) (the last disjunct in bold) is just one possibility within H(A). It is only by adding premises to NHST that H(T) or other conclusions can be reached. CONCLUSIONS: Using this popular terminology for NHST, analysis shows that the definitions of H(0) and H(A) differ from those found in textbooks. In this framework, achieving a statistically significant result only justifies the broad conclusion that the results are not due to chance alone, not that the research hypothesis is true. More transparency is needed concerning the premises added to NHST to rig particular conclusions such as H(T). There are also ramifications for the interpretation of Type I and II errors, as well as power, which do not specifically refer to H(T) as claimed by texts.
format	Online Article Text
id	pubmed-9487069
institution	National Center for Biotechnology Information
language	English
publishDate	2022
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-94870692022-09-21 A logical analysis of null hypothesis significance testing using popular terminology McNulty, Richard BMC Med Res Methodol Research BACKGROUND: Null Hypothesis Significance Testing (NHST) has been well criticised over the years yet remains a pillar of statistical inference. Although NHST is well described in terms of statistical models, most textbooks for non-statisticians present the null and alternative hypotheses (H(0) and H(A), respectively) in terms of differences between groups such as (μ(1) = μ(2)) and (μ(1) ≠ μ(2)) and H(A) is often stated to be the research hypothesis. Here we use propositional calculus to analyse the internal logic of NHST when couched in this popular terminology. The testable H(0) is determined by analysing the scope and limits of the P-value and the test statistic’s probability distribution curve. RESULTS: We propose a minimum axiom set NHST in which it is taken as axiomatic that H(0) is rejected if P-value< α. Using the common scenario of the comparison of the means of two sample groups as an example, the testable H(0) is {(μ(1) = μ(2)) and [([Formula: see text] (1) ≠ [Formula: see text] (2)) due to chance alone]}. The H(0) and H(A) pair should be exhaustive to avoid false dichotomies. This entails that H(A) is ¬{(μ(1) = μ(2)) and [([Formula: see text] (1) ≠ [Formula: see text] (2)) due to chance alone]}, rather than the research hypothesis (H(T)). To see the relationship between H(A) and H(T), H(A) can be rewritten as the disjunction H(A): ({(μ(1) = μ(2)) ∧ [([Formula: see text] (1) ≠ [Formula: see text] (2)) not due to chance alone]} ∨ {(μ(1) ≠ μ(2)) ∧ [[Formula: see text] (1) ≠ [Formula: see text] (2)) not due to (μ(1) ≠ μ(2)) alone]} ∨ {(μ(1) ≠ μ(2)) ∧ [([Formula: see text] (1) ≠ [Formula: see text] (2)) due to (μ(1) ≠ μ(2)) alone]}). This reveals that H(T) (the last disjunct in bold) is just one possibility within H(A). It is only by adding premises to NHST that H(T) or other conclusions can be reached. CONCLUSIONS: Using this popular terminology for NHST, analysis shows that the definitions of H(0) and H(A) differ from those found in textbooks. In this framework, achieving a statistically significant result only justifies the broad conclusion that the results are not due to chance alone, not that the research hypothesis is true. More transparency is needed concerning the premises added to NHST to rig particular conclusions such as H(T). There are also ramifications for the interpretation of Type I and II errors, as well as power, which do not specifically refer to H(T) as claimed by texts. BioMed Central 2022-09-19 /pmc/articles/PMC9487069/ /pubmed/36123631 http://dx.doi.org/10.1186/s12874-022-01696-5 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle	Research McNulty, Richard A logical analysis of null hypothesis significance testing using popular terminology
title	A logical analysis of null hypothesis significance testing using popular terminology
title_full	A logical analysis of null hypothesis significance testing using popular terminology
title_fullStr	A logical analysis of null hypothesis significance testing using popular terminology
title_full_unstemmed	A logical analysis of null hypothesis significance testing using popular terminology
title_short	A logical analysis of null hypothesis significance testing using popular terminology
title_sort	logical analysis of null hypothesis significance testing using popular terminology
topic	Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9487069/ https://www.ncbi.nlm.nih.gov/pubmed/36123631 http://dx.doi.org/10.1186/s12874-022-01696-5
work_keys_str_mv	AT mcnultyrichard alogicalanalysisofnullhypothesissignificancetestingusingpopularterminology AT mcnultyrichard logicalanalysisofnullhypothesissignificancetestingusingpopularterminology

A logical analysis of null hypothesis significance testing using popular terminology

Ejemplares similares