Cargando…

Elucidating the Foundations of Statistical Inference with 2 x 2 Tables

To many, the foundations of statistical inference are cryptic and irrelevant to routine statistical practice. The analysis of 2 x 2 contingency tables, omnipresent in the scientific literature, is a case in point. Fisher's exact test is routinely used even though it has been fraught with contro...

Descripción completa

Detalles Bibliográficos
Autores principales: Choi, Leena, Blume, Jeffrey D., Dupont, William D.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4388855/
https://www.ncbi.nlm.nih.gov/pubmed/25849515
http://dx.doi.org/10.1371/journal.pone.0121263
_version_ 1782365448018329600
author Choi, Leena
Blume, Jeffrey D.
Dupont, William D.
author_facet Choi, Leena
Blume, Jeffrey D.
Dupont, William D.
author_sort Choi, Leena
collection PubMed
description To many, the foundations of statistical inference are cryptic and irrelevant to routine statistical practice. The analysis of 2 x 2 contingency tables, omnipresent in the scientific literature, is a case in point. Fisher's exact test is routinely used even though it has been fraught with controversy for over 70 years. The problem, not widely acknowledged, is that several different p-values can be associated with a single table, making scientific inference inconsistent. The root cause of this controversy lies in the table's origins and the manner in which nuisance parameters are eliminated. However, fundamental statistical principles (e.g., sufficiency, ancillarity, conditionality, and likelihood) can shed light on the controversy and guide our approach in using this test. In this paper, we use these fundamental principles to show how much information is lost when the tables origins are ignored and when various approaches are used to eliminate unknown nuisance parameters. We present novel likelihood contours to aid in the visualization of information loss and show that the information loss is often virtually non-existent. We find that problems arising from the discreteness of the sample space are exacerbated by p-value-based inference. Accordingly, methods that are less sensitive to this discreteness - likelihood ratios, posterior probabilities and mid-p-values - lead to more consistent inferences.
format Online
Article
Text
id pubmed-4388855
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-43888552015-04-21 Elucidating the Foundations of Statistical Inference with 2 x 2 Tables Choi, Leena Blume, Jeffrey D. Dupont, William D. PLoS One Research Article To many, the foundations of statistical inference are cryptic and irrelevant to routine statistical practice. The analysis of 2 x 2 contingency tables, omnipresent in the scientific literature, is a case in point. Fisher's exact test is routinely used even though it has been fraught with controversy for over 70 years. The problem, not widely acknowledged, is that several different p-values can be associated with a single table, making scientific inference inconsistent. The root cause of this controversy lies in the table's origins and the manner in which nuisance parameters are eliminated. However, fundamental statistical principles (e.g., sufficiency, ancillarity, conditionality, and likelihood) can shed light on the controversy and guide our approach in using this test. In this paper, we use these fundamental principles to show how much information is lost when the tables origins are ignored and when various approaches are used to eliminate unknown nuisance parameters. We present novel likelihood contours to aid in the visualization of information loss and show that the information loss is often virtually non-existent. We find that problems arising from the discreteness of the sample space are exacerbated by p-value-based inference. Accordingly, methods that are less sensitive to this discreteness - likelihood ratios, posterior probabilities and mid-p-values - lead to more consistent inferences. Public Library of Science 2015-04-07 /pmc/articles/PMC4388855/ /pubmed/25849515 http://dx.doi.org/10.1371/journal.pone.0121263 Text en © 2015 Choi et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Choi, Leena
Blume, Jeffrey D.
Dupont, William D.
Elucidating the Foundations of Statistical Inference with 2 x 2 Tables
title Elucidating the Foundations of Statistical Inference with 2 x 2 Tables
title_full Elucidating the Foundations of Statistical Inference with 2 x 2 Tables
title_fullStr Elucidating the Foundations of Statistical Inference with 2 x 2 Tables
title_full_unstemmed Elucidating the Foundations of Statistical Inference with 2 x 2 Tables
title_short Elucidating the Foundations of Statistical Inference with 2 x 2 Tables
title_sort elucidating the foundations of statistical inference with 2 x 2 tables
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4388855/
https://www.ncbi.nlm.nih.gov/pubmed/25849515
http://dx.doi.org/10.1371/journal.pone.0121263
work_keys_str_mv AT choileena elucidatingthefoundationsofstatisticalinferencewith2x2tables
AT blumejeffreyd elucidatingthefoundationsofstatisticalinferencewith2x2tables
AT dupontwilliamd elucidatingthefoundationsofstatisticalinferencewith2x2tables