Cargando…

On the challenges of drawing conclusions from p-values just below 0.05

In recent years, researchers have attempted to provide an indication of the prevalence of inflated Type 1 error rates by analyzing the distribution of p-values in the published literature. De Winter & Dodou (2015) analyzed the distribution (and its change over time) of a large number of p-values...

Descripción completa

Detalles Bibliográficos
Autor principal: Lakens, Daniël
Formato: Online Artículo Texto
Lenguaje:English
Publicado: PeerJ Inc. 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4525697/
https://www.ncbi.nlm.nih.gov/pubmed/26246976
http://dx.doi.org/10.7717/peerj.1142
_version_ 1782384348353265664
author Lakens, Daniël
author_facet Lakens, Daniël
author_sort Lakens, Daniël
collection PubMed
description In recent years, researchers have attempted to provide an indication of the prevalence of inflated Type 1 error rates by analyzing the distribution of p-values in the published literature. De Winter & Dodou (2015) analyzed the distribution (and its change over time) of a large number of p-values automatically extracted from abstracts in the scientific literature. They concluded there is a ‘surge of p-values between 0.041–0.049 in recent decades’ which ‘suggests (but does not prove) questionable research practices have increased over the past 25 years.’ I show the changes in the ratio of fractions of p-values between 0.041–0.049 over the years are better explained by assuming the average power has decreased over time. Furthermore, I propose that their observation that p-values just below 0.05 increase more strongly than p-values above 0.05 can be explained by an increase in publication bias (or the file drawer effect) over the years (cf. Fanelli, 2012; Pautasso, 2010, which has led to a relative decrease of ‘marginally significant’ p-values in abstracts in the literature (instead of an increase in p-values just below 0.05). I explain why researchers analyzing large numbers of p-values need to relate their assumptions to a model of p-value distributions that takes into account the average power of the performed studies, the ratio of true positives to false positives in the literature, the effects of publication bias, and the Type 1 error rate (and possible mechanisms through which it has inflated). Finally, I discuss why publication bias and underpowered studies might be a bigger problem for science than inflated Type 1 error rates, and explain the challenges when attempting to draw conclusions about inflated Type 1 error rates from a large heterogeneous set of p-values.
format Online
Article
Text
id pubmed-4525697
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher PeerJ Inc.
record_format MEDLINE/PubMed
spelling pubmed-45256972015-08-05 On the challenges of drawing conclusions from p-values just below 0.05 Lakens, Daniël PeerJ Statistics In recent years, researchers have attempted to provide an indication of the prevalence of inflated Type 1 error rates by analyzing the distribution of p-values in the published literature. De Winter & Dodou (2015) analyzed the distribution (and its change over time) of a large number of p-values automatically extracted from abstracts in the scientific literature. They concluded there is a ‘surge of p-values between 0.041–0.049 in recent decades’ which ‘suggests (but does not prove) questionable research practices have increased over the past 25 years.’ I show the changes in the ratio of fractions of p-values between 0.041–0.049 over the years are better explained by assuming the average power has decreased over time. Furthermore, I propose that their observation that p-values just below 0.05 increase more strongly than p-values above 0.05 can be explained by an increase in publication bias (or the file drawer effect) over the years (cf. Fanelli, 2012; Pautasso, 2010, which has led to a relative decrease of ‘marginally significant’ p-values in abstracts in the literature (instead of an increase in p-values just below 0.05). I explain why researchers analyzing large numbers of p-values need to relate their assumptions to a model of p-value distributions that takes into account the average power of the performed studies, the ratio of true positives to false positives in the literature, the effects of publication bias, and the Type 1 error rate (and possible mechanisms through which it has inflated). Finally, I discuss why publication bias and underpowered studies might be a bigger problem for science than inflated Type 1 error rates, and explain the challenges when attempting to draw conclusions about inflated Type 1 error rates from a large heterogeneous set of p-values. PeerJ Inc. 2015-07-30 /pmc/articles/PMC4525697/ /pubmed/26246976 http://dx.doi.org/10.7717/peerj.1142 Text en © 2015 Lakens http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle Statistics
Lakens, Daniël
On the challenges of drawing conclusions from p-values just below 0.05
title On the challenges of drawing conclusions from p-values just below 0.05
title_full On the challenges of drawing conclusions from p-values just below 0.05
title_fullStr On the challenges of drawing conclusions from p-values just below 0.05
title_full_unstemmed On the challenges of drawing conclusions from p-values just below 0.05
title_short On the challenges of drawing conclusions from p-values just below 0.05
title_sort on the challenges of drawing conclusions from p-values just below 0.05
topic Statistics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4525697/
https://www.ncbi.nlm.nih.gov/pubmed/26246976
http://dx.doi.org/10.7717/peerj.1142
work_keys_str_mv AT lakensdaniel onthechallengesofdrawingconclusionsfrompvaluesjustbelow005