Cargando…

Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value

Background. The p-curve is a plot of the distribution of p-values reported in a set of scientific studies. Comparisons between ranges of p-values have been used to evaluate fields of research in terms of the extent to which studies have genuine evidential value, and the extent to which they suffer f...

Descripción completa

Detalles Bibliográficos
Autores principales:	Bishop, Dorothy V.M., Thompson, Paul A.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	PeerJ Inc. 2016
Materias:	Science Policy
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4768688/ https://www.ncbi.nlm.nih.gov/pubmed/26925335 http://dx.doi.org/10.7717/peerj.1715

_version_	1782417989511938048
author	Bishop, Dorothy V.M. Thompson, Paul A.
author_facet	Bishop, Dorothy V.M. Thompson, Paul A.
author_sort	Bishop, Dorothy V.M.
collection	PubMed
description	Background. The p-curve is a plot of the distribution of p-values reported in a set of scientific studies. Comparisons between ranges of p-values have been used to evaluate fields of research in terms of the extent to which studies have genuine evidential value, and the extent to which they suffer from bias in the selection of variables and analyses for publication, p-hacking. Methods. p-hacking can take various forms. Here we used R code to simulate the use of ghost variables, where an experimenter gathers data on several dependent variables but reports only those with statistically significant effects. We also examined a text-mined dataset used by Head et al. (2015) and assessed its suitability for investigating p-hacking. Results. We show that when there is ghost p-hacking, the shape of the p-curve depends on whether dependent variables are intercorrelated. For uncorrelated variables, simulated p-hacked data do not give the “p-hacking bump” just below .05 that is regarded as evidence of p-hacking, though there is a negative skew when simulated variables are inter-correlated. The way p-curves vary according to features of underlying data poses problems when automated text mining is used to detect p-values in heterogeneous sets of published papers. Conclusions. The absence of a bump in the p-curve is not indicative of lack of p-hacking. Furthermore, while studies with evidential value will usually generate a right-skewed p-curve, we cannot treat a right-skewed p-curve as an indicator of the extent of evidential value, unless we have a model specific to the type of p-values entered into the analysis. We conclude that it is not feasible to use the p-curve to estimate the extent of p-hacking and evidential value unless there is considerable control over the type of data entered into the analysis. In particular, p-hacking with ghost variables is likely to be missed.
format	Online Article Text
id	pubmed-4768688
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	PeerJ Inc.
record_format	MEDLINE/PubMed
spelling	pubmed-47686882016-02-26 Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value Bishop, Dorothy V.M. Thompson, Paul A. PeerJ Science Policy Background. The p-curve is a plot of the distribution of p-values reported in a set of scientific studies. Comparisons between ranges of p-values have been used to evaluate fields of research in terms of the extent to which studies have genuine evidential value, and the extent to which they suffer from bias in the selection of variables and analyses for publication, p-hacking. Methods. p-hacking can take various forms. Here we used R code to simulate the use of ghost variables, where an experimenter gathers data on several dependent variables but reports only those with statistically significant effects. We also examined a text-mined dataset used by Head et al. (2015) and assessed its suitability for investigating p-hacking. Results. We show that when there is ghost p-hacking, the shape of the p-curve depends on whether dependent variables are intercorrelated. For uncorrelated variables, simulated p-hacked data do not give the “p-hacking bump” just below .05 that is regarded as evidence of p-hacking, though there is a negative skew when simulated variables are inter-correlated. The way p-curves vary according to features of underlying data poses problems when automated text mining is used to detect p-values in heterogeneous sets of published papers. Conclusions. The absence of a bump in the p-curve is not indicative of lack of p-hacking. Furthermore, while studies with evidential value will usually generate a right-skewed p-curve, we cannot treat a right-skewed p-curve as an indicator of the extent of evidential value, unless we have a model specific to the type of p-values entered into the analysis. We conclude that it is not feasible to use the p-curve to estimate the extent of p-hacking and evidential value unless there is considerable control over the type of data entered into the analysis. In particular, p-hacking with ghost variables is likely to be missed. PeerJ Inc. 2016-02-18 /pmc/articles/PMC4768688/ /pubmed/26925335 http://dx.doi.org/10.7717/peerj.1715 Text en ©2016 Bishop and Thompson http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, reproduction and adaptation in any medium and for any purpose provided that it is properly attributed. For attribution, the original author(s), title, publication source (PeerJ) and either DOI or URL of the article must be cited.
spellingShingle	Science Policy Bishop, Dorothy V.M. Thompson, Paul A. Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
title	Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
title_full	Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
title_fullStr	Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
title_full_unstemmed	Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
title_short	Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
title_sort	problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value
topic	Science Policy
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4768688/ https://www.ncbi.nlm.nih.gov/pubmed/26925335 http://dx.doi.org/10.7717/peerj.1715
work_keys_str_mv	AT bishopdorothyvm problemsinusingpcurveanalysisandtextminingtodetectrateofphackingandevidentialvalue AT thompsonpaula problemsinusingpcurveanalysisandtextminingtodetectrateofphackingandevidentialvalue

Problems in using p-curve analysis and text-mining to detect rate of p-hacking and evidential value

Ejemplares similares