Cargando…

A Tutorial on Hunting Statistical Significance by Chasing N

There is increasing concern about the replicability of studies in psychology and cognitive neuroscience. Hidden data dredging (also called p-hacking) is a major contributor to this crisis because it substantially increases Type I error resulting in a much larger proportion of false positive findings...

Descripción completa

Detalles Bibliográficos
Autor principal:	Szucs, Denes
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Frontiers Media S.A. 2016
Materias:	Psychology
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5031612/ https://www.ncbi.nlm.nih.gov/pubmed/27713723 http://dx.doi.org/10.3389/fpsyg.2016.01444

_version_	1782454833093017600
author	Szucs, Denes
author_facet	Szucs, Denes
author_sort	Szucs, Denes
collection	PubMed
description	There is increasing concern about the replicability of studies in psychology and cognitive neuroscience. Hidden data dredging (also called p-hacking) is a major contributor to this crisis because it substantially increases Type I error resulting in a much larger proportion of false positive findings than the usually expected 5%. In order to build better intuition to avoid, detect and criticize some typical problems, here I systematically illustrate the large impact of some easy to implement and so, perhaps frequent data dredging techniques on boosting false positive findings. I illustrate several forms of two special cases of data dredging. First, researchers may violate the data collection stopping rules of null hypothesis significance testing by repeatedly checking for statistical significance with various numbers of participants. Second, researchers may group participants post hoc along potential but unplanned independent grouping variables. The first approach ‘hacks’ the number of participants in studies, the second approach ‘hacks’ the number of variables in the analysis. I demonstrate the high amount of false positive findings generated by these techniques with data from true null distributions. I also illustrate that it is extremely easy to introduce strong bias into data by very mild selection and re-testing. Similar, usually undocumented data dredging steps can easily lead to having 20–50%, or more false positives.
format	Online Article Text
id	pubmed-5031612
institution	National Center for Biotechnology Information
language	English
publishDate	2016
publisher	Frontiers Media S.A.
record_format	MEDLINE/PubMed
spelling	pubmed-50316122016-10-06 A Tutorial on Hunting Statistical Significance by Chasing N Szucs, Denes Front Psychol Psychology There is increasing concern about the replicability of studies in psychology and cognitive neuroscience. Hidden data dredging (also called p-hacking) is a major contributor to this crisis because it substantially increases Type I error resulting in a much larger proportion of false positive findings than the usually expected 5%. In order to build better intuition to avoid, detect and criticize some typical problems, here I systematically illustrate the large impact of some easy to implement and so, perhaps frequent data dredging techniques on boosting false positive findings. I illustrate several forms of two special cases of data dredging. First, researchers may violate the data collection stopping rules of null hypothesis significance testing by repeatedly checking for statistical significance with various numbers of participants. Second, researchers may group participants post hoc along potential but unplanned independent grouping variables. The first approach ‘hacks’ the number of participants in studies, the second approach ‘hacks’ the number of variables in the analysis. I demonstrate the high amount of false positive findings generated by these techniques with data from true null distributions. I also illustrate that it is extremely easy to introduce strong bias into data by very mild selection and re-testing. Similar, usually undocumented data dredging steps can easily lead to having 20–50%, or more false positives. Frontiers Media S.A. 2016-09-22 /pmc/articles/PMC5031612/ /pubmed/27713723 http://dx.doi.org/10.3389/fpsyg.2016.01444 Text en Copyright © 2016 Szucs. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle	Psychology Szucs, Denes A Tutorial on Hunting Statistical Significance by Chasing N
title	A Tutorial on Hunting Statistical Significance by Chasing N
title_full	A Tutorial on Hunting Statistical Significance by Chasing N
title_fullStr	A Tutorial on Hunting Statistical Significance by Chasing N
title_full_unstemmed	A Tutorial on Hunting Statistical Significance by Chasing N
title_short	A Tutorial on Hunting Statistical Significance by Chasing N
title_sort	tutorial on hunting statistical significance by chasing n
topic	Psychology
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5031612/ https://www.ncbi.nlm.nih.gov/pubmed/27713723 http://dx.doi.org/10.3389/fpsyg.2016.01444
work_keys_str_mv	AT szucsdenes atutorialonhuntingstatisticalsignificancebychasingn AT szucsdenes tutorialonhuntingstatisticalsignificancebychasingn

A Tutorial on Hunting Statistical Significance by Chasing N

Ejemplares similares