Cargando…

A simple model suggesting economically rational sample-size choice drives irreproducibility

Several systematic studies have suggested that a large fraction of published research is not reproducible. One probable reason for low reproducibility is insufficient sample size, resulting in low power and low positive predictive value. It has been suggested that insufficient sample-size choice is...

Descripción completa

Detalles Bibliográficos
Autor principal: Braganza, Oliver
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7065751/
https://www.ncbi.nlm.nih.gov/pubmed/32160229
http://dx.doi.org/10.1371/journal.pone.0229615
_version_ 1783505112028676096
author Braganza, Oliver
author_facet Braganza, Oliver
author_sort Braganza, Oliver
collection PubMed
description Several systematic studies have suggested that a large fraction of published research is not reproducible. One probable reason for low reproducibility is insufficient sample size, resulting in low power and low positive predictive value. It has been suggested that insufficient sample-size choice is driven by a combination of scientific competition and ‘positive publication bias’. Here we formalize this intuition in a simple model, in which scientists choose economically rational sample sizes, balancing the cost of experimentation with income from publication. Specifically, assuming that a scientist’s income derives only from ‘positive’ findings (positive publication bias) and that individual samples cost a fixed amount, allows to leverage basic statistical formulas into an economic optimality prediction. We find that if effects have i) low base probability, ii) small effect size or iii) low grant income per publication, then the rational (economically optimal) sample size is small. Furthermore, for plausible distributions of these parameters we find a robust emergence of a bimodal distribution of obtained statistical power and low overall reproducibility rates, both matching empirical findings. Finally, we explore conditional equivalence testing as a means to align economic incentives with adequate sample sizes. Overall, the model describes a simple mechanism explaining both the prevalence and the persistence of small sample sizes, and is well suited for empirical validation. It proposes economic rationality, or economic pressures, as a principal driver of irreproducibility and suggests strategies to change this.
format Online
Article
Text
id pubmed-7065751
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-70657512020-03-23 A simple model suggesting economically rational sample-size choice drives irreproducibility Braganza, Oliver PLoS One Research Article Several systematic studies have suggested that a large fraction of published research is not reproducible. One probable reason for low reproducibility is insufficient sample size, resulting in low power and low positive predictive value. It has been suggested that insufficient sample-size choice is driven by a combination of scientific competition and ‘positive publication bias’. Here we formalize this intuition in a simple model, in which scientists choose economically rational sample sizes, balancing the cost of experimentation with income from publication. Specifically, assuming that a scientist’s income derives only from ‘positive’ findings (positive publication bias) and that individual samples cost a fixed amount, allows to leverage basic statistical formulas into an economic optimality prediction. We find that if effects have i) low base probability, ii) small effect size or iii) low grant income per publication, then the rational (economically optimal) sample size is small. Furthermore, for plausible distributions of these parameters we find a robust emergence of a bimodal distribution of obtained statistical power and low overall reproducibility rates, both matching empirical findings. Finally, we explore conditional equivalence testing as a means to align economic incentives with adequate sample sizes. Overall, the model describes a simple mechanism explaining both the prevalence and the persistence of small sample sizes, and is well suited for empirical validation. It proposes economic rationality, or economic pressures, as a principal driver of irreproducibility and suggests strategies to change this. Public Library of Science 2020-03-11 /pmc/articles/PMC7065751/ /pubmed/32160229 http://dx.doi.org/10.1371/journal.pone.0229615 Text en © 2020 Oliver Braganza http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Braganza, Oliver
A simple model suggesting economically rational sample-size choice drives irreproducibility
title A simple model suggesting economically rational sample-size choice drives irreproducibility
title_full A simple model suggesting economically rational sample-size choice drives irreproducibility
title_fullStr A simple model suggesting economically rational sample-size choice drives irreproducibility
title_full_unstemmed A simple model suggesting economically rational sample-size choice drives irreproducibility
title_short A simple model suggesting economically rational sample-size choice drives irreproducibility
title_sort simple model suggesting economically rational sample-size choice drives irreproducibility
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7065751/
https://www.ncbi.nlm.nih.gov/pubmed/32160229
http://dx.doi.org/10.1371/journal.pone.0229615
work_keys_str_mv AT braganzaoliver asimplemodelsuggestingeconomicallyrationalsamplesizechoicedrivesirreproducibility
AT braganzaoliver simplemodelsuggestingeconomicallyrationalsamplesizechoicedrivesirreproducibility