Cargando…
A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results
A typical rule that has been used for the endorsement of new medications by the Food and Drug Administration is to have two trials, each convincing on its own, demonstrating effectiveness. “Convincing” may be subjectively interpreted, but the use of p-values and the focus on statistical significance...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5342224/ https://www.ncbi.nlm.nih.gov/pubmed/28273140 http://dx.doi.org/10.1371/journal.pone.0173184 |
_version_ | 1782513132347850752 |
---|---|
author | van Ravenzwaaij, Don Ioannidis, John P. A. |
author_facet | van Ravenzwaaij, Don Ioannidis, John P. A. |
author_sort | van Ravenzwaaij, Don |
collection | PubMed |
description | A typical rule that has been used for the endorsement of new medications by the Food and Drug Administration is to have two trials, each convincing on its own, demonstrating effectiveness. “Convincing” may be subjectively interpreted, but the use of p-values and the focus on statistical significance (in particular with p < .05 being coined significant) is pervasive in clinical research. Therefore, in this paper, we calculate with simulations what it means to have exactly two trials, each with p < .05, in terms of the actual strength of evidence quantified by Bayes factors. Our results show that different cases where two trials have a p-value below .05 have wildly differing Bayes factors. Bayes factors of at least 20 in favor of the alternative hypothesis are not necessarily achieved and they fail to be reached in a large proportion of cases, in particular when the true effect size is small (0.2 standard deviations) or zero. In a non-trivial number of cases, evidence actually points to the null hypothesis, in particular when the true effect size is zero, when the number of trials is large, and when the number of participants in both groups is low. We recommend use of Bayes factors as a routine tool to assess endorsement of new medications, because Bayes factors consistently quantify strength of evidence. Use of p-values may lead to paradoxical and spurious decision-making regarding the use of new medications. |
format | Online Article Text |
id | pubmed-5342224 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-53422242017-03-29 A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results van Ravenzwaaij, Don Ioannidis, John P. A. PLoS One Research Article A typical rule that has been used for the endorsement of new medications by the Food and Drug Administration is to have two trials, each convincing on its own, demonstrating effectiveness. “Convincing” may be subjectively interpreted, but the use of p-values and the focus on statistical significance (in particular with p < .05 being coined significant) is pervasive in clinical research. Therefore, in this paper, we calculate with simulations what it means to have exactly two trials, each with p < .05, in terms of the actual strength of evidence quantified by Bayes factors. Our results show that different cases where two trials have a p-value below .05 have wildly differing Bayes factors. Bayes factors of at least 20 in favor of the alternative hypothesis are not necessarily achieved and they fail to be reached in a large proportion of cases, in particular when the true effect size is small (0.2 standard deviations) or zero. In a non-trivial number of cases, evidence actually points to the null hypothesis, in particular when the true effect size is zero, when the number of trials is large, and when the number of participants in both groups is low. We recommend use of Bayes factors as a routine tool to assess endorsement of new medications, because Bayes factors consistently quantify strength of evidence. Use of p-values may lead to paradoxical and spurious decision-making regarding the use of new medications. Public Library of Science 2017-03-08 /pmc/articles/PMC5342224/ /pubmed/28273140 http://dx.doi.org/10.1371/journal.pone.0173184 Text en https://creativecommons.org/publicdomain/zero/1.0/ This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication. |
spellingShingle | Research Article van Ravenzwaaij, Don Ioannidis, John P. A. A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
title | A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
title_full | A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
title_fullStr | A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
title_full_unstemmed | A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
title_short | A simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
title_sort | simulation study of the strength of evidence in the recommendation of medications based on two trials with statistically significant results |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5342224/ https://www.ncbi.nlm.nih.gov/pubmed/28273140 http://dx.doi.org/10.1371/journal.pone.0173184 |
work_keys_str_mv | AT vanravenzwaaijdon asimulationstudyofthestrengthofevidenceintherecommendationofmedicationsbasedontwotrialswithstatisticallysignificantresults AT ioannidisjohnpa asimulationstudyofthestrengthofevidenceintherecommendationofmedicationsbasedontwotrialswithstatisticallysignificantresults AT vanravenzwaaijdon simulationstudyofthestrengthofevidenceintherecommendationofmedicationsbasedontwotrialswithstatisticallysignificantresults AT ioannidisjohnpa simulationstudyofthestrengthofevidenceintherecommendationofmedicationsbasedontwotrialswithstatisticallysignificantresults |