Cargando…
Bayesian prediction intervals for assessing P-value variability in prospective replication studies
Increased availability of data and accessibility of computational tools in recent years have created an unprecedented upsurge of scientific studies driven by statistical analysis. Limitations inherent to statistics impose constraints on the reliability of conclusions drawn from data, so misuse of st...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802740/ https://www.ncbi.nlm.nih.gov/pubmed/29217835 http://dx.doi.org/10.1038/s41398-017-0024-3 |
_version_ | 1783298578095013888 |
---|---|
author | Vsevolozhskaya, Olga Ruiz, Gabriel Zaykin, Dmitri |
author_facet | Vsevolozhskaya, Olga Ruiz, Gabriel Zaykin, Dmitri |
author_sort | Vsevolozhskaya, Olga |
collection | PubMed |
description | Increased availability of data and accessibility of computational tools in recent years have created an unprecedented upsurge of scientific studies driven by statistical analysis. Limitations inherent to statistics impose constraints on the reliability of conclusions drawn from data, so misuse of statistical methods is a growing concern. Hypothesis and significance testing, and the accompanying P-values are being scrutinized as representing the most widely applied and abused practices. One line of critique is that P-values are inherently unfit to fulfill their ostensible role as measures of credibility for scientific hypotheses. It has also been suggested that while P-values may have their role as summary measures of effect, researchers underappreciate the degree of randomness in the P-value. High variability of P-values would suggest that having obtained a small P-value in one study, one is, ne vertheless, still likely to obtain a much larger P-value in a similarly powered replication study. Thus, “replicability of P-value” is in itself questionable. To characterize P-value variability, one can use prediction intervals whose endpoints reflect the likely spread of P-values that could have been obtained by a replication study. Unfortunately, the intervals currently in use, the frequentist P-intervals, are based on unrealistic implicit assumptions. Namely, P-intervals are constructed with the assumptions that imply substantial chances of encountering large values of effect size in an observational study, which leads to bias. The long-run frequentist probability provided by P-intervals is similar in interpretation to that of the classical confidence intervals, but the endpoints of any particular interval lack interpretation as probabilistic bounds for the possible spread of future P-values that may have been obtained in replication studies. Along with classical frequentist intervals, there exists a Bayesian viewpoint toward interval construction in which the endpoints of an interval have a meaningful probabilistic interpretation. We propose Bayesian intervals for prediction of P-value variability in prospective replication studies. Contingent upon approximate prior knowledge of the effect size distribution, our proposed Bayesian intervals have endpoints that are directly interpretable as probabilistic bounds for replication P-values, and they are resistant to selection bias. We showcase our approach by its application to P-values reported for five psychiatric disorders by the Psychiatric Genomics Consortium group. |
format | Online Article Text |
id | pubmed-5802740 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-58027402018-02-08 Bayesian prediction intervals for assessing P-value variability in prospective replication studies Vsevolozhskaya, Olga Ruiz, Gabriel Zaykin, Dmitri Transl Psychiatry Article Increased availability of data and accessibility of computational tools in recent years have created an unprecedented upsurge of scientific studies driven by statistical analysis. Limitations inherent to statistics impose constraints on the reliability of conclusions drawn from data, so misuse of statistical methods is a growing concern. Hypothesis and significance testing, and the accompanying P-values are being scrutinized as representing the most widely applied and abused practices. One line of critique is that P-values are inherently unfit to fulfill their ostensible role as measures of credibility for scientific hypotheses. It has also been suggested that while P-values may have their role as summary measures of effect, researchers underappreciate the degree of randomness in the P-value. High variability of P-values would suggest that having obtained a small P-value in one study, one is, ne vertheless, still likely to obtain a much larger P-value in a similarly powered replication study. Thus, “replicability of P-value” is in itself questionable. To characterize P-value variability, one can use prediction intervals whose endpoints reflect the likely spread of P-values that could have been obtained by a replication study. Unfortunately, the intervals currently in use, the frequentist P-intervals, are based on unrealistic implicit assumptions. Namely, P-intervals are constructed with the assumptions that imply substantial chances of encountering large values of effect size in an observational study, which leads to bias. The long-run frequentist probability provided by P-intervals is similar in interpretation to that of the classical confidence intervals, but the endpoints of any particular interval lack interpretation as probabilistic bounds for the possible spread of future P-values that may have been obtained in replication studies. Along with classical frequentist intervals, there exists a Bayesian viewpoint toward interval construction in which the endpoints of an interval have a meaningful probabilistic interpretation. We propose Bayesian intervals for prediction of P-value variability in prospective replication studies. Contingent upon approximate prior knowledge of the effect size distribution, our proposed Bayesian intervals have endpoints that are directly interpretable as probabilistic bounds for replication P-values, and they are resistant to selection bias. We showcase our approach by its application to P-values reported for five psychiatric disorders by the Psychiatric Genomics Consortium group. Nature Publishing Group UK 2017-12-08 /pmc/articles/PMC5802740/ /pubmed/29217835 http://dx.doi.org/10.1038/s41398-017-0024-3 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/. |
spellingShingle | Article Vsevolozhskaya, Olga Ruiz, Gabriel Zaykin, Dmitri Bayesian prediction intervals for assessing P-value variability in prospective replication studies |
title | Bayesian prediction intervals for assessing P-value variability in prospective replication studies |
title_full | Bayesian prediction intervals for assessing P-value variability in prospective replication studies |
title_fullStr | Bayesian prediction intervals for assessing P-value variability in prospective replication studies |
title_full_unstemmed | Bayesian prediction intervals for assessing P-value variability in prospective replication studies |
title_short | Bayesian prediction intervals for assessing P-value variability in prospective replication studies |
title_sort | bayesian prediction intervals for assessing p-value variability in prospective replication studies |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5802740/ https://www.ncbi.nlm.nih.gov/pubmed/29217835 http://dx.doi.org/10.1038/s41398-017-0024-3 |
work_keys_str_mv | AT vsevolozhskayaolga bayesianpredictionintervalsforassessingpvaluevariabilityinprospectivereplicationstudies AT ruizgabriel bayesianpredictionintervalsforassessingpvaluevariabilityinprospectivereplicationstudies AT zaykindmitri bayesianpredictionintervalsforassessingpvaluevariabilityinprospectivereplicationstudies |