Cargando…
Beyond the traditional simulation design for evaluating type 1 error control: From the “theoretical” null to “empirical” null
When evaluating a newly developed statistical test, an important step is to check its type 1 error (T1E) control using simulations. This is often achieved by the standard simulation design S0 under the so‐called “theoretical” null of no association. In practice, the whole‐genome association analyses...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6518945/ https://www.ncbi.nlm.nih.gov/pubmed/30478944 http://dx.doi.org/10.1002/gepi.22172 |
Sumario: | When evaluating a newly developed statistical test, an important step is to check its type 1 error (T1E) control using simulations. This is often achieved by the standard simulation design S0 under the so‐called “theoretical” null of no association. In practice, the whole‐genome association analyses scan through a large number of genetic markers ([Formula: see text] s) for the ones associated with an outcome of interest ([Formula: see text]), where [Formula: see text] comes from an alternative while the majority of [Formula: see text] s are not associated with [Formula: see text]; the [Formula: see text] relationships are under the “empirical” null. This reality can be better represented by two other simulation designs, where design S1.1 simulates [Formula: see text] from analternative model based on [Formula: see text] , then evaluates its association with independently generated [Formula: see text]; while design S1.2 evaluates the association between permutated [Formula: see text] and [Formula: see text]. More than a decade ago, Efron (2004) has noted the important distinction between the “theoretical” and “empirical” null in false discovery rate control. Using scale tests for variance heterogeneity, direct univariate, and multivariate interaction tests as examples, here we show that not all null simulation designs are equal. In examining the accuracy of a likelihood ratio test, while simulation design S0 suggested the method being accurate, designs S1.1 and S1.2 revealed its increased empirical T1E rate if applied in real data setting. The inflation becomes more severe at the tail and does not diminish as sample size increases. This is an important observation that calls for new practices for methods evaluation and T1E control interpretation. |
---|