Cargando…

Improved Statistics for Genome-Wide Interaction Analysis

Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regress...

Descripción completa

Detalles Bibliográficos
Autores principales: Ueki, Masao, Cordell, Heather J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3320596/
https://www.ncbi.nlm.nih.gov/pubmed/22496670
http://dx.doi.org/10.1371/journal.pgen.1002625
_version_ 1782228865903493120
author Ueki, Masao
Cordell, Heather J.
author_facet Ueki, Masao
Cordell, Heather J.
author_sort Ueki, Masao
collection PubMed
description Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new “joint effects” statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al.'s originally-proposed statistics, on account of the inflated error rate that can result.
format Online
Article
Text
id pubmed-3320596
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-33205962012-04-11 Improved Statistics for Genome-Wide Interaction Analysis Ueki, Masao Cordell, Heather J. PLoS Genet Research Article Recently, Wu and colleagues [1] proposed two novel statistics for genome-wide interaction analysis using case/control or case-only data. In computer simulations, their proposed case/control statistic outperformed competing approaches, including the fast-epistasis option in PLINK and logistic regression analysis under the correct model; however, reasons for its superior performance were not fully explored. Here we investigate the theoretical properties and performance of Wu et al.'s proposed statistics and explain why, in some circumstances, they outperform competing approaches. Unfortunately, we find minor errors in the formulae for their statistics, resulting in tests that have higher than nominal type 1 error. We also find minor errors in PLINK's fast-epistasis and case-only statistics, although theory and simulations suggest that these errors have only negligible effect on type 1 error. We propose adjusted versions of all four statistics that, both theoretically and in computer simulations, maintain correct type 1 error rates under the null hypothesis. We also investigate statistics based on correlation coefficients that maintain similar control of type 1 error. Although designed to test specifically for interaction, we show that some of these previously-proposed statistics can, in fact, be sensitive to main effects at one or both loci, particularly in the presence of linkage disequilibrium. We propose two new “joint effects” statistics that, provided the disease is rare, are sensitive only to genuine interaction effects. In computer simulations we find, in most situations considered, that highest power is achieved by analysis under the correct genetic model. Such an analysis is unachievable in practice, as we do not know this model. However, generally high power over a wide range of scenarios is exhibited by our joint effects and adjusted Wu statistics. We recommend use of these alternative or adjusted statistics and urge caution when using Wu et al.'s originally-proposed statistics, on account of the inflated error rate that can result. Public Library of Science 2012-04-05 /pmc/articles/PMC3320596/ /pubmed/22496670 http://dx.doi.org/10.1371/journal.pgen.1002625 Text en Ueki, Cordell. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Ueki, Masao
Cordell, Heather J.
Improved Statistics for Genome-Wide Interaction Analysis
title Improved Statistics for Genome-Wide Interaction Analysis
title_full Improved Statistics for Genome-Wide Interaction Analysis
title_fullStr Improved Statistics for Genome-Wide Interaction Analysis
title_full_unstemmed Improved Statistics for Genome-Wide Interaction Analysis
title_short Improved Statistics for Genome-Wide Interaction Analysis
title_sort improved statistics for genome-wide interaction analysis
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3320596/
https://www.ncbi.nlm.nih.gov/pubmed/22496670
http://dx.doi.org/10.1371/journal.pgen.1002625
work_keys_str_mv AT uekimasao improvedstatisticsforgenomewideinteractionanalysis
AT cordellheatherj improvedstatisticsforgenomewideinteractionanalysis