Cargando…

A powerful and efficient set test for genetic markers that handles confounders

Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the b...

Descripción completa

Detalles Bibliográficos
Autores principales: Listgarten, Jennifer, Lippert, Christoph, Kang, Eun Yong, Xiang, Jing, Kadie, Carl M., Heckerman, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673214/
https://www.ncbi.nlm.nih.gov/pubmed/23599503
http://dx.doi.org/10.1093/bioinformatics/btt177
_version_ 1782272227759095808
author Listgarten, Jennifer
Lippert, Christoph
Kang, Eun Yong
Xiang, Jing
Kadie, Carl M.
Heckerman, David
author_facet Listgarten, Jennifer
Lippert, Christoph
Kang, Eun Yong
Xiang, Jing
Kadie, Carl M.
Heckerman, David
author_sort Listgarten, Jennifer
collection PubMed
description Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power. Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects—one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn’s disease case–control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis. Availability: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com. Contact: jennl@microsoft.com or lippert@microsoft.com or heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online.
format Online
Article
Text
id pubmed-3673214
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-36732142013-06-05 A powerful and efficient set test for genetic markers that handles confounders Listgarten, Jennifer Lippert, Christoph Kang, Eun Yong Xiang, Jing Kadie, Carl M. Heckerman, David Bioinformatics Original Papers Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power. Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects—one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn’s disease case–control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis. Availability: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com. Contact: jennl@microsoft.com or lippert@microsoft.com or heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-06-15 2013-04-18 /pmc/articles/PMC3673214/ /pubmed/23599503 http://dx.doi.org/10.1093/bioinformatics/btt177 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Papers
Listgarten, Jennifer
Lippert, Christoph
Kang, Eun Yong
Xiang, Jing
Kadie, Carl M.
Heckerman, David
A powerful and efficient set test for genetic markers that handles confounders
title A powerful and efficient set test for genetic markers that handles confounders
title_full A powerful and efficient set test for genetic markers that handles confounders
title_fullStr A powerful and efficient set test for genetic markers that handles confounders
title_full_unstemmed A powerful and efficient set test for genetic markers that handles confounders
title_short A powerful and efficient set test for genetic markers that handles confounders
title_sort powerful and efficient set test for genetic markers that handles confounders
topic Original Papers
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673214/
https://www.ncbi.nlm.nih.gov/pubmed/23599503
http://dx.doi.org/10.1093/bioinformatics/btt177
work_keys_str_mv AT listgartenjennifer apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT lippertchristoph apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT kangeunyong apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT xiangjing apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT kadiecarlm apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT heckermandavid apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT listgartenjennifer powerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT lippertchristoph powerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT kangeunyong powerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT xiangjing powerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT kadiecarlm powerfulandefficientsettestforgeneticmarkersthathandlesconfounders
AT heckermandavid powerfulandefficientsettestforgeneticmarkersthathandlesconfounders