Cargando…
A powerful and efficient set test for genetic markers that handles confounders
Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the b...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673214/ https://www.ncbi.nlm.nih.gov/pubmed/23599503 http://dx.doi.org/10.1093/bioinformatics/btt177 |
_version_ | 1782272227759095808 |
---|---|
author | Listgarten, Jennifer Lippert, Christoph Kang, Eun Yong Xiang, Jing Kadie, Carl M. Heckerman, David |
author_facet | Listgarten, Jennifer Lippert, Christoph Kang, Eun Yong Xiang, Jing Kadie, Carl M. Heckerman, David |
author_sort | Listgarten, Jennifer |
collection | PubMed |
description | Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power. Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects—one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn’s disease case–control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis. Availability: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com. Contact: jennl@microsoft.com or lippert@microsoft.com or heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online. |
format | Online Article Text |
id | pubmed-3673214 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-36732142013-06-05 A powerful and efficient set test for genetic markers that handles confounders Listgarten, Jennifer Lippert, Christoph Kang, Eun Yong Xiang, Jing Kadie, Carl M. Heckerman, David Bioinformatics Original Papers Motivation: Approaches for testing sets of variants, such as a set of rare or common variants within a gene or pathway, for association with complex traits are important. In particular, set tests allow for aggregation of weak signal within a set, can capture interplay among variants and reduce the burden of multiple hypothesis testing. Until now, these approaches did not address confounding by family relatedness and population structure, a problem that is becoming more important as larger datasets are used to increase power. Results: We introduce a new approach for set tests that handles confounders. Our model is based on the linear mixed model and uses two random effects—one to capture the set association signal and one to capture confounders. We also introduce a computational speedup for two random-effects models that makes this approach feasible even for extremely large cohorts. Using this model with both the likelihood ratio test and score test, we find that the former yields more power while controlling type I error. Application of our approach to richly structured Genetic Analysis Workshop 14 data demonstrates that our method successfully corrects for population structure and family relatedness, whereas application of our method to a 15 000 individual Crohn’s disease case–control cohort demonstrates that it additionally recovers genes not recoverable by univariate analysis. Availability: A Python-based library implementing our approach is available at http://mscompbio.codeplex.com. Contact: jennl@microsoft.com or lippert@microsoft.com or heckerma@microsoft.com Supplementary information: Supplementary data are available at Bioinformatics online. Oxford University Press 2013-06-15 2013-04-18 /pmc/articles/PMC3673214/ /pubmed/23599503 http://dx.doi.org/10.1093/bioinformatics/btt177 Text en © The Author 2013. Published by Oxford University Press. http://creativecommons.org/licenses/by/3.0/ This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/3.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Papers Listgarten, Jennifer Lippert, Christoph Kang, Eun Yong Xiang, Jing Kadie, Carl M. Heckerman, David A powerful and efficient set test for genetic markers that handles confounders |
title | A powerful and efficient set test for genetic markers that handles confounders |
title_full | A powerful and efficient set test for genetic markers that handles confounders |
title_fullStr | A powerful and efficient set test for genetic markers that handles confounders |
title_full_unstemmed | A powerful and efficient set test for genetic markers that handles confounders |
title_short | A powerful and efficient set test for genetic markers that handles confounders |
title_sort | powerful and efficient set test for genetic markers that handles confounders |
topic | Original Papers |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3673214/ https://www.ncbi.nlm.nih.gov/pubmed/23599503 http://dx.doi.org/10.1093/bioinformatics/btt177 |
work_keys_str_mv | AT listgartenjennifer apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT lippertchristoph apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT kangeunyong apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT xiangjing apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT kadiecarlm apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT heckermandavid apowerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT listgartenjennifer powerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT lippertchristoph powerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT kangeunyong powerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT xiangjing powerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT kadiecarlm powerfulandefficientsettestforgeneticmarkersthathandlesconfounders AT heckermandavid powerfulandefficientsettestforgeneticmarkersthathandlesconfounders |