Cargando…

The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease

Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal...

Descripción completa

Detalles Bibliográficos
Autores principales: Moutsianas, Loukas, Agarwala, Vineeta, Fuchsberger, Christian, Flannick, Jason, Rivas, Manuel A., Gaulton, Kyle J., Albers, Patrick K., McVean, Gil, Boehnke, Michael, Altshuler, David, McCarthy, Mark I.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4407972/
https://www.ncbi.nlm.nih.gov/pubmed/25906071
http://dx.doi.org/10.1371/journal.pgen.1005165
_version_ 1782367996267724800
author Moutsianas, Loukas
Agarwala, Vineeta
Fuchsberger, Christian
Flannick, Jason
Rivas, Manuel A.
Gaulton, Kyle J.
Albers, Patrick K.
McVean, Gil
Boehnke, Michael
Altshuler, David
McCarthy, Mark I.
author_facet Moutsianas, Loukas
Agarwala, Vineeta
Fuchsberger, Christian
Flannick, Jason
Rivas, Manuel A.
Gaulton, Kyle J.
Albers, Patrick K.
McVean, Gil
Boehnke, Michael
Altshuler, David
McCarthy, Mark I.
author_sort Moutsianas, Loukas
collection PubMed
description Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α=2.5×10(-6)) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci.
format Online
Article
Text
id pubmed-4407972
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-44079722015-05-04 The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease Moutsianas, Loukas Agarwala, Vineeta Fuchsberger, Christian Flannick, Jason Rivas, Manuel A. Gaulton, Kyle J. Albers, Patrick K. McVean, Gil Boehnke, Michael Altshuler, David McCarthy, Mark I. PLoS Genet Research Article Genome and exome sequencing in large cohorts enables characterization of the role of rare variation in complex diseases. Success in this endeavor, however, requires investigators to test a diverse array of genetic hypotheses which differ in the number, frequency and effect sizes of underlying causal variants. In this study, we evaluated the power of gene-based association methods to interrogate such hypotheses, and examined the implications for study design. We developed a flexible simulation approach, using 1000 Genomes data, to (a) generate sequence variation at human genes in up to 10K case-control samples, and (b) quantify the statistical power of a panel of widely used gene-based association tests under a variety of allelic architectures, locus effect sizes, and significance thresholds. For loci explaining ~1% of phenotypic variance underlying a common dichotomous trait, we find that all methods have low absolute power to achieve exome-wide significance (~5-20% power at α=2.5×10(-6)) in 3K individuals; even in 10K samples, power is modest (~60%). The combined application of multiple methods increases sensitivity, but does so at the expense of a higher false positive rate. MiST, SKAT-O, and KBAC have the highest individual mean power across simulated datasets, but we observe wide architecture-dependent variability in the individual loci detected by each test, suggesting that inferences about disease architecture from analysis of sequencing studies can differ depending on which methods are used. Our results imply that tens of thousands of individuals, extensive functional annotation, or highly targeted hypothesis testing will be required to confidently detect or exclude rare variant signals at complex disease loci. Public Library of Science 2015-04-23 /pmc/articles/PMC4407972/ /pubmed/25906071 http://dx.doi.org/10.1371/journal.pgen.1005165 Text en © 2015 Moutsianas et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Moutsianas, Loukas
Agarwala, Vineeta
Fuchsberger, Christian
Flannick, Jason
Rivas, Manuel A.
Gaulton, Kyle J.
Albers, Patrick K.
McVean, Gil
Boehnke, Michael
Altshuler, David
McCarthy, Mark I.
The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease
title The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease
title_full The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease
title_fullStr The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease
title_full_unstemmed The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease
title_short The Power of Gene-Based Rare Variant Methods to Detect Disease-Associated Variation and Test Hypotheses About Complex Disease
title_sort power of gene-based rare variant methods to detect disease-associated variation and test hypotheses about complex disease
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4407972/
https://www.ncbi.nlm.nih.gov/pubmed/25906071
http://dx.doi.org/10.1371/journal.pgen.1005165
work_keys_str_mv AT moutsianasloukas thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT agarwalavineeta thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT fuchsbergerchristian thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT flannickjason thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT rivasmanuela thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT gaultonkylej thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT alberspatrickk thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT mcveangil thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT boehnkemichael thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT altshulerdavid thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT mccarthymarki thepowerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT moutsianasloukas powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT agarwalavineeta powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT fuchsbergerchristian powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT flannickjason powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT rivasmanuela powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT gaultonkylej powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT alberspatrickk powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT mcveangil powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT boehnkemichael powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT altshulerdavid powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease
AT mccarthymarki powerofgenebasedrarevariantmethodstodetectdiseaseassociatedvariationandtesthypothesesaboutcomplexdisease