Cargando…

ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls

A primary goal of the recent investment in sequencing is to detect novel genetic associations in health and disease improving the development of treatments and playing a critical role in precision medicine. While this investment has resulted in an enormous total number of sequenced genomes, individu...

Descripción completa

Detalles Bibliográficos
Autores principales: Hendricks, Audrey E., Billups, Stephen C., Pike, Hamish N. C., Farooqi, I. Sadaf, Zeggini, Eleftheria, Santorico, Stephanie A., Barroso, Inês, Dupuis, Josée
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6191077/
https://www.ncbi.nlm.nih.gov/pubmed/30325923
http://dx.doi.org/10.1371/journal.pgen.1007591
_version_ 1783363656790048768
author Hendricks, Audrey E.
Billups, Stephen C.
Pike, Hamish N. C.
Farooqi, I. Sadaf
Zeggini, Eleftheria
Santorico, Stephanie A.
Barroso, Inês
Dupuis, Josée
author_facet Hendricks, Audrey E.
Billups, Stephen C.
Pike, Hamish N. C.
Farooqi, I. Sadaf
Zeggini, Eleftheria
Santorico, Stephanie A.
Barroso, Inês
Dupuis, Josée
author_sort Hendricks, Audrey E.
collection PubMed
description A primary goal of the recent investment in sequencing is to detect novel genetic associations in health and disease improving the development of treatments and playing a critical role in precision medicine. While this investment has resulted in an enormous total number of sequenced genomes, individual studies of complex traits and diseases are often smaller and underpowered to detect rare variant genetic associations. Existing genetic resources such as the Exome Aggregation Consortium (>60,000 exomes) and the Genome Aggregation Database (~140,000 sequenced samples) have the potential to be used as controls in these studies. Fully utilizing these and other existing sequencing resources may increase power and could be especially useful in studies where resources to sequence additional samples are limited. However, to date, these large, publicly available genetic resources remain underutilized, or even misused, in large part due to the lack of statistical methods that can appropriately use this summary level data. Here, we present a new method to incorporate external controls in case-control analysis called ProxECAT (Proxy External Controls Association Test). ProxECAT estimates enrichment of rare variants within a gene region using internally sequenced cases and external controls. We evaluated ProxECAT in simulations and empirical analyses of obesity cases using both low-depth of coverage (7x) whole-genome sequenced controls and ExAC as controls. We find that ProxECAT maintains the expected type I error rate with increased power as the number of external controls increases. With an accompanying R package, ProxECAT enables the use of publicly available allele frequencies as external controls in case-control analysis.
format Online
Article
Text
id pubmed-6191077
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-61910772018-10-25 ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls Hendricks, Audrey E. Billups, Stephen C. Pike, Hamish N. C. Farooqi, I. Sadaf Zeggini, Eleftheria Santorico, Stephanie A. Barroso, Inês Dupuis, Josée PLoS Genet Research Article A primary goal of the recent investment in sequencing is to detect novel genetic associations in health and disease improving the development of treatments and playing a critical role in precision medicine. While this investment has resulted in an enormous total number of sequenced genomes, individual studies of complex traits and diseases are often smaller and underpowered to detect rare variant genetic associations. Existing genetic resources such as the Exome Aggregation Consortium (>60,000 exomes) and the Genome Aggregation Database (~140,000 sequenced samples) have the potential to be used as controls in these studies. Fully utilizing these and other existing sequencing resources may increase power and could be especially useful in studies where resources to sequence additional samples are limited. However, to date, these large, publicly available genetic resources remain underutilized, or even misused, in large part due to the lack of statistical methods that can appropriately use this summary level data. Here, we present a new method to incorporate external controls in case-control analysis called ProxECAT (Proxy External Controls Association Test). ProxECAT estimates enrichment of rare variants within a gene region using internally sequenced cases and external controls. We evaluated ProxECAT in simulations and empirical analyses of obesity cases using both low-depth of coverage (7x) whole-genome sequenced controls and ExAC as controls. We find that ProxECAT maintains the expected type I error rate with increased power as the number of external controls increases. With an accompanying R package, ProxECAT enables the use of publicly available allele frequencies as external controls in case-control analysis. Public Library of Science 2018-10-16 /pmc/articles/PMC6191077/ /pubmed/30325923 http://dx.doi.org/10.1371/journal.pgen.1007591 Text en © 2018 Hendricks et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
spellingShingle Research Article
Hendricks, Audrey E.
Billups, Stephen C.
Pike, Hamish N. C.
Farooqi, I. Sadaf
Zeggini, Eleftheria
Santorico, Stephanie A.
Barroso, Inês
Dupuis, Josée
ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls
title ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls
title_full ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls
title_fullStr ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls
title_full_unstemmed ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls
title_short ProxECAT: Proxy External Controls Association Test. A new case-control gene region association test using allele frequencies from public controls
title_sort proxecat: proxy external controls association test. a new case-control gene region association test using allele frequencies from public controls
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6191077/
https://www.ncbi.nlm.nih.gov/pubmed/30325923
http://dx.doi.org/10.1371/journal.pgen.1007591
work_keys_str_mv AT hendricksaudreye proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT billupsstephenc proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT pikehamishnc proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT farooqiisadaf proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT zegginieleftheria proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT santoricostephaniea proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT barrosoines proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols
AT dupuisjosee proxecatproxyexternalcontrolsassociationtestanewcasecontrolgeneregionassociationtestusingallelefrequenciesfrompubliccontrols