Cargando…
Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
Genome-wide association studies (GWAS) have associated many single variants with complex disease, yet the better part of heritable complex disease risk remains unexplained. Analytical tools designed to work under specific population genetic models are needed. Rare variants are increasingly shown to...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4825638/ https://www.ncbi.nlm.nih.gov/pubmed/26865700 http://dx.doi.org/10.1534/g3.115.026013 |
_version_ | 1782426247371948032 |
---|---|
author | Sanjak, Jaleal S. Long, Anthony D. Thornton, Kevin R. |
author_facet | Sanjak, Jaleal S. Long, Anthony D. Thornton, Kevin R. |
author_sort | Sanjak, Jaleal S. |
collection | PubMed |
description | Genome-wide association studies (GWAS) have associated many single variants with complex disease, yet the better part of heritable complex disease risk remains unexplained. Analytical tools designed to work under specific population genetic models are needed. Rare variants are increasingly shown to be important in human complex disease, but most existing GWAS data do not cover rare variants. Explicit population genetic models predict that genes contributing to complex traits and experiencing recurrent, unconditionally deleterious, mutation will harbor multiple rare, causative mutations of subtle effect. It is difficult to identify genes harboring rare variants of large effect that contribute to complex disease risk via the single marker association tests typically used in GWAS. Gene/region-based association tests may have the power detect associations by combining information from multiple markers, but have yielded limited success in practice. This is partially because many methods have not been widely applied. Here, we empirically demonstrate the utility of a procedure based on the rank truncated product (RTP) method, filtered to reduce the effects of linkage disequilibrium. We apply the procedure to the Wellcome Trust Case Control Consortium (WTCCC) data set, and uncover previously unidentified associations, some of which have been replicated in much larger studies. We show that, in the absence of significant rare variant coverage, RTP based methods still have the power to detect associated genes. We recommend that RTP-based methods be applied to all existing GWAS data to maximize the usefulness of those data. For this, we provide efficient software implementing our procedure. |
format | Online Article Text |
id | pubmed-4825638 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-48256382016-04-11 Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data Sanjak, Jaleal S. Long, Anthony D. Thornton, Kevin R. G3 (Bethesda) Investigations Genome-wide association studies (GWAS) have associated many single variants with complex disease, yet the better part of heritable complex disease risk remains unexplained. Analytical tools designed to work under specific population genetic models are needed. Rare variants are increasingly shown to be important in human complex disease, but most existing GWAS data do not cover rare variants. Explicit population genetic models predict that genes contributing to complex traits and experiencing recurrent, unconditionally deleterious, mutation will harbor multiple rare, causative mutations of subtle effect. It is difficult to identify genes harboring rare variants of large effect that contribute to complex disease risk via the single marker association tests typically used in GWAS. Gene/region-based association tests may have the power detect associations by combining information from multiple markers, but have yielded limited success in practice. This is partially because many methods have not been widely applied. Here, we empirically demonstrate the utility of a procedure based on the rank truncated product (RTP) method, filtered to reduce the effects of linkage disequilibrium. We apply the procedure to the Wellcome Trust Case Control Consortium (WTCCC) data set, and uncover previously unidentified associations, some of which have been replicated in much larger studies. We show that, in the absence of significant rare variant coverage, RTP based methods still have the power to detect associated genes. We recommend that RTP-based methods be applied to all existing GWAS data to maximize the usefulness of those data. For this, we provide efficient software implementing our procedure. Genetics Society of America 2016-02-09 /pmc/articles/PMC4825638/ /pubmed/26865700 http://dx.doi.org/10.1534/g3.115.026013 Text en Copyright © 2016 Sanjak et al. |
spellingShingle | Investigations Sanjak, Jaleal S. Long, Anthony D. Thornton, Kevin R. Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data |
title | Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data |
title_full | Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data |
title_fullStr | Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data |
title_full_unstemmed | Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data |
title_short | Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data |
title_sort | efficient software for multi-marker, region-based analysis of gwas data |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4825638/ https://www.ncbi.nlm.nih.gov/pubmed/26865700 http://dx.doi.org/10.1534/g3.115.026013 |
work_keys_str_mv | AT sanjakjaleals efficientsoftwareformultimarkerregionbasedanalysisofgwasdata AT longanthonyd efficientsoftwareformultimarkerregionbasedanalysisofgwasdata AT thorntonkevinr efficientsoftwareformultimarkerregionbasedanalysisofgwasdata |