Cargando…

Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data

Genome-wide association studies (GWAS) have associated many single variants with complex disease, yet the better part of heritable complex disease risk remains unexplained. Analytical tools designed to work under specific population genetic models are needed. Rare variants are increasingly shown to...

Descripción completa

Detalles Bibliográficos
Autores principales: Sanjak, Jaleal S., Long, Anthony D., Thornton, Kevin R.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4825638/
https://www.ncbi.nlm.nih.gov/pubmed/26865700
http://dx.doi.org/10.1534/g3.115.026013
_version_ 1782426247371948032
author Sanjak, Jaleal S.
Long, Anthony D.
Thornton, Kevin R.
author_facet Sanjak, Jaleal S.
Long, Anthony D.
Thornton, Kevin R.
author_sort Sanjak, Jaleal S.
collection PubMed
description Genome-wide association studies (GWAS) have associated many single variants with complex disease, yet the better part of heritable complex disease risk remains unexplained. Analytical tools designed to work under specific population genetic models are needed. Rare variants are increasingly shown to be important in human complex disease, but most existing GWAS data do not cover rare variants. Explicit population genetic models predict that genes contributing to complex traits and experiencing recurrent, unconditionally deleterious, mutation will harbor multiple rare, causative mutations of subtle effect. It is difficult to identify genes harboring rare variants of large effect that contribute to complex disease risk via the single marker association tests typically used in GWAS. Gene/region-based association tests may have the power detect associations by combining information from multiple markers, but have yielded limited success in practice. This is partially because many methods have not been widely applied. Here, we empirically demonstrate the utility of a procedure based on the rank truncated product (RTP) method, filtered to reduce the effects of linkage disequilibrium. We apply the procedure to the Wellcome Trust Case Control Consortium (WTCCC) data set, and uncover previously unidentified associations, some of which have been replicated in much larger studies. We show that, in the absence of significant rare variant coverage, RTP based methods still have the power to detect associated genes. We recommend that RTP-based methods be applied to all existing GWAS data to maximize the usefulness of those data. For this, we provide efficient software implementing our procedure.
format Online
Article
Text
id pubmed-4825638
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-48256382016-04-11 Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data Sanjak, Jaleal S. Long, Anthony D. Thornton, Kevin R. G3 (Bethesda) Investigations Genome-wide association studies (GWAS) have associated many single variants with complex disease, yet the better part of heritable complex disease risk remains unexplained. Analytical tools designed to work under specific population genetic models are needed. Rare variants are increasingly shown to be important in human complex disease, but most existing GWAS data do not cover rare variants. Explicit population genetic models predict that genes contributing to complex traits and experiencing recurrent, unconditionally deleterious, mutation will harbor multiple rare, causative mutations of subtle effect. It is difficult to identify genes harboring rare variants of large effect that contribute to complex disease risk via the single marker association tests typically used in GWAS. Gene/region-based association tests may have the power detect associations by combining information from multiple markers, but have yielded limited success in practice. This is partially because many methods have not been widely applied. Here, we empirically demonstrate the utility of a procedure based on the rank truncated product (RTP) method, filtered to reduce the effects of linkage disequilibrium. We apply the procedure to the Wellcome Trust Case Control Consortium (WTCCC) data set, and uncover previously unidentified associations, some of which have been replicated in much larger studies. We show that, in the absence of significant rare variant coverage, RTP based methods still have the power to detect associated genes. We recommend that RTP-based methods be applied to all existing GWAS data to maximize the usefulness of those data. For this, we provide efficient software implementing our procedure. Genetics Society of America 2016-02-09 /pmc/articles/PMC4825638/ /pubmed/26865700 http://dx.doi.org/10.1534/g3.115.026013 Text en Copyright © 2016 Sanjak et al.
spellingShingle Investigations
Sanjak, Jaleal S.
Long, Anthony D.
Thornton, Kevin R.
Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
title Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
title_full Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
title_fullStr Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
title_full_unstemmed Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
title_short Efficient Software for Multi-marker, Region-Based Analysis of GWAS Data
title_sort efficient software for multi-marker, region-based analysis of gwas data
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4825638/
https://www.ncbi.nlm.nih.gov/pubmed/26865700
http://dx.doi.org/10.1534/g3.115.026013
work_keys_str_mv AT sanjakjaleals efficientsoftwareformultimarkerregionbasedanalysisofgwasdata
AT longanthonyd efficientsoftwareformultimarkerregionbasedanalysisofgwasdata
AT thorntonkevinr efficientsoftwareformultimarkerregionbasedanalysisofgwasdata