Cargando…

Functional annotation signatures of disease susceptibility loci improve SNP association analysis

BACKGROUND: Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely...

Descripción completa

Detalles Bibliográficos
Autores principales: Iversen, Edwin S, Lipton, Gary, Clyde, Merlise A, Monteiro, Alvaro NA
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4041996/
https://www.ncbi.nlm.nih.gov/pubmed/24886216
http://dx.doi.org/10.1186/1471-2164-15-398
_version_ 1782318741806120960
author Iversen, Edwin S
Lipton, Gary
Clyde, Merlise A
Monteiro, Alvaro NA
author_facet Iversen, Edwin S
Lipton, Gary
Clyde, Merlise A
Monteiro, Alvaro NA
author_sort Iversen, Edwin S
collection PubMed
description BACKGROUND: Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. Here, we demonstrate how these data can be systematically integrated into an association study’s analysis plan. RESULTS: We developed a Bayesian statistical model for the prior probability of phenotype–genotype association that incorporates data from past association studies and publicly available functional annotation data regarding the susceptibility variants under study. The model takes the form of a binary regression of association status on a set of annotation variables whose coefficients were estimated through an analysis of associated SNPs in the GWAS Catalog (GC). The functional predictors examined included measures that have been demonstrated to correlate with the association status of SNPs in the GC and some whose utility in this regard is speculative: summaries of the UCSC Human Genome Browser ENCODE super–track data, dbSNP function class, sequence conservation summaries, proximity to genomic variants in the Database of Genomic Variants and known regulatory elements in the Open Regulatory Annotation database, PolyPhen–2 probabilities and RegulomeDB categories. Because we expected that only a fraction of the annotations would contribute to predicting association, we employed a penalized likelihood method to reduce the impact of non–informative predictors and evaluated the model’s ability to predict GC SNPs not used to construct the model. We show that the functional data alone are predictive of a SNP’s presence in the GC. Further, using data from a genome–wide study of ovarian cancer, we demonstrate that their use as prior data when testing for association is practical at the genome–wide scale and improves power to detect associations. CONCLUSIONS: We show how diverse functional annotations can be efficiently combined to create ‘functional signatures’ that predict the a priori odds of a variant’s association to a trait and how these signatures can be integrated into a standard genome–wide–scale association analysis, resulting in improved power to detect truly associated variants. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-398) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4041996
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-40419962014-06-06 Functional annotation signatures of disease susceptibility loci improve SNP association analysis Iversen, Edwin S Lipton, Gary Clyde, Merlise A Monteiro, Alvaro NA BMC Genomics Methodology Article BACKGROUND: Genetic association studies are conducted to discover genetic loci that contribute to an inherited trait, identify the variants behind these associations and ascertain their functional role in determining the phenotype. To date, functional annotations of the genetic variants have rarely played more than an indirect role in assessing evidence for association. Here, we demonstrate how these data can be systematically integrated into an association study’s analysis plan. RESULTS: We developed a Bayesian statistical model for the prior probability of phenotype–genotype association that incorporates data from past association studies and publicly available functional annotation data regarding the susceptibility variants under study. The model takes the form of a binary regression of association status on a set of annotation variables whose coefficients were estimated through an analysis of associated SNPs in the GWAS Catalog (GC). The functional predictors examined included measures that have been demonstrated to correlate with the association status of SNPs in the GC and some whose utility in this regard is speculative: summaries of the UCSC Human Genome Browser ENCODE super–track data, dbSNP function class, sequence conservation summaries, proximity to genomic variants in the Database of Genomic Variants and known regulatory elements in the Open Regulatory Annotation database, PolyPhen–2 probabilities and RegulomeDB categories. Because we expected that only a fraction of the annotations would contribute to predicting association, we employed a penalized likelihood method to reduce the impact of non–informative predictors and evaluated the model’s ability to predict GC SNPs not used to construct the model. We show that the functional data alone are predictive of a SNP’s presence in the GC. Further, using data from a genome–wide study of ovarian cancer, we demonstrate that their use as prior data when testing for association is practical at the genome–wide scale and improves power to detect associations. CONCLUSIONS: We show how diverse functional annotations can be efficiently combined to create ‘functional signatures’ that predict the a priori odds of a variant’s association to a trait and how these signatures can be integrated into a standard genome–wide–scale association analysis, resulting in improved power to detect truly associated variants. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2164-15-398) contains supplementary material, which is available to authorized users. BioMed Central 2014-05-24 /pmc/articles/PMC4041996/ /pubmed/24886216 http://dx.doi.org/10.1186/1471-2164-15-398 Text en © Iversen et al.; licensee BioMed Central Ltd. 2014 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Iversen, Edwin S
Lipton, Gary
Clyde, Merlise A
Monteiro, Alvaro NA
Functional annotation signatures of disease susceptibility loci improve SNP association analysis
title Functional annotation signatures of disease susceptibility loci improve SNP association analysis
title_full Functional annotation signatures of disease susceptibility loci improve SNP association analysis
title_fullStr Functional annotation signatures of disease susceptibility loci improve SNP association analysis
title_full_unstemmed Functional annotation signatures of disease susceptibility loci improve SNP association analysis
title_short Functional annotation signatures of disease susceptibility loci improve SNP association analysis
title_sort functional annotation signatures of disease susceptibility loci improve snp association analysis
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4041996/
https://www.ncbi.nlm.nih.gov/pubmed/24886216
http://dx.doi.org/10.1186/1471-2164-15-398
work_keys_str_mv AT iversenedwins functionalannotationsignaturesofdiseasesusceptibilitylociimprovesnpassociationanalysis
AT liptongary functionalannotationsignaturesofdiseasesusceptibilitylociimprovesnpassociationanalysis
AT clydemerlisea functionalannotationsignaturesofdiseasesusceptibilitylociimprovesnpassociationanalysis
AT monteiroalvarona functionalannotationsignaturesofdiseasesusceptibilitylociimprovesnpassociationanalysis