Cargando…

Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism

BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods consider...

Descripción completa

Detalles Bibliográficos
Autores principales: Woo, Hyung Jun, Yu, Chenggang, Kumar, Kamal, Gold, Bert, Reifman, Jaques
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5006276/
https://www.ncbi.nlm.nih.gov/pubmed/27576376
http://dx.doi.org/10.1186/s12864-016-2871-3
_version_ 1782451032839684096
author Woo, Hyung Jun
Yu, Chenggang
Kumar, Kamal
Gold, Bert
Reifman, Jaques
author_facet Woo, Hyung Jun
Yu, Chenggang
Kumar, Kamal
Gold, Bert
Reifman, Jaques
author_sort Woo, Hyung Jun
collection PubMed
description BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods considering each SNP pair separately have provided insights, a large portion of expected heritability may reside in higher-order interaction effects. RESULTS: We describe an inference approach (discrete discriminant analysis; DDA) designed to probe collective interactions while treating both genotypes and phenotypes as random variables. The genotype distributions in case and control groups are modeled separately based on empirical allele frequency and covariance data, whose differences yield disease risk parameters. We compared pairwise tests and collective inference methods, the latter based both on DDA and logistic regression. Analyses using simulated data demonstrated that significantly higher sensitivity and specificity can be achieved with collective inference in comparison to pairwise tests, and with DDA in comparison to logistic regression. Using age-related macular degeneration (AMD) data, we demonstrated two possible applications of DDA. In the first application, a genome-wide SNP set is reduced into a small number (∼100) of variants via filtering and SNP pairs with significant interactions are identified. We found that interactions between SNPs with highest AMD association were epigenetically active in the liver, adipocytes, and mesenchymal stem cells. In the other application, multiple groups of SNPs were formed from the genome-wide data and their relative strengths of association were compared using cross-validation. This analysis allowed us to discover novel collections of loci for which interactions between SNPs play significant roles in their disease association. In particular, we considered pathway-based groups of SNPs containing up to ∼10, 000 variants in each group. In addition to pathways related to complement activation, our collective inference pointed to pathway groups involved in phospholipid synthesis, oxidative stress, and apoptosis, consistent with the AMD pathogenesis mechanism where the dysfunction of retinal pigment epithelium cells plays central roles. CONCLUSIONS: The simultaneous inference of collective interaction effects within a set of SNPs has the potential to reveal novel aspects of disease association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2871-3) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5006276
institution National Center for Biotechnology Information
language English
publishDate 2016
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-50062762016-09-01 Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism Woo, Hyung Jun Yu, Chenggang Kumar, Kamal Gold, Bert Reifman, Jaques BMC Genomics Methodology Article BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods considering each SNP pair separately have provided insights, a large portion of expected heritability may reside in higher-order interaction effects. RESULTS: We describe an inference approach (discrete discriminant analysis; DDA) designed to probe collective interactions while treating both genotypes and phenotypes as random variables. The genotype distributions in case and control groups are modeled separately based on empirical allele frequency and covariance data, whose differences yield disease risk parameters. We compared pairwise tests and collective inference methods, the latter based both on DDA and logistic regression. Analyses using simulated data demonstrated that significantly higher sensitivity and specificity can be achieved with collective inference in comparison to pairwise tests, and with DDA in comparison to logistic regression. Using age-related macular degeneration (AMD) data, we demonstrated two possible applications of DDA. In the first application, a genome-wide SNP set is reduced into a small number (∼100) of variants via filtering and SNP pairs with significant interactions are identified. We found that interactions between SNPs with highest AMD association were epigenetically active in the liver, adipocytes, and mesenchymal stem cells. In the other application, multiple groups of SNPs were formed from the genome-wide data and their relative strengths of association were compared using cross-validation. This analysis allowed us to discover novel collections of loci for which interactions between SNPs play significant roles in their disease association. In particular, we considered pathway-based groups of SNPs containing up to ∼10, 000 variants in each group. In addition to pathways related to complement activation, our collective inference pointed to pathway groups involved in phospholipid synthesis, oxidative stress, and apoptosis, consistent with the AMD pathogenesis mechanism where the dysfunction of retinal pigment epithelium cells plays central roles. CONCLUSIONS: The simultaneous inference of collective interaction effects within a set of SNPs has the potential to reveal novel aspects of disease association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2871-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-30 /pmc/articles/PMC5006276/ /pubmed/27576376 http://dx.doi.org/10.1186/s12864-016-2871-3 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Woo, Hyung Jun
Yu, Chenggang
Kumar, Kamal
Gold, Bert
Reifman, Jaques
Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
title Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
title_full Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
title_fullStr Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
title_full_unstemmed Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
title_short Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
title_sort genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5006276/
https://www.ncbi.nlm.nih.gov/pubmed/27576376
http://dx.doi.org/10.1186/s12864-016-2871-3
work_keys_str_mv AT woohyungjun genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism
AT yuchenggang genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism
AT kumarkamal genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism
AT goldbert genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism
AT reifmanjaques genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism