Cargando…
Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism
BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods consider...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5006276/ https://www.ncbi.nlm.nih.gov/pubmed/27576376 http://dx.doi.org/10.1186/s12864-016-2871-3 |
_version_ | 1782451032839684096 |
---|---|
author | Woo, Hyung Jun Yu, Chenggang Kumar, Kamal Gold, Bert Reifman, Jaques |
author_facet | Woo, Hyung Jun Yu, Chenggang Kumar, Kamal Gold, Bert Reifman, Jaques |
author_sort | Woo, Hyung Jun |
collection | PubMed |
description | BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods considering each SNP pair separately have provided insights, a large portion of expected heritability may reside in higher-order interaction effects. RESULTS: We describe an inference approach (discrete discriminant analysis; DDA) designed to probe collective interactions while treating both genotypes and phenotypes as random variables. The genotype distributions in case and control groups are modeled separately based on empirical allele frequency and covariance data, whose differences yield disease risk parameters. We compared pairwise tests and collective inference methods, the latter based both on DDA and logistic regression. Analyses using simulated data demonstrated that significantly higher sensitivity and specificity can be achieved with collective inference in comparison to pairwise tests, and with DDA in comparison to logistic regression. Using age-related macular degeneration (AMD) data, we demonstrated two possible applications of DDA. In the first application, a genome-wide SNP set is reduced into a small number (∼100) of variants via filtering and SNP pairs with significant interactions are identified. We found that interactions between SNPs with highest AMD association were epigenetically active in the liver, adipocytes, and mesenchymal stem cells. In the other application, multiple groups of SNPs were formed from the genome-wide data and their relative strengths of association were compared using cross-validation. This analysis allowed us to discover novel collections of loci for which interactions between SNPs play significant roles in their disease association. In particular, we considered pathway-based groups of SNPs containing up to ∼10, 000 variants in each group. In addition to pathways related to complement activation, our collective inference pointed to pathway groups involved in phospholipid synthesis, oxidative stress, and apoptosis, consistent with the AMD pathogenesis mechanism where the dysfunction of retinal pigment epithelium cells plays central roles. CONCLUSIONS: The simultaneous inference of collective interaction effects within a set of SNPs has the potential to reveal novel aspects of disease association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2871-3) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5006276 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-50062762016-09-01 Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism Woo, Hyung Jun Yu, Chenggang Kumar, Kamal Gold, Bert Reifman, Jaques BMC Genomics Methodology Article BACKGROUND: Genome-wide association studies provide important insights to the genetic component of disease risks. However, an existing challenge is how to incorporate collective effects of interactions beyond the level of independent single nucleotide polymorphism (SNP) tests. While methods considering each SNP pair separately have provided insights, a large portion of expected heritability may reside in higher-order interaction effects. RESULTS: We describe an inference approach (discrete discriminant analysis; DDA) designed to probe collective interactions while treating both genotypes and phenotypes as random variables. The genotype distributions in case and control groups are modeled separately based on empirical allele frequency and covariance data, whose differences yield disease risk parameters. We compared pairwise tests and collective inference methods, the latter based both on DDA and logistic regression. Analyses using simulated data demonstrated that significantly higher sensitivity and specificity can be achieved with collective inference in comparison to pairwise tests, and with DDA in comparison to logistic regression. Using age-related macular degeneration (AMD) data, we demonstrated two possible applications of DDA. In the first application, a genome-wide SNP set is reduced into a small number (∼100) of variants via filtering and SNP pairs with significant interactions are identified. We found that interactions between SNPs with highest AMD association were epigenetically active in the liver, adipocytes, and mesenchymal stem cells. In the other application, multiple groups of SNPs were formed from the genome-wide data and their relative strengths of association were compared using cross-validation. This analysis allowed us to discover novel collections of loci for which interactions between SNPs play significant roles in their disease association. In particular, we considered pathway-based groups of SNPs containing up to ∼10, 000 variants in each group. In addition to pathways related to complement activation, our collective inference pointed to pathway groups involved in phospholipid synthesis, oxidative stress, and apoptosis, consistent with the AMD pathogenesis mechanism where the dysfunction of retinal pigment epithelium cells plays central roles. CONCLUSIONS: The simultaneous inference of collective interaction effects within a set of SNPs has the potential to reveal novel aspects of disease association. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12864-016-2871-3) contains supplementary material, which is available to authorized users. BioMed Central 2016-08-30 /pmc/articles/PMC5006276/ /pubmed/27576376 http://dx.doi.org/10.1186/s12864-016-2871-3 Text en © The Author(s) 2016 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Methodology Article Woo, Hyung Jun Yu, Chenggang Kumar, Kamal Gold, Bert Reifman, Jaques Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
title | Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
title_full | Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
title_fullStr | Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
title_full_unstemmed | Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
title_short | Genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
title_sort | genotype distribution-based inference of collective effects in genome-wide association studies: insights to age-related macular degeneration disease mechanism |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5006276/ https://www.ncbi.nlm.nih.gov/pubmed/27576376 http://dx.doi.org/10.1186/s12864-016-2871-3 |
work_keys_str_mv | AT woohyungjun genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism AT yuchenggang genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism AT kumarkamal genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism AT goldbert genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism AT reifmanjaques genotypedistributionbasedinferenceofcollectiveeffectsingenomewideassociationstudiesinsightstoagerelatedmaculardegenerationdiseasemechanism |