Cargando…

Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure

MOTIVATION: Genome-Wide Association Studies (GWAS) commonly assume phenotypic and genetic homogeneity that is not present in complex conditions. We designed Transformative Regression Analysis of Combined Effects (TRACE), a GWAS methodology that better accounts for clinical phenotype heterogeneity an...

Descripción completa

Detalles Bibliográficos
Autores principales: Gregg, John T., Himes, Blanca E., Asselbergs, Folkert W., Moore, Jason H.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418568/
https://www.ncbi.nlm.nih.gov/pubmed/37577697
http://dx.doi.org/10.1101/2023.08.02.23293567
_version_ 1785088295427899392
author Gregg, John T.
Himes, Blanca E.
Asselbergs, Folkert W.
Moore, Jason H.
author_facet Gregg, John T.
Himes, Blanca E.
Asselbergs, Folkert W.
Moore, Jason H.
author_sort Gregg, John T.
collection PubMed
description MOTIVATION: Genome-Wide Association Studies (GWAS) commonly assume phenotypic and genetic homogeneity that is not present in complex conditions. We designed Transformative Regression Analysis of Combined Effects (TRACE), a GWAS methodology that better accounts for clinical phenotype heterogeneity and identifies gene-by-environment (GxE) interactions. We demonstrated with UK Biobank (UKB) data that TRACE increased the variance explained in All-Cause Heart Failure (AHF) via the discovery of novel single nucleotide polymorphism (SNP) and SNP-by-environment (i.e. GxE) interaction associations. First, we transformed 312 AHF-related ICD10 codes (including AHF) into continuous low-dimensional features (i.e., latent phenotypes) for a more nuanced disease representation. Then, we ran a standard GWAS on our latent phenotypes to discover main effects and identified GxE interactions with target encoding. Genes near associated SNPs subsequently underwent enrichment analysis to explore potential functional mechanisms underlying associations. Latent phenotypes were regressed against their SNP hits and the estimated latent phenotype values were used to measure the amount of AHF variance explained. RESULTS: Our method identified over 100 main GWAS effects that were consistent with prior studies and hundreds of novel gene-by-smoking interactions, which collectively accounted for approximately 10% of AHF variance. This represents an improvement over traditional GWAS whose results account for a negligible proportion of AHF variance. Enrichment analyses suggested that hundreds of miRNAs mediated the SNP effect on various AHF-related biological pathways. The TRACE framework can be applied to decode the genetics of other complex diseases. AVAILABILITY: All code is available at https://github.com/EpistasisLab/latent_phenotype_project
format Online
Article
Text
id pubmed-10418568
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-104185682023-08-12 Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure Gregg, John T. Himes, Blanca E. Asselbergs, Folkert W. Moore, Jason H. medRxiv Article MOTIVATION: Genome-Wide Association Studies (GWAS) commonly assume phenotypic and genetic homogeneity that is not present in complex conditions. We designed Transformative Regression Analysis of Combined Effects (TRACE), a GWAS methodology that better accounts for clinical phenotype heterogeneity and identifies gene-by-environment (GxE) interactions. We demonstrated with UK Biobank (UKB) data that TRACE increased the variance explained in All-Cause Heart Failure (AHF) via the discovery of novel single nucleotide polymorphism (SNP) and SNP-by-environment (i.e. GxE) interaction associations. First, we transformed 312 AHF-related ICD10 codes (including AHF) into continuous low-dimensional features (i.e., latent phenotypes) for a more nuanced disease representation. Then, we ran a standard GWAS on our latent phenotypes to discover main effects and identified GxE interactions with target encoding. Genes near associated SNPs subsequently underwent enrichment analysis to explore potential functional mechanisms underlying associations. Latent phenotypes were regressed against their SNP hits and the estimated latent phenotype values were used to measure the amount of AHF variance explained. RESULTS: Our method identified over 100 main GWAS effects that were consistent with prior studies and hundreds of novel gene-by-smoking interactions, which collectively accounted for approximately 10% of AHF variance. This represents an improvement over traditional GWAS whose results account for a negligible proportion of AHF variance. Enrichment analyses suggested that hundreds of miRNAs mediated the SNP effect on various AHF-related biological pathways. The TRACE framework can be applied to decode the genetics of other complex diseases. AVAILABILITY: All code is available at https://github.com/EpistasisLab/latent_phenotype_project Cold Spring Harbor Laboratory 2023-08-04 /pmc/articles/PMC10418568/ /pubmed/37577697 http://dx.doi.org/10.1101/2023.08.02.23293567 Text en https://creativecommons.org/licenses/by-nd/4.0/This work is licensed under a Creative Commons Attribution-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, and only so long as attribution is given to the creator. The license allows for commercial use.
spellingShingle Article
Gregg, John T.
Himes, Blanca E.
Asselbergs, Folkert W.
Moore, Jason H.
Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure
title Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure
title_full Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure
title_fullStr Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure
title_full_unstemmed Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure
title_short Improving Genetic Association Studies with a Novel Methodology that Unveils the Hidden Complexity of All-Cause Heart Failure
title_sort improving genetic association studies with a novel methodology that unveils the hidden complexity of all-cause heart failure
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10418568/
https://www.ncbi.nlm.nih.gov/pubmed/37577697
http://dx.doi.org/10.1101/2023.08.02.23293567
work_keys_str_mv AT greggjohnt improvinggeneticassociationstudieswithanovelmethodologythatunveilsthehiddencomplexityofallcauseheartfailure
AT himesblancae improvinggeneticassociationstudieswithanovelmethodologythatunveilsthehiddencomplexityofallcauseheartfailure
AT asselbergsfolkertw improvinggeneticassociationstudieswithanovelmethodologythatunveilsthehiddencomplexityofallcauseheartfailure
AT moorejasonh improvinggeneticassociationstudieswithanovelmethodologythatunveilsthehiddencomplexityofallcauseheartfailure