Cargando…

A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics

Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks h...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Zixuan, Jung, Junghyun, Kim, Artem, Suboc, Noah, Gazal, Steven, Mancuso, Nicholas
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Cold Spring Harbor Laboratory 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10081403/
https://www.ncbi.nlm.nih.gov/pubmed/37034739
http://dx.doi.org/10.1101/2023.03.27.23287801
_version_ 1785021117100982272
author Zhang, Zixuan
Jung, Junghyun
Kim, Artem
Suboc, Noah
Gazal, Steven
Mancuso, Nicholas
author_facet Zhang, Zixuan
Jung, Junghyun
Kim, Artem
Suboc, Noah
Gazal, Steven
Mancuso, Nicholas
author_sort Zhang, Zixuan
collection PubMed
description Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks has been challenging. Here, we propose FactorGo, a scalable variational factor analysis model to identify and characterize pleiotropic components using biobank GWAS summary data. In extensive simulations, we observe that FactorGo outperforms the state-of-the-art (model-free) approach tSVD in capturing latent pleiotropic factors across phenotypes, while maintaining a similar computational cost. We apply FactorGo to estimate 100 latent pleiotropic factors from GWAS summary data of 2,483 phenotypes measured in European-ancestry Pan-UK BioBank individuals (N=420,531). Next, we find that factors from FactorGo are more enriched with relevant tissue-specific annotations than those identified by tSVD (P=2.58E-10), and validate our approach by recapitulating brain-specific enrichment for BMI and the height-related connection between reproductive system and muscular-skeletal growth. Finally, our analyses suggest novel shared etiologies between rheumatoid arthritis and periodontal condition, in addition to alkaline phosphatase as a candidate prognostic biomarker for prostate cancer. Overall, FactorGo improves our biological understanding of shared etiologies across thousands of GWAS.
format Online
Article
Text
id pubmed-10081403
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Cold Spring Harbor Laboratory
record_format MEDLINE/PubMed
spelling pubmed-100814032023-04-08 A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics Zhang, Zixuan Jung, Junghyun Kim, Artem Suboc, Noah Gazal, Steven Mancuso, Nicholas medRxiv Article Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks has been challenging. Here, we propose FactorGo, a scalable variational factor analysis model to identify and characterize pleiotropic components using biobank GWAS summary data. In extensive simulations, we observe that FactorGo outperforms the state-of-the-art (model-free) approach tSVD in capturing latent pleiotropic factors across phenotypes, while maintaining a similar computational cost. We apply FactorGo to estimate 100 latent pleiotropic factors from GWAS summary data of 2,483 phenotypes measured in European-ancestry Pan-UK BioBank individuals (N=420,531). Next, we find that factors from FactorGo are more enriched with relevant tissue-specific annotations than those identified by tSVD (P=2.58E-10), and validate our approach by recapitulating brain-specific enrichment for BMI and the height-related connection between reproductive system and muscular-skeletal growth. Finally, our analyses suggest novel shared etiologies between rheumatoid arthritis and periodontal condition, in addition to alkaline phosphatase as a candidate prognostic biomarker for prostate cancer. Overall, FactorGo improves our biological understanding of shared etiologies across thousands of GWAS. Cold Spring Harbor Laboratory 2023-03-29 /pmc/articles/PMC10081403/ /pubmed/37034739 http://dx.doi.org/10.1101/2023.03.27.23287801 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator.
spellingShingle Article
Zhang, Zixuan
Jung, Junghyun
Kim, Artem
Suboc, Noah
Gazal, Steven
Mancuso, Nicholas
A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
title A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
title_full A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
title_fullStr A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
title_full_unstemmed A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
title_short A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
title_sort scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using gwas summary statistics
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10081403/
https://www.ncbi.nlm.nih.gov/pubmed/37034739
http://dx.doi.org/10.1101/2023.03.27.23287801
work_keys_str_mv AT zhangzixuan ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT jungjunghyun ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT kimartem ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT subocnoah ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT gazalsteven ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT mancusonicholas ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT zhangzixuan scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT jungjunghyun scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT kimartem scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT subocnoah scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT gazalsteven scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics
AT mancusonicholas scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics