Cargando…
A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics
Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks h...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Cold Spring Harbor Laboratory
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10081403/ https://www.ncbi.nlm.nih.gov/pubmed/37034739 http://dx.doi.org/10.1101/2023.03.27.23287801 |
_version_ | 1785021117100982272 |
---|---|
author | Zhang, Zixuan Jung, Junghyun Kim, Artem Suboc, Noah Gazal, Steven Mancuso, Nicholas |
author_facet | Zhang, Zixuan Jung, Junghyun Kim, Artem Suboc, Noah Gazal, Steven Mancuso, Nicholas |
author_sort | Zhang, Zixuan |
collection | PubMed |
description | Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks has been challenging. Here, we propose FactorGo, a scalable variational factor analysis model to identify and characterize pleiotropic components using biobank GWAS summary data. In extensive simulations, we observe that FactorGo outperforms the state-of-the-art (model-free) approach tSVD in capturing latent pleiotropic factors across phenotypes, while maintaining a similar computational cost. We apply FactorGo to estimate 100 latent pleiotropic factors from GWAS summary data of 2,483 phenotypes measured in European-ancestry Pan-UK BioBank individuals (N=420,531). Next, we find that factors from FactorGo are more enriched with relevant tissue-specific annotations than those identified by tSVD (P=2.58E-10), and validate our approach by recapitulating brain-specific enrichment for BMI and the height-related connection between reproductive system and muscular-skeletal growth. Finally, our analyses suggest novel shared etiologies between rheumatoid arthritis and periodontal condition, in addition to alkaline phosphatase as a candidate prognostic biomarker for prostate cancer. Overall, FactorGo improves our biological understanding of shared etiologies across thousands of GWAS. |
format | Online Article Text |
id | pubmed-10081403 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Cold Spring Harbor Laboratory |
record_format | MEDLINE/PubMed |
spelling | pubmed-100814032023-04-08 A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics Zhang, Zixuan Jung, Junghyun Kim, Artem Suboc, Noah Gazal, Steven Mancuso, Nicholas medRxiv Article Genome-wide association studies (GWAS) across thousands of traits have revealed the pervasive pleiotropy of trait-associated genetic variants. While methods have been proposed to characterize pleiotropic components across groups of phenotypes, scaling these approaches to ultra large-scale biobanks has been challenging. Here, we propose FactorGo, a scalable variational factor analysis model to identify and characterize pleiotropic components using biobank GWAS summary data. In extensive simulations, we observe that FactorGo outperforms the state-of-the-art (model-free) approach tSVD in capturing latent pleiotropic factors across phenotypes, while maintaining a similar computational cost. We apply FactorGo to estimate 100 latent pleiotropic factors from GWAS summary data of 2,483 phenotypes measured in European-ancestry Pan-UK BioBank individuals (N=420,531). Next, we find that factors from FactorGo are more enriched with relevant tissue-specific annotations than those identified by tSVD (P=2.58E-10), and validate our approach by recapitulating brain-specific enrichment for BMI and the height-related connection between reproductive system and muscular-skeletal growth. Finally, our analyses suggest novel shared etiologies between rheumatoid arthritis and periodontal condition, in addition to alkaline phosphatase as a candidate prognostic biomarker for prostate cancer. Overall, FactorGo improves our biological understanding of shared etiologies across thousands of GWAS. Cold Spring Harbor Laboratory 2023-03-29 /pmc/articles/PMC10081403/ /pubmed/37034739 http://dx.doi.org/10.1101/2023.03.27.23287801 Text en https://creativecommons.org/licenses/by-nc-nd/4.0/This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (https://creativecommons.org/licenses/by-nc-nd/4.0/) , which allows reusers to copy and distribute the material in any medium or format in unadapted form only, for noncommercial purposes only, and only so long as attribution is given to the creator. |
spellingShingle | Article Zhang, Zixuan Jung, Junghyun Kim, Artem Suboc, Noah Gazal, Steven Mancuso, Nicholas A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics |
title | A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics |
title_full | A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics |
title_fullStr | A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics |
title_full_unstemmed | A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics |
title_short | A scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using GWAS summary statistics |
title_sort | scalable variational approach to characterize pleiotropic components across thousands of human diseases and complex traits using gwas summary statistics |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10081403/ https://www.ncbi.nlm.nih.gov/pubmed/37034739 http://dx.doi.org/10.1101/2023.03.27.23287801 |
work_keys_str_mv | AT zhangzixuan ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT jungjunghyun ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT kimartem ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT subocnoah ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT gazalsteven ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT mancusonicholas ascalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT zhangzixuan scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT jungjunghyun scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT kimartem scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT subocnoah scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT gazalsteven scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics AT mancusonicholas scalablevariationalapproachtocharacterizepleiotropiccomponentsacrossthousandsofhumandiseasesandcomplextraitsusinggwassummarystatistics |