Cargando…

graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data

Genome-wide association studies (GWAS) have successfully identified a large number of genetic variants associated with traits and diseases. However, it still remains challenging to fully understand the functional mechanisms underlying many associated variants. This is especially the case when we are...

Descripción completa

Detalles Bibliográficos
Autores principales: Deng, Qiaolan, Gupta, Arkobrato, Jeon, Hyeongseon, Nam, Jin Hyun, Yilmaz, Ayse Selen, Chang, Won, Pietrzak, Maciej, Li, Lang, Kim, Hang J., Chung, Dongjun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370274/
https://www.ncbi.nlm.nih.gov/pubmed/37501720
http://dx.doi.org/10.3389/fgene.2023.1079198
_version_ 1785077914732068864
author Deng, Qiaolan
Gupta, Arkobrato
Jeon, Hyeongseon
Nam, Jin Hyun
Yilmaz, Ayse Selen
Chang, Won
Pietrzak, Maciej
Li, Lang
Kim, Hang J.
Chung, Dongjun
author_facet Deng, Qiaolan
Gupta, Arkobrato
Jeon, Hyeongseon
Nam, Jin Hyun
Yilmaz, Ayse Selen
Chang, Won
Pietrzak, Maciej
Li, Lang
Kim, Hang J.
Chung, Dongjun
author_sort Deng, Qiaolan
collection PubMed
description Genome-wide association studies (GWAS) have successfully identified a large number of genetic variants associated with traits and diseases. However, it still remains challenging to fully understand the functional mechanisms underlying many associated variants. This is especially the case when we are interested in variants shared across multiple phenotypes. To address this challenge, we propose graph-GPA 2.0 (GGPA 2.0), a statistical framework to integrate GWAS datasets for multiple phenotypes and incorporate functional annotations within a unified framework. Our simulation studies showed that incorporating functional annotation data using GGPA 2.0 not only improves the detection of disease-associated variants, but also provides a more accurate estimation of relationships among diseases. Next, we analyzed five autoimmune diseases and five psychiatric disorders with the functional annotations derived from GenoSkyline and GenoSkyline-Plus, along with the prior disease graph generated by biomedical literature mining. For autoimmune diseases, GGPA 2.0 identified enrichment for blood-related epigenetic marks, especially B cells and regulatory T cells, across multiple diseases. Psychiatric disorders were enriched for brain-related epigenetic marks, especially the prefrontal cortex and the inferior temporal lobe for bipolar disorder and schizophrenia, respectively. In addition, the pleiotropy between bipolar disorder and schizophrenia was also detected. Finally, we found that GGPA 2.0 is robust to the use of irrelevant and/or incorrect functional annotations. These results demonstrate that GGPA 2.0 can be a powerful tool to identify genetic variants associated with each phenotype or those shared across multiple phenotypes, while also promoting an understanding of functional mechanisms underlying the associated variants.
format Online
Article
Text
id pubmed-10370274
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-103702742023-07-27 graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data Deng, Qiaolan Gupta, Arkobrato Jeon, Hyeongseon Nam, Jin Hyun Yilmaz, Ayse Selen Chang, Won Pietrzak, Maciej Li, Lang Kim, Hang J. Chung, Dongjun Front Genet Genetics Genome-wide association studies (GWAS) have successfully identified a large number of genetic variants associated with traits and diseases. However, it still remains challenging to fully understand the functional mechanisms underlying many associated variants. This is especially the case when we are interested in variants shared across multiple phenotypes. To address this challenge, we propose graph-GPA 2.0 (GGPA 2.0), a statistical framework to integrate GWAS datasets for multiple phenotypes and incorporate functional annotations within a unified framework. Our simulation studies showed that incorporating functional annotation data using GGPA 2.0 not only improves the detection of disease-associated variants, but also provides a more accurate estimation of relationships among diseases. Next, we analyzed five autoimmune diseases and five psychiatric disorders with the functional annotations derived from GenoSkyline and GenoSkyline-Plus, along with the prior disease graph generated by biomedical literature mining. For autoimmune diseases, GGPA 2.0 identified enrichment for blood-related epigenetic marks, especially B cells and regulatory T cells, across multiple diseases. Psychiatric disorders were enriched for brain-related epigenetic marks, especially the prefrontal cortex and the inferior temporal lobe for bipolar disorder and schizophrenia, respectively. In addition, the pleiotropy between bipolar disorder and schizophrenia was also detected. Finally, we found that GGPA 2.0 is robust to the use of irrelevant and/or incorrect functional annotations. These results demonstrate that GGPA 2.0 can be a powerful tool to identify genetic variants associated with each phenotype or those shared across multiple phenotypes, while also promoting an understanding of functional mechanisms underlying the associated variants. Frontiers Media S.A. 2023-07-12 /pmc/articles/PMC10370274/ /pubmed/37501720 http://dx.doi.org/10.3389/fgene.2023.1079198 Text en Copyright © 2023 Deng, Gupta, Jeon, Nam, Yilmaz, Chang, Pietrzak, Li, Kim and Chung. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Deng, Qiaolan
Gupta, Arkobrato
Jeon, Hyeongseon
Nam, Jin Hyun
Yilmaz, Ayse Selen
Chang, Won
Pietrzak, Maciej
Li, Lang
Kim, Hang J.
Chung, Dongjun
graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
title graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
title_full graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
title_fullStr graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
title_full_unstemmed graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
title_short graph-GPA 2.0: improving multi-disease genetic analysis with integration of functional annotation data
title_sort graph-gpa 2.0: improving multi-disease genetic analysis with integration of functional annotation data
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10370274/
https://www.ncbi.nlm.nih.gov/pubmed/37501720
http://dx.doi.org/10.3389/fgene.2023.1079198
work_keys_str_mv AT dengqiaolan graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT guptaarkobrato graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT jeonhyeongseon graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT namjinhyun graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT yilmazayseselen graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT changwon graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT pietrzakmaciej graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT lilang graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT kimhangj graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata
AT chungdongjun graphgpa20improvingmultidiseasegeneticanalysiswithintegrationoffunctionalannotationdata