Cargando…

LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies

BACKGROUND: To date, genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants among a variety of traits/diseases, shedding light on the genetic architecture of complex disease. The polygenicity of complex diseases is a widely accepted phenomenon throu...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Yi, Dai, Mingwei, Huang, Jian, Lin, Xinyi, Yang, Can, Chen, Min, Liu, Jin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022345/
https://www.ncbi.nlm.nih.gov/pubmed/29954342
http://dx.doi.org/10.1186/s12864-018-4851-2
_version_ 1783335660217696256
author Yang, Yi
Dai, Mingwei
Huang, Jian
Lin, Xinyi
Yang, Can
Chen, Min
Liu, Jin
author_facet Yang, Yi
Dai, Mingwei
Huang, Jian
Lin, Xinyi
Yang, Can
Chen, Min
Liu, Jin
author_sort Yang, Yi
collection PubMed
description BACKGROUND: To date, genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants among a variety of traits/diseases, shedding light on the genetic architecture of complex disease. The polygenicity of complex diseases is a widely accepted phenomenon through which a vast number of risk variants, each with a modest individual effect, collectively contribute to the heritability of complex diseases. This imposes a major challenge on fully characterizing the genetic bases of complex diseases. An immediate implication of polygenicity is that a much larger sample size is required to detect individual risk variants with weak/moderate effects. Meanwhile, accumulating evidence suggests that different complex diseases can share genetic risk variants, a phenomenon known as pleiotropy. RESULTS: In this study, we propose a statistical framework for Leveraging Pleiotropic effects in large-scale GWAS data (LPG). LPG utilizes a variational Bayesian expectation-maximization (VBEM) algorithm, making it computationally efficient and scalable for genome-wide-scale analysis. To demonstrate the advantages of LPG over existing methods that do not leverage pleiotropy, we conducted extensive simulation studies and applied LPG to analyze two pairs of disorders (Crohn’s disease and Type 1 diabetes, as well as rheumatoid arthritis and Type 1 diabetes). The results indicate that by levelaging pleiotropy, LPG can improve the power of prioritization of risk variants and the accuracy of risk prediction. CONCLUSIONS: Our methodology provides a novel and efficient tool to detect pleiotropy among GWAS data for multiple traits/diseases collected from different studies. The software is available at https://github.com/Shufeyangyi2015310117/LPG. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4851-2) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6022345
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-60223452018-07-09 LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies Yang, Yi Dai, Mingwei Huang, Jian Lin, Xinyi Yang, Can Chen, Min Liu, Jin BMC Genomics Methodology Article BACKGROUND: To date, genome-wide association studies (GWAS) have successfully identified tens of thousands of genetic variants among a variety of traits/diseases, shedding light on the genetic architecture of complex disease. The polygenicity of complex diseases is a widely accepted phenomenon through which a vast number of risk variants, each with a modest individual effect, collectively contribute to the heritability of complex diseases. This imposes a major challenge on fully characterizing the genetic bases of complex diseases. An immediate implication of polygenicity is that a much larger sample size is required to detect individual risk variants with weak/moderate effects. Meanwhile, accumulating evidence suggests that different complex diseases can share genetic risk variants, a phenomenon known as pleiotropy. RESULTS: In this study, we propose a statistical framework for Leveraging Pleiotropic effects in large-scale GWAS data (LPG). LPG utilizes a variational Bayesian expectation-maximization (VBEM) algorithm, making it computationally efficient and scalable for genome-wide-scale analysis. To demonstrate the advantages of LPG over existing methods that do not leverage pleiotropy, we conducted extensive simulation studies and applied LPG to analyze two pairs of disorders (Crohn’s disease and Type 1 diabetes, as well as rheumatoid arthritis and Type 1 diabetes). The results indicate that by levelaging pleiotropy, LPG can improve the power of prioritization of risk variants and the accuracy of risk prediction. CONCLUSIONS: Our methodology provides a novel and efficient tool to detect pleiotropy among GWAS data for multiple traits/diseases collected from different studies. The software is available at https://github.com/Shufeyangyi2015310117/LPG. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s12864-018-4851-2) contains supplementary material, which is available to authorized users. BioMed Central 2018-06-28 /pmc/articles/PMC6022345/ /pubmed/29954342 http://dx.doi.org/10.1186/s12864-018-4851-2 Text en © The Author(s) 2018 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License(http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver(http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Methodology Article
Yang, Yi
Dai, Mingwei
Huang, Jian
Lin, Xinyi
Yang, Can
Chen, Min
Liu, Jin
LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
title LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
title_full LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
title_fullStr LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
title_full_unstemmed LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
title_short LPG: A four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
title_sort lpg: a four-group probabilistic approach to leveraging pleiotropy in genome-wide association studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6022345/
https://www.ncbi.nlm.nih.gov/pubmed/29954342
http://dx.doi.org/10.1186/s12864-018-4851-2
work_keys_str_mv AT yangyi lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies
AT daimingwei lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies
AT huangjian lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies
AT linxinyi lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies
AT yangcan lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies
AT chenmin lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies
AT liujin lpgafourgroupprobabilisticapproachtoleveragingpleiotropyingenomewideassociationstudies