Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies

Bibliographic Details
Main Authors: Song, Shuang; Jiang, Wei; Hou, Lin; Zhao, Hongyu
Format: Online Article Text
Language: English
Published: Public Library of Science 2020
Subjects:
Online Access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7039528/
https://www.ncbi.nlm.nih.gov/pubmed/32045423
http://dx.doi.org/10.1371/journal.pcbi.1007565
author Song, Shuang
Jiang, Wei
Hou, Lin
Zhao, Hongyu
collection PubMed
description Genetic risk prediction is an important problem in human genetics, and accurate prediction can facilitate disease prevention and treatment. The polygenic risk score (PRS) has become widely used because of its simplicity and effectiveness: the standard method needs only summary statistics from genome-wide association studies. Recently, several methods have been proposed to improve the standard PRS by using external information, such as linkage disequilibrium (LD) and functional annotations. In this paper, we introduce EB-PRS, a novel method that leverages information on effect sizes across all markers to improve prediction accuracy. Unlike most existing genetic risk prediction methods, our method requires neither parameter tuning nor external information. Applications to real data on six diseases (asthma, breast cancer, celiac disease, Crohn’s disease, Parkinson’s disease and type 2 diabetes) show that EB-PRS achieved relative improvements in predictive r² of 307.1%, 42.8%, 25.5%, 3.1%, 74.3% and 49.6%, respectively, over the standard PRS method with optimally tuned parameters. Compared with LDpred, which makes use of LD information, EB-PRS also achieved relative improvements of 37.9%, 33.6%, 8.6%, 36.2%, 40.6% and 10.8%. We note that our method is not the first to leverage effect size distributions. Here we justify our method by presenting its theoretical optimality property relative to existing methods in this class, and we substantiate this theoretical result with extensive simulations. The R package EBPRS, which implements our method, is available on CRAN.
format Online
Article
Text
id pubmed-7039528
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-7039528 2020-03-06 PLoS Comput Biol Research Article
Public Library of Science 2020-02-11 /pmc/articles/PMC7039528/ /pubmed/32045423 http://dx.doi.org/10.1371/journal.pcbi.1007565 Text en © 2020 Song et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
title Leveraging effect size distributions to improve polygenic risk scores derived from summary statistics of genome-wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7039528/
https://www.ncbi.nlm.nih.gov/pubmed/32045423
http://dx.doi.org/10.1371/journal.pcbi.1007565