Cargando…

Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets

Polygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific...

Descripción completa

Detalles Bibliográficos
Autores principales: Márquez-Luna, Carla, Gazal, Steven, Loh, Po-Ru, Kim, Samuel S., Furlotte, Nicholas, Auton, Adam, Price, Alkes L.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8523709/
https://www.ncbi.nlm.nih.gov/pubmed/34663819
http://dx.doi.org/10.1038/s41467-021-25171-9
_version_ 1784585348333961216
author Márquez-Luna, Carla
Gazal, Steven
Loh, Po-Ru
Kim, Samuel S.
Furlotte, Nicholas
Auton, Adam
Price, Alkes L.
author_facet Márquez-Luna, Carla
Gazal, Steven
Loh, Po-Ru
Kim, Samuel S.
Furlotte, Nicholas
Auton, Adam
Price, Alkes L.
author_sort Márquez-Luna, Carla
collection PubMed
description Polygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, including coding, conserved, regulatory, and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank (avg N = 373 K as training data). LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R(2) = 0.144; highest R(2) = 0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (N = 1107 K) increased prediction R(2) to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits.
format Online
Article
Text
id pubmed-8523709
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-85237092021-11-15 Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets Márquez-Luna, Carla Gazal, Steven Loh, Po-Ru Kim, Samuel S. Furlotte, Nicholas Auton, Adam Price, Alkes L. Nat Commun Article Polygenic risk prediction is a widely investigated topic because of its promising clinical applications. Genetic variants in functional regions of the genome are enriched for complex trait heritability. Here, we introduce a method for polygenic prediction, LDpred-funct, that leverages trait-specific functional priors to increase prediction accuracy. We fit priors using the recently developed baseline-LD model, including coding, conserved, regulatory, and LD-related annotations. We analytically estimate posterior mean causal effect sizes and then use cross-validation to regularize these estimates, improving prediction accuracy for sparse architectures. We applied LDpred-funct to predict 21 highly heritable traits in the UK Biobank (avg N = 373 K as training data). LDpred-funct attained a +4.6% relative improvement in average prediction accuracy (avg prediction R(2) = 0.144; highest R(2) = 0.413 for height) compared to SBayesR (the best method that does not incorporate functional information). For height, meta-analyzing training data from UK Biobank and 23andMe cohorts (N = 1107 K) increased prediction R(2) to 0.431. Our results show that incorporating functional priors improves polygenic prediction accuracy, consistent with the functional architecture of complex traits. Nature Publishing Group UK 2021-10-18 /pmc/articles/PMC8523709/ /pubmed/34663819 http://dx.doi.org/10.1038/s41467-021-25171-9 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Márquez-Luna, Carla
Gazal, Steven
Loh, Po-Ru
Kim, Samuel S.
Furlotte, Nicholas
Auton, Adam
Price, Alkes L.
Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets
title Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets
title_full Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets
title_fullStr Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets
title_full_unstemmed Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets
title_short Incorporating functional priors improves polygenic prediction accuracy in UK Biobank and 23andMe data sets
title_sort incorporating functional priors improves polygenic prediction accuracy in uk biobank and 23andme data sets
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8523709/
https://www.ncbi.nlm.nih.gov/pubmed/34663819
http://dx.doi.org/10.1038/s41467-021-25171-9
work_keys_str_mv AT marquezlunacarla incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT gazalsteven incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT lohporu incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT kimsamuels incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT furlottenicholas incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT autonadam incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets
AT pricealkesl incorporatingfunctionalpriorsimprovespolygenicpredictionaccuracyinukbiobankand23andmedatasets