Cargando…

Improved polygenic prediction by Bayesian multiple regression on summary statistics

Accurate prediction of an individual’s phenotype from their DNA sequence is one of the great promises of genomics and precision medicine. We extend a powerful individual-level data Bayesian multiple regression model (BayesR) to one that utilises summary statistics from genome-wide association studie...

Descripción completa

Detalles Bibliográficos
Autores principales: Lloyd-Jones, Luke R., Zeng, Jian, Sidorenko, Julia, Yengo, Loïc, Moser, Gerhard, Kemper, Kathryn E., Wang, Huanwei, Zheng, Zhili, Magi, Reedik, Esko, Tõnu, Metspalu, Andres, Wray, Naomi R., Goddard, Michael E., Yang, Jian, Visscher, Peter M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6841727/
https://www.ncbi.nlm.nih.gov/pubmed/31704910
http://dx.doi.org/10.1038/s41467-019-12653-0
Descripción
Sumario:Accurate prediction of an individual’s phenotype from their DNA sequence is one of the great promises of genomics and precision medicine. We extend a powerful individual-level data Bayesian multiple regression model (BayesR) to one that utilises summary statistics from genome-wide association studies (GWAS), SBayesR. In simulation and cross-validation using 12 real traits and 1.1 million variants on 350,000 individuals from the UK Biobank, SBayesR improves prediction accuracy relative to commonly used state-of-the-art summary statistics methods at a fraction of the computational resources. Furthermore, using summary statistics for variants from the largest GWAS meta-analysis (n ≈ 700, 000) on height and BMI, we show that on average across traits and two independent data sets that SBayesR improves prediction R(2) by 5.2% relative to LDpred and by 26.5% relative to clumping and p value thresholding.