Cargando…

Predictive accuracy of combined genetic and environmental risk scores

The substantial heritability of most complex diseases suggests that genetic data could provide useful risk prediction. To date the performance of genetic risk scores has fallen short of the potential implied by heritability, but this can be explained by insufficient sample sizes for estimating highl...

Descripción completa

Detalles Bibliográficos
Autores principales: Dudbridge, Frank, Pashayan, Nora, Yang, Jian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5847122/
https://www.ncbi.nlm.nih.gov/pubmed/29178508
http://dx.doi.org/10.1002/gepi.22092
_version_ 1783305689206095872
author Dudbridge, Frank
Pashayan, Nora
Yang, Jian
author_facet Dudbridge, Frank
Pashayan, Nora
Yang, Jian
author_sort Dudbridge, Frank
collection PubMed
description The substantial heritability of most complex diseases suggests that genetic data could provide useful risk prediction. To date the performance of genetic risk scores has fallen short of the potential implied by heritability, but this can be explained by insufficient sample sizes for estimating highly polygenic models. When risk predictors already exist based on environment or lifestyle, two key questions are to what extent can they be improved by adding genetic information, and what is the ultimate potential of combined genetic and environmental risk scores? Here, we extend previous work on the predictive accuracy of polygenic scores to allow for an environmental score that may be correlated with the polygenic score, for example when the environmental factors mediate the genetic risk. We derive common measures of predictive accuracy and improvement as functions of the training sample size, chip heritabilities of disease and environmental score, and genetic correlation between disease and environmental risk factors. We consider simple addition of the two scores and a weighted sum that accounts for their correlation. Using examples from studies of cardiovascular disease and breast cancer, we show that improvements in discrimination are generally small but reasonable degrees of reclassification could be obtained with current sample sizes. Correlation between genetic and environmental scores has only minor effects on numerical results in realistic scenarios. In the longer term, as the accuracy of polygenic scores improves they will come to dominate the predictive accuracy compared to environmental scores.
format Online
Article
Text
id pubmed-5847122
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-58471222018-03-20 Predictive accuracy of combined genetic and environmental risk scores Dudbridge, Frank Pashayan, Nora Yang, Jian Genet Epidemiol Research Articles The substantial heritability of most complex diseases suggests that genetic data could provide useful risk prediction. To date the performance of genetic risk scores has fallen short of the potential implied by heritability, but this can be explained by insufficient sample sizes for estimating highly polygenic models. When risk predictors already exist based on environment or lifestyle, two key questions are to what extent can they be improved by adding genetic information, and what is the ultimate potential of combined genetic and environmental risk scores? Here, we extend previous work on the predictive accuracy of polygenic scores to allow for an environmental score that may be correlated with the polygenic score, for example when the environmental factors mediate the genetic risk. We derive common measures of predictive accuracy and improvement as functions of the training sample size, chip heritabilities of disease and environmental score, and genetic correlation between disease and environmental risk factors. We consider simple addition of the two scores and a weighted sum that accounts for their correlation. Using examples from studies of cardiovascular disease and breast cancer, we show that improvements in discrimination are generally small but reasonable degrees of reclassification could be obtained with current sample sizes. Correlation between genetic and environmental scores has only minor effects on numerical results in realistic scenarios. In the longer term, as the accuracy of polygenic scores improves they will come to dominate the predictive accuracy compared to environmental scores. John Wiley and Sons Inc. 2017-11-26 2018-02 /pmc/articles/PMC5847122/ /pubmed/29178508 http://dx.doi.org/10.1002/gepi.22092 Text en © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Articles
Dudbridge, Frank
Pashayan, Nora
Yang, Jian
Predictive accuracy of combined genetic and environmental risk scores
title Predictive accuracy of combined genetic and environmental risk scores
title_full Predictive accuracy of combined genetic and environmental risk scores
title_fullStr Predictive accuracy of combined genetic and environmental risk scores
title_full_unstemmed Predictive accuracy of combined genetic and environmental risk scores
title_short Predictive accuracy of combined genetic and environmental risk scores
title_sort predictive accuracy of combined genetic and environmental risk scores
topic Research Articles
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5847122/
https://www.ncbi.nlm.nih.gov/pubmed/29178508
http://dx.doi.org/10.1002/gepi.22092
work_keys_str_mv AT dudbridgefrank predictiveaccuracyofcombinedgeneticandenvironmentalriskscores
AT pashayannora predictiveaccuracyofcombinedgeneticandenvironmentalriskscores
AT yangjian predictiveaccuracyofcombinedgeneticandenvironmentalriskscores