Cargando…
Predictive accuracy of combined genetic and environmental risk scores
The substantial heritability of most complex diseases suggests that genetic data could provide useful risk prediction. To date the performance of genetic risk scores has fallen short of the potential implied by heritability, but this can be explained by insufficient sample sizes for estimating highl...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
John Wiley and Sons Inc.
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5847122/ https://www.ncbi.nlm.nih.gov/pubmed/29178508 http://dx.doi.org/10.1002/gepi.22092 |
_version_ | 1783305689206095872 |
---|---|
author | Dudbridge, Frank Pashayan, Nora Yang, Jian |
author_facet | Dudbridge, Frank Pashayan, Nora Yang, Jian |
author_sort | Dudbridge, Frank |
collection | PubMed |
description | The substantial heritability of most complex diseases suggests that genetic data could provide useful risk prediction. To date the performance of genetic risk scores has fallen short of the potential implied by heritability, but this can be explained by insufficient sample sizes for estimating highly polygenic models. When risk predictors already exist based on environment or lifestyle, two key questions are to what extent can they be improved by adding genetic information, and what is the ultimate potential of combined genetic and environmental risk scores? Here, we extend previous work on the predictive accuracy of polygenic scores to allow for an environmental score that may be correlated with the polygenic score, for example when the environmental factors mediate the genetic risk. We derive common measures of predictive accuracy and improvement as functions of the training sample size, chip heritabilities of disease and environmental score, and genetic correlation between disease and environmental risk factors. We consider simple addition of the two scores and a weighted sum that accounts for their correlation. Using examples from studies of cardiovascular disease and breast cancer, we show that improvements in discrimination are generally small but reasonable degrees of reclassification could be obtained with current sample sizes. Correlation between genetic and environmental scores has only minor effects on numerical results in realistic scenarios. In the longer term, as the accuracy of polygenic scores improves they will come to dominate the predictive accuracy compared to environmental scores. |
format | Online Article Text |
id | pubmed-5847122 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | John Wiley and Sons Inc. |
record_format | MEDLINE/PubMed |
spelling | pubmed-58471222018-03-20 Predictive accuracy of combined genetic and environmental risk scores Dudbridge, Frank Pashayan, Nora Yang, Jian Genet Epidemiol Research Articles The substantial heritability of most complex diseases suggests that genetic data could provide useful risk prediction. To date the performance of genetic risk scores has fallen short of the potential implied by heritability, but this can be explained by insufficient sample sizes for estimating highly polygenic models. When risk predictors already exist based on environment or lifestyle, two key questions are to what extent can they be improved by adding genetic information, and what is the ultimate potential of combined genetic and environmental risk scores? Here, we extend previous work on the predictive accuracy of polygenic scores to allow for an environmental score that may be correlated with the polygenic score, for example when the environmental factors mediate the genetic risk. We derive common measures of predictive accuracy and improvement as functions of the training sample size, chip heritabilities of disease and environmental score, and genetic correlation between disease and environmental risk factors. We consider simple addition of the two scores and a weighted sum that accounts for their correlation. Using examples from studies of cardiovascular disease and breast cancer, we show that improvements in discrimination are generally small but reasonable degrees of reclassification could be obtained with current sample sizes. Correlation between genetic and environmental scores has only minor effects on numerical results in realistic scenarios. In the longer term, as the accuracy of polygenic scores improves they will come to dominate the predictive accuracy compared to environmental scores. John Wiley and Sons Inc. 2017-11-26 2018-02 /pmc/articles/PMC5847122/ /pubmed/29178508 http://dx.doi.org/10.1002/gepi.22092 Text en © 2017 The Authors. Genetic Epidemiology Published by Wiley Periodicals, Inc. This is an open access article under the terms of the Creative Commons Attribution (http://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Articles Dudbridge, Frank Pashayan, Nora Yang, Jian Predictive accuracy of combined genetic and environmental risk scores |
title | Predictive accuracy of combined genetic and environmental risk scores |
title_full | Predictive accuracy of combined genetic and environmental risk scores |
title_fullStr | Predictive accuracy of combined genetic and environmental risk scores |
title_full_unstemmed | Predictive accuracy of combined genetic and environmental risk scores |
title_short | Predictive accuracy of combined genetic and environmental risk scores |
title_sort | predictive accuracy of combined genetic and environmental risk scores |
topic | Research Articles |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5847122/ https://www.ncbi.nlm.nih.gov/pubmed/29178508 http://dx.doi.org/10.1002/gepi.22092 |
work_keys_str_mv | AT dudbridgefrank predictiveaccuracyofcombinedgeneticandenvironmentalriskscores AT pashayannora predictiveaccuracyofcombinedgeneticandenvironmentalriskscores AT yangjian predictiveaccuracyofcombinedgeneticandenvironmentalriskscores |