Cargando…
Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
We report a new model to project the predictive performance of polygenic models based on the number and distribution of effect sizes for the underlying susceptibility alleles and the size of the training dataset. Using estimates of effect-size distribution and heritability derived from current studi...
Autores principales: | , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2013
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3729116/ https://www.ncbi.nlm.nih.gov/pubmed/23455638 http://dx.doi.org/10.1038/ng.2579 |
_version_ | 1782278946626207744 |
---|---|
author | Chatterjee, Nilanjan Wheeler, Bill Sampson, Joshua Hartge, Patricia Chanock, Stephen J. Park, Ju-Hyun |
author_facet | Chatterjee, Nilanjan Wheeler, Bill Sampson, Joshua Hartge, Patricia Chanock, Stephen J. Park, Ju-Hyun |
author_sort | Chatterjee, Nilanjan |
collection | PubMed |
description | We report a new model to project the predictive performance of polygenic models based on the number and distribution of effect sizes for the underlying susceptibility alleles and the size of the training dataset. Using estimates of effect-size distribution and heritability derived from current studies, we project that while 45% of the variance of height has been attributed to common tagging Single Nucleotide Polymorphisms (SNP), a model trained on one million people may only explain 33.4% of variance of the trait. Current studies can identify 3.0%, 1.1%, and 7.0%, of the populations who are at two-fold or higher than average risk for Type 2 diabetes, coronary artery disease and prostate cancer, respectively. Tripling of sample sizes could elevate the percentages to 18.8%, 6.1%, and 12.2%, respectively. The utility of future polygenic models will depend on achievable sample sizes, underlying genetic architecture and information on other risk-factors, including family history. |
format | Online Article Text |
id | pubmed-3729116 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2013 |
record_format | MEDLINE/PubMed |
spelling | pubmed-37291162013-10-01 Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies Chatterjee, Nilanjan Wheeler, Bill Sampson, Joshua Hartge, Patricia Chanock, Stephen J. Park, Ju-Hyun Nat Genet Article We report a new model to project the predictive performance of polygenic models based on the number and distribution of effect sizes for the underlying susceptibility alleles and the size of the training dataset. Using estimates of effect-size distribution and heritability derived from current studies, we project that while 45% of the variance of height has been attributed to common tagging Single Nucleotide Polymorphisms (SNP), a model trained on one million people may only explain 33.4% of variance of the trait. Current studies can identify 3.0%, 1.1%, and 7.0%, of the populations who are at two-fold or higher than average risk for Type 2 diabetes, coronary artery disease and prostate cancer, respectively. Tripling of sample sizes could elevate the percentages to 18.8%, 6.1%, and 12.2%, respectively. The utility of future polygenic models will depend on achievable sample sizes, underlying genetic architecture and information on other risk-factors, including family history. 2013-03-03 2013-04 /pmc/articles/PMC3729116/ /pubmed/23455638 http://dx.doi.org/10.1038/ng.2579 Text en Users may view, print, copy, download and text and data- mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Chatterjee, Nilanjan Wheeler, Bill Sampson, Joshua Hartge, Patricia Chanock, Stephen J. Park, Ju-Hyun Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
title | Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
title_full | Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
title_fullStr | Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
title_full_unstemmed | Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
title_short | Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
title_sort | projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3729116/ https://www.ncbi.nlm.nih.gov/pubmed/23455638 http://dx.doi.org/10.1038/ng.2579 |
work_keys_str_mv | AT chatterjeenilanjan projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies AT wheelerbill projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies AT sampsonjoshua projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies AT hartgepatricia projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies AT chanockstephenj projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies AT parkjuhyun projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies |