Cargando…

Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies

We report a new model to project the predictive performance of polygenic models based on the number and distribution of effect sizes for the underlying susceptibility alleles and the size of the training dataset. Using estimates of effect-size distribution and heritability derived from current studi...

Descripción completa

Detalles Bibliográficos
Autores principales: Chatterjee, Nilanjan, Wheeler, Bill, Sampson, Joshua, Hartge, Patricia, Chanock, Stephen J., Park, Ju-Hyun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3729116/
https://www.ncbi.nlm.nih.gov/pubmed/23455638
http://dx.doi.org/10.1038/ng.2579
_version_ 1782278946626207744
author Chatterjee, Nilanjan
Wheeler, Bill
Sampson, Joshua
Hartge, Patricia
Chanock, Stephen J.
Park, Ju-Hyun
author_facet Chatterjee, Nilanjan
Wheeler, Bill
Sampson, Joshua
Hartge, Patricia
Chanock, Stephen J.
Park, Ju-Hyun
author_sort Chatterjee, Nilanjan
collection PubMed
description We report a new model to project the predictive performance of polygenic models based on the number and distribution of effect sizes for the underlying susceptibility alleles and the size of the training dataset. Using estimates of effect-size distribution and heritability derived from current studies, we project that while 45% of the variance of height has been attributed to common tagging Single Nucleotide Polymorphisms (SNP), a model trained on one million people may only explain 33.4% of variance of the trait. Current studies can identify 3.0%, 1.1%, and 7.0%, of the populations who are at two-fold or higher than average risk for Type 2 diabetes, coronary artery disease and prostate cancer, respectively. Tripling of sample sizes could elevate the percentages to 18.8%, 6.1%, and 12.2%, respectively. The utility of future polygenic models will depend on achievable sample sizes, underlying genetic architecture and information on other risk-factors, including family history.
format Online
Article
Text
id pubmed-3729116
institution National Center for Biotechnology Information
language English
publishDate 2013
record_format MEDLINE/PubMed
spelling pubmed-37291162013-10-01 Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies Chatterjee, Nilanjan Wheeler, Bill Sampson, Joshua Hartge, Patricia Chanock, Stephen J. Park, Ju-Hyun Nat Genet Article We report a new model to project the predictive performance of polygenic models based on the number and distribution of effect sizes for the underlying susceptibility alleles and the size of the training dataset. Using estimates of effect-size distribution and heritability derived from current studies, we project that while 45% of the variance of height has been attributed to common tagging Single Nucleotide Polymorphisms (SNP), a model trained on one million people may only explain 33.4% of variance of the trait. Current studies can identify 3.0%, 1.1%, and 7.0%, of the populations who are at two-fold or higher than average risk for Type 2 diabetes, coronary artery disease and prostate cancer, respectively. Tripling of sample sizes could elevate the percentages to 18.8%, 6.1%, and 12.2%, respectively. The utility of future polygenic models will depend on achievable sample sizes, underlying genetic architecture and information on other risk-factors, including family history. 2013-03-03 2013-04 /pmc/articles/PMC3729116/ /pubmed/23455638 http://dx.doi.org/10.1038/ng.2579 Text en Users may view, print, copy, download and text and data- mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Chatterjee, Nilanjan
Wheeler, Bill
Sampson, Joshua
Hartge, Patricia
Chanock, Stephen J.
Park, Ju-Hyun
Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
title Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
title_full Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
title_fullStr Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
title_full_unstemmed Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
title_short Projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
title_sort projecting the performance of risk prediction based on polygenic analyses of genome-wide association studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3729116/
https://www.ncbi.nlm.nih.gov/pubmed/23455638
http://dx.doi.org/10.1038/ng.2579
work_keys_str_mv AT chatterjeenilanjan projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies
AT wheelerbill projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies
AT sampsonjoshua projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies
AT hartgepatricia projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies
AT chanockstephenj projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies
AT parkjuhyun projectingtheperformanceofriskpredictionbasedonpolygenicanalysesofgenomewideassociationstudies