Cargando…

How Many Genetic Variants Remain to Be Discovered?

A great majority of genetic markers discovered in recent genome-wide association studies have small effect sizes, and they explain only a small fraction of the genetic contribution to the diseases. How many more variants can we expect to discover and what study sizes are needed? We derive the connec...

Descripción completa

Detalles Bibliográficos
Autores principales: Pawitan, Yudi, Seng, Ku Chee, Magnusson, Patrik K. E.
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2780697/
https://www.ncbi.nlm.nih.gov/pubmed/19956539
http://dx.doi.org/10.1371/journal.pone.0007969
_version_ 1782174520550883328
author Pawitan, Yudi
Seng, Ku Chee
Magnusson, Patrik K. E.
author_facet Pawitan, Yudi
Seng, Ku Chee
Magnusson, Patrik K. E.
author_sort Pawitan, Yudi
collection PubMed
description A great majority of genetic markers discovered in recent genome-wide association studies have small effect sizes, and they explain only a small fraction of the genetic contribution to the diseases. How many more variants can we expect to discover and what study sizes are needed? We derive the connection between the cumulative risk of the SNP variants to the latent genetic risk model and heritability of the disease. We determine the sample size required for case-control studies in order to achieve a certain expected number of discoveries in a collection of most significant SNPs. Assuming similar allele frequencies and effect sizes of the currently validated SNPs, complex phenotypes such as type-2 diabetes would need approximately 800 variants to explain its 40% heritability. Much smaller numbers of variants are needed if we assume rare-variants but higher penetrance models. We estimate that up to 50,000 cases and an equal number of controls are needed to discover 800 common low-penetrant variants among the top 5000 SNPs. Under common and rare low-penetrance models, the very large studies required to discover the numerous variants are probably at the limit of practical feasibility. Under rare-variant with medium- to high-penetrance models (odds-ratios between 1.6 and 4.0), studies comparable in size to many existing studies are adequate provided the genotyping technology can interrogate more and rarer variants.
format Text
id pubmed-2780697
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-27806972009-12-03 How Many Genetic Variants Remain to Be Discovered? Pawitan, Yudi Seng, Ku Chee Magnusson, Patrik K. E. PLoS One Research Article A great majority of genetic markers discovered in recent genome-wide association studies have small effect sizes, and they explain only a small fraction of the genetic contribution to the diseases. How many more variants can we expect to discover and what study sizes are needed? We derive the connection between the cumulative risk of the SNP variants to the latent genetic risk model and heritability of the disease. We determine the sample size required for case-control studies in order to achieve a certain expected number of discoveries in a collection of most significant SNPs. Assuming similar allele frequencies and effect sizes of the currently validated SNPs, complex phenotypes such as type-2 diabetes would need approximately 800 variants to explain its 40% heritability. Much smaller numbers of variants are needed if we assume rare-variants but higher penetrance models. We estimate that up to 50,000 cases and an equal number of controls are needed to discover 800 common low-penetrant variants among the top 5000 SNPs. Under common and rare low-penetrance models, the very large studies required to discover the numerous variants are probably at the limit of practical feasibility. Under rare-variant with medium- to high-penetrance models (odds-ratios between 1.6 and 4.0), studies comparable in size to many existing studies are adequate provided the genotyping technology can interrogate more and rarer variants. Public Library of Science 2009-12-02 /pmc/articles/PMC2780697/ /pubmed/19956539 http://dx.doi.org/10.1371/journal.pone.0007969 Text en Pawitan et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Pawitan, Yudi
Seng, Ku Chee
Magnusson, Patrik K. E.
How Many Genetic Variants Remain to Be Discovered?
title How Many Genetic Variants Remain to Be Discovered?
title_full How Many Genetic Variants Remain to Be Discovered?
title_fullStr How Many Genetic Variants Remain to Be Discovered?
title_full_unstemmed How Many Genetic Variants Remain to Be Discovered?
title_short How Many Genetic Variants Remain to Be Discovered?
title_sort how many genetic variants remain to be discovered?
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2780697/
https://www.ncbi.nlm.nih.gov/pubmed/19956539
http://dx.doi.org/10.1371/journal.pone.0007969
work_keys_str_mv AT pawitanyudi howmanygeneticvariantsremaintobediscovered
AT sengkuchee howmanygeneticvariantsremaintobediscovered
AT magnussonpatrikke howmanygeneticvariantsremaintobediscovered