Cargando…

Sample Size and Statistical Power Calculation in Genetic Association Studies

A sample size with sufficient statistical power is critical to the success of genetic association studies to detect causal genes of human complex diseases. Genome-wide association studies require much larger sample sizes to achieve an adequate statistical power. We estimated the statistical power wi...

Descripción completa

Detalles Bibliográficos
Autores principales: Hong, Eun Pyo, Park, Ji Wan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Korea Genome Organization 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480678/
https://www.ncbi.nlm.nih.gov/pubmed/23105939
http://dx.doi.org/10.5808/GI.2012.10.2.117
_version_ 1782247596422594560
author Hong, Eun Pyo
Park, Ji Wan
author_facet Hong, Eun Pyo
Park, Ji Wan
author_sort Hong, Eun Pyo
collection PubMed
description A sample size with sufficient statistical power is critical to the success of genetic association studies to detect causal genes of human complex diseases. Genome-wide association studies require much larger sample sizes to achieve an adequate statistical power. We estimated the statistical power with increasing numbers of markers analyzed and compared the sample sizes that were required in case-control studies and case-parent studies. We computed the effective sample size and statistical power using Genetic Power Calculator. An analysis using a larger number of markers requires a larger sample size. Testing a single-nucleotide polymorphism (SNP) marker requires 248 cases, while testing 500,000 SNPs and 1 million markers requires 1,206 cases and 1,255 cases, respectively, under the assumption of an odds ratio of 2, 5% disease prevalence, 5% minor allele frequency, complete linkage disequilibrium (LD), 1:1 case/control ratio, and a 5% error rate in an allelic test. Under a dominant model, a smaller sample size is required to achieve 80% power than other genetic models. We found that a much lower sample size was required with a strong effect size, common SNP, and increased LD. In addition, studying a common disease in a case-control study of a 1:4 case-control ratio is one way to achieve higher statistical power. We also found that case-parent studies require more samples than case-control studies. Although we have not covered all plausible cases in study design, the estimates of sample size and statistical power computed under various assumptions in this study may be useful to determine the sample size in designing a population-based genetic association study.
format Online
Article
Text
id pubmed-3480678
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Korea Genome Organization
record_format MEDLINE/PubMed
spelling pubmed-34806782012-10-26 Sample Size and Statistical Power Calculation in Genetic Association Studies Hong, Eun Pyo Park, Ji Wan Genomics Inf Article A sample size with sufficient statistical power is critical to the success of genetic association studies to detect causal genes of human complex diseases. Genome-wide association studies require much larger sample sizes to achieve an adequate statistical power. We estimated the statistical power with increasing numbers of markers analyzed and compared the sample sizes that were required in case-control studies and case-parent studies. We computed the effective sample size and statistical power using Genetic Power Calculator. An analysis using a larger number of markers requires a larger sample size. Testing a single-nucleotide polymorphism (SNP) marker requires 248 cases, while testing 500,000 SNPs and 1 million markers requires 1,206 cases and 1,255 cases, respectively, under the assumption of an odds ratio of 2, 5% disease prevalence, 5% minor allele frequency, complete linkage disequilibrium (LD), 1:1 case/control ratio, and a 5% error rate in an allelic test. Under a dominant model, a smaller sample size is required to achieve 80% power than other genetic models. We found that a much lower sample size was required with a strong effect size, common SNP, and increased LD. In addition, studying a common disease in a case-control study of a 1:4 case-control ratio is one way to achieve higher statistical power. We also found that case-parent studies require more samples than case-control studies. Although we have not covered all plausible cases in study design, the estimates of sample size and statistical power computed under various assumptions in this study may be useful to determine the sample size in designing a population-based genetic association study. Korea Genome Organization 2012-06 2012-06-30 /pmc/articles/PMC3480678/ /pubmed/23105939 http://dx.doi.org/10.5808/GI.2012.10.2.117 Text en Copyright © 2012 by The Korea Genome Organization http://creativecommons.org/licenses/by-nc/3.0 It is identical to the Creative Commons Attribution Non-Commercial License (http://creativecommons.org/licenses/by-nc/3.0/).
spellingShingle Article
Hong, Eun Pyo
Park, Ji Wan
Sample Size and Statistical Power Calculation in Genetic Association Studies
title Sample Size and Statistical Power Calculation in Genetic Association Studies
title_full Sample Size and Statistical Power Calculation in Genetic Association Studies
title_fullStr Sample Size and Statistical Power Calculation in Genetic Association Studies
title_full_unstemmed Sample Size and Statistical Power Calculation in Genetic Association Studies
title_short Sample Size and Statistical Power Calculation in Genetic Association Studies
title_sort sample size and statistical power calculation in genetic association studies
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3480678/
https://www.ncbi.nlm.nih.gov/pubmed/23105939
http://dx.doi.org/10.5808/GI.2012.10.2.117
work_keys_str_mv AT hongeunpyo samplesizeandstatisticalpowercalculationingeneticassociationstudies
AT parkjiwan samplesizeandstatisticalpowercalculationingeneticassociationstudies