Cargando…

Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling

BACKGROUND: Studies of association methods using DNA pooling of single nucleotide polymorphisms (SNPs) have focused primarily on the effects of "machine-error", number of replicates, and the size of the pool. We use the non-centrality parameter (NCP) for the analysis of variance test to co...

Descripción completa

Detalles Bibliográficos
Autores principales: Ji, Fei, Finch, Stephen J, Haynes, Chad, Mendell, Nancy R, Gordon, Derek
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2007
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1947971/
https://www.ncbi.nlm.nih.gov/pubmed/17634103
http://dx.doi.org/10.1186/1471-2164-8-238
_version_ 1782134495043911680
author Ji, Fei
Finch, Stephen J
Haynes, Chad
Mendell, Nancy R
Gordon, Derek
author_facet Ji, Fei
Finch, Stephen J
Haynes, Chad
Mendell, Nancy R
Gordon, Derek
author_sort Ji, Fei
collection PubMed
description BACKGROUND: Studies of association methods using DNA pooling of single nucleotide polymorphisms (SNPs) have focused primarily on the effects of "machine-error", number of replicates, and the size of the pool. We use the non-centrality parameter (NCP) for the analysis of variance test to compute the approximate power for genetic association tests with DNA pooling data on cases and controls. We incorporate genetic model parameters into the computation of the NCP. Parameters involved in the power calculation are disease allele frequency, frequency of the marker SNP allele in coupling with the disease locus, disease prevalence, genotype relative risk, sample size, genetic model, number of pools, number of replicates of each pool, and the proportion of variance of the pooled frequency estimate due to machine variability. We compute power for different settings of number of replicates and total number of genotypings when the genetic model parameters are fixed. Several significance levels are considered, including stringent significance levels (due to the increasing popularity of 100 K and 500 K SNP "chip" data). We use a factorial design with two to four settings of each parameter and multiple regression analysis to assess which parameters most significantly affect power. RESULTS: The power can increase substantially as the genotyping number increases. For a fixed number of genotypings, the power is a function of the number of replicates of each pool such that there is a setting with maximum power. The four most significant parameters affecting power for association are: (1) genotype relative risk, (2) genetic model, (3) sample size, and (4) the interaction term between disease and SNP marker allele probabilities. CONCLUSION: For a fixed number of genotypings, there is an optimal number of replicates of each pool that increases as the number of genotypings increases. Power is not substantially reduced when the number of replicates is close to but not equal to the optimal setting.
format Text
id pubmed-1947971
institution National Center for Biotechnology Information
language English
publishDate 2007
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-19479712007-08-14 Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling Ji, Fei Finch, Stephen J Haynes, Chad Mendell, Nancy R Gordon, Derek BMC Genomics Methodology Article BACKGROUND: Studies of association methods using DNA pooling of single nucleotide polymorphisms (SNPs) have focused primarily on the effects of "machine-error", number of replicates, and the size of the pool. We use the non-centrality parameter (NCP) for the analysis of variance test to compute the approximate power for genetic association tests with DNA pooling data on cases and controls. We incorporate genetic model parameters into the computation of the NCP. Parameters involved in the power calculation are disease allele frequency, frequency of the marker SNP allele in coupling with the disease locus, disease prevalence, genotype relative risk, sample size, genetic model, number of pools, number of replicates of each pool, and the proportion of variance of the pooled frequency estimate due to machine variability. We compute power for different settings of number of replicates and total number of genotypings when the genetic model parameters are fixed. Several significance levels are considered, including stringent significance levels (due to the increasing popularity of 100 K and 500 K SNP "chip" data). We use a factorial design with two to four settings of each parameter and multiple regression analysis to assess which parameters most significantly affect power. RESULTS: The power can increase substantially as the genotyping number increases. For a fixed number of genotypings, the power is a function of the number of replicates of each pool such that there is a setting with maximum power. The four most significant parameters affecting power for association are: (1) genotype relative risk, (2) genetic model, (3) sample size, and (4) the interaction term between disease and SNP marker allele probabilities. CONCLUSION: For a fixed number of genotypings, there is an optimal number of replicates of each pool that increases as the number of genotypings increases. Power is not substantially reduced when the number of replicates is close to but not equal to the optimal setting. BioMed Central 2007-07-16 /pmc/articles/PMC1947971/ /pubmed/17634103 http://dx.doi.org/10.1186/1471-2164-8-238 Text en Copyright © 2007 Ji et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Ji, Fei
Finch, Stephen J
Haynes, Chad
Mendell, Nancy R
Gordon, Derek
Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling
title Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling
title_full Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling
title_fullStr Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling
title_full_unstemmed Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling
title_short Incorporation of genetic model parameters for cost-effective designs of genetic association studies using DNA pooling
title_sort incorporation of genetic model parameters for cost-effective designs of genetic association studies using dna pooling
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1947971/
https://www.ncbi.nlm.nih.gov/pubmed/17634103
http://dx.doi.org/10.1186/1471-2164-8-238
work_keys_str_mv AT jifei incorporationofgeneticmodelparametersforcosteffectivedesignsofgeneticassociationstudiesusingdnapooling
AT finchstephenj incorporationofgeneticmodelparametersforcosteffectivedesignsofgeneticassociationstudiesusingdnapooling
AT hayneschad incorporationofgeneticmodelparametersforcosteffectivedesignsofgeneticassociationstudiesusingdnapooling
AT mendellnancyr incorporationofgeneticmodelparametersforcosteffectivedesignsofgeneticassociationstudiesusingdnapooling
AT gordonderek incorporationofgeneticmodelparametersforcosteffectivedesignsofgeneticassociationstudiesusingdnapooling