Cargando…

Identification of disease causing loci using an array-based genotyping approach on pooled DNA

BACKGROUND: Pooling genomic DNA samples within clinical classes of disease followed by genotyping on whole-genome SNP microarrays, allows for rapid and inexpensive genome-wide association studies. Key to the success of these studies is the accuracy of the allelic frequency calculations, the ability...

Descripción completa

Detalles Bibliográficos
Autores principales: Craig, David W, Huentelman, Matthew J, Hu-Lince, Diane, Zismann, Victoria L, Kruer, Michael C, Lee, Anne M, Puffenberger, Erik G, Pearson, John M, Stephan, Dietrich A
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2005
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1262713/
https://www.ncbi.nlm.nih.gov/pubmed/16197552
http://dx.doi.org/10.1186/1471-2164-6-138
_version_ 1782125889084981248
author Craig, David W
Huentelman, Matthew J
Hu-Lince, Diane
Zismann, Victoria L
Kruer, Michael C
Lee, Anne M
Puffenberger, Erik G
Pearson, John M
Stephan, Dietrich A
author_facet Craig, David W
Huentelman, Matthew J
Hu-Lince, Diane
Zismann, Victoria L
Kruer, Michael C
Lee, Anne M
Puffenberger, Erik G
Pearson, John M
Stephan, Dietrich A
author_sort Craig, David W
collection PubMed
description BACKGROUND: Pooling genomic DNA samples within clinical classes of disease followed by genotyping on whole-genome SNP microarrays, allows for rapid and inexpensive genome-wide association studies. Key to the success of these studies is the accuracy of the allelic frequency calculations, the ability to identify false-positives arising from assay variability and the ability to better resolve association signals through analysis of neighbouring SNPs. RESULTS: We report the accuracy of allelic frequency measurements on pooled genomic DNA samples by comparing these measurements to the known allelic frequencies as determined by individual genotyping. We describe modifications to the calculation of k-correction factors from relative allele signal (RAS) values that remove biases and result in more accurate allelic frequency predictions. Our results show that the least accurate SNPs, those most likely to give false-positives in an association study, are identifiable by comparing their frequencies to both those from a known database of individual genotypes and those of the pooled replicates. In a disease with a previously identified genetic mutation, we demonstrate that one can identify the disease locus through the comparison of the predicted allelic frequencies in case and control pools. Furthermore, we demonstrate improved resolution of association signals using the mean of individual test-statistics for consecutive SNPs windowed across the genome. A database of k-correction factors for predicting allelic frequencies for each SNP, derived from several thousand individually genotyped samples, is provided. Lastly, a Perl script for calculating RAS values for the Affymetrix platform is provided. CONCLUSION: Our results illustrate that pooling of DNA samples is an effective initial strategy to identify a genetic locus. However, it is important to eliminate inaccurate SNPs prior to analysis by comparing them to a database of individually genotyped samples as well as by comparing them to replicates of the pool. Lastly, detection of association signals can be improved by incorporating data from neighbouring SNPs.
format Text
id pubmed-1262713
institution National Center for Biotechnology Information
language English
publishDate 2005
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-12627132005-10-22 Identification of disease causing loci using an array-based genotyping approach on pooled DNA Craig, David W Huentelman, Matthew J Hu-Lince, Diane Zismann, Victoria L Kruer, Michael C Lee, Anne M Puffenberger, Erik G Pearson, John M Stephan, Dietrich A BMC Genomics Methodology Article BACKGROUND: Pooling genomic DNA samples within clinical classes of disease followed by genotyping on whole-genome SNP microarrays, allows for rapid and inexpensive genome-wide association studies. Key to the success of these studies is the accuracy of the allelic frequency calculations, the ability to identify false-positives arising from assay variability and the ability to better resolve association signals through analysis of neighbouring SNPs. RESULTS: We report the accuracy of allelic frequency measurements on pooled genomic DNA samples by comparing these measurements to the known allelic frequencies as determined by individual genotyping. We describe modifications to the calculation of k-correction factors from relative allele signal (RAS) values that remove biases and result in more accurate allelic frequency predictions. Our results show that the least accurate SNPs, those most likely to give false-positives in an association study, are identifiable by comparing their frequencies to both those from a known database of individual genotypes and those of the pooled replicates. In a disease with a previously identified genetic mutation, we demonstrate that one can identify the disease locus through the comparison of the predicted allelic frequencies in case and control pools. Furthermore, we demonstrate improved resolution of association signals using the mean of individual test-statistics for consecutive SNPs windowed across the genome. A database of k-correction factors for predicting allelic frequencies for each SNP, derived from several thousand individually genotyped samples, is provided. Lastly, a Perl script for calculating RAS values for the Affymetrix platform is provided. CONCLUSION: Our results illustrate that pooling of DNA samples is an effective initial strategy to identify a genetic locus. However, it is important to eliminate inaccurate SNPs prior to analysis by comparing them to a database of individually genotyped samples as well as by comparing them to replicates of the pool. Lastly, detection of association signals can be improved by incorporating data from neighbouring SNPs. BioMed Central 2005-09-30 /pmc/articles/PMC1262713/ /pubmed/16197552 http://dx.doi.org/10.1186/1471-2164-6-138 Text en Copyright © 2005 Craig et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Craig, David W
Huentelman, Matthew J
Hu-Lince, Diane
Zismann, Victoria L
Kruer, Michael C
Lee, Anne M
Puffenberger, Erik G
Pearson, John M
Stephan, Dietrich A
Identification of disease causing loci using an array-based genotyping approach on pooled DNA
title Identification of disease causing loci using an array-based genotyping approach on pooled DNA
title_full Identification of disease causing loci using an array-based genotyping approach on pooled DNA
title_fullStr Identification of disease causing loci using an array-based genotyping approach on pooled DNA
title_full_unstemmed Identification of disease causing loci using an array-based genotyping approach on pooled DNA
title_short Identification of disease causing loci using an array-based genotyping approach on pooled DNA
title_sort identification of disease causing loci using an array-based genotyping approach on pooled dna
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC1262713/
https://www.ncbi.nlm.nih.gov/pubmed/16197552
http://dx.doi.org/10.1186/1471-2164-6-138
work_keys_str_mv AT craigdavidw identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT huentelmanmatthewj identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT hulincediane identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT zismannvictorial identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT kruermichaelc identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT leeannem identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT puffenbergererikg identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT pearsonjohnm identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna
AT stephandietricha identificationofdiseasecausinglociusinganarraybasedgenotypingapproachonpooleddna