Cargando…

Genotyping and inflated type I error rate in genome-wide association case/control studies

BACKGROUND: One common goal of a case/control genome wide association study (GWAS) is to find SNPs associated with a disease. Traditionally, the first step in such studies is to assign a genotype to each SNP in each subject, based on a statistic summarizing fluorescence measurements. When the distri...

Descripción completa

Detalles Bibliográficos
Autores principales: Sampson, Joshua N, Zhao, Hongyu
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679732/
https://www.ncbi.nlm.nih.gov/pubmed/19236714
http://dx.doi.org/10.1186/1471-2105-10-68
_version_ 1782166916222156800
author Sampson, Joshua N
Zhao, Hongyu
author_facet Sampson, Joshua N
Zhao, Hongyu
author_sort Sampson, Joshua N
collection PubMed
description BACKGROUND: One common goal of a case/control genome wide association study (GWAS) is to find SNPs associated with a disease. Traditionally, the first step in such studies is to assign a genotype to each SNP in each subject, based on a statistic summarizing fluorescence measurements. When the distributions of the summary statistics are not well separated by genotype, the act of genotype assignment can lead to more potential problems than acknowledged by the literature. RESULTS: Specifically, we show that the proportions of each called genotype need not equal the true proportions in the population, even as the number of subjects grows infinitely large. The called genotypes for two subjects need not be independent, even when their true genotypes are independent. Consequently, p-values from tests of association can be anti-conservative, even when the distributions of the summary statistic for the cases and controls are identical. To address these problems, we propose two new tests designed to reduce the inflation in the type I error rate caused by these problems. The first algorithm, logiCALL, measures call quality by fully exploring the likelihood profile of intensity measurements, and the second algorithm avoids genotyping by using a likelihood ratio statistic. CONCLUSION: Genotyping can introduce avoidable false positives in GWAS.
format Text
id pubmed-2679732
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-26797322009-05-11 Genotyping and inflated type I error rate in genome-wide association case/control studies Sampson, Joshua N Zhao, Hongyu BMC Bioinformatics Methodology Article BACKGROUND: One common goal of a case/control genome wide association study (GWAS) is to find SNPs associated with a disease. Traditionally, the first step in such studies is to assign a genotype to each SNP in each subject, based on a statistic summarizing fluorescence measurements. When the distributions of the summary statistics are not well separated by genotype, the act of genotype assignment can lead to more potential problems than acknowledged by the literature. RESULTS: Specifically, we show that the proportions of each called genotype need not equal the true proportions in the population, even as the number of subjects grows infinitely large. The called genotypes for two subjects need not be independent, even when their true genotypes are independent. Consequently, p-values from tests of association can be anti-conservative, even when the distributions of the summary statistic for the cases and controls are identical. To address these problems, we propose two new tests designed to reduce the inflation in the type I error rate caused by these problems. The first algorithm, logiCALL, measures call quality by fully exploring the likelihood profile of intensity measurements, and the second algorithm avoids genotyping by using a likelihood ratio statistic. CONCLUSION: Genotyping can introduce avoidable false positives in GWAS. BioMed Central 2009-02-23 /pmc/articles/PMC2679732/ /pubmed/19236714 http://dx.doi.org/10.1186/1471-2105-10-68 Text en Copyright © 2009 Sampson and Zhao; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Sampson, Joshua N
Zhao, Hongyu
Genotyping and inflated type I error rate in genome-wide association case/control studies
title Genotyping and inflated type I error rate in genome-wide association case/control studies
title_full Genotyping and inflated type I error rate in genome-wide association case/control studies
title_fullStr Genotyping and inflated type I error rate in genome-wide association case/control studies
title_full_unstemmed Genotyping and inflated type I error rate in genome-wide association case/control studies
title_short Genotyping and inflated type I error rate in genome-wide association case/control studies
title_sort genotyping and inflated type i error rate in genome-wide association case/control studies
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679732/
https://www.ncbi.nlm.nih.gov/pubmed/19236714
http://dx.doi.org/10.1186/1471-2105-10-68
work_keys_str_mv AT sampsonjoshuan genotypingandinflatedtypeierrorrateingenomewideassociationcasecontrolstudies
AT zhaohongyu genotypingandinflatedtypeierrorrateingenomewideassociationcasecontrolstudies