Cargando…
Genotyping and inflated type I error rate in genome-wide association case/control studies
BACKGROUND: One common goal of a case/control genome wide association study (GWAS) is to find SNPs associated with a disease. Traditionally, the first step in such studies is to assign a genotype to each SNP in each subject, based on a statistic summarizing fluorescence measurements. When the distri...
Autores principales: | , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679732/ https://www.ncbi.nlm.nih.gov/pubmed/19236714 http://dx.doi.org/10.1186/1471-2105-10-68 |
_version_ | 1782166916222156800 |
---|---|
author | Sampson, Joshua N Zhao, Hongyu |
author_facet | Sampson, Joshua N Zhao, Hongyu |
author_sort | Sampson, Joshua N |
collection | PubMed |
description | BACKGROUND: One common goal of a case/control genome wide association study (GWAS) is to find SNPs associated with a disease. Traditionally, the first step in such studies is to assign a genotype to each SNP in each subject, based on a statistic summarizing fluorescence measurements. When the distributions of the summary statistics are not well separated by genotype, the act of genotype assignment can lead to more potential problems than acknowledged by the literature. RESULTS: Specifically, we show that the proportions of each called genotype need not equal the true proportions in the population, even as the number of subjects grows infinitely large. The called genotypes for two subjects need not be independent, even when their true genotypes are independent. Consequently, p-values from tests of association can be anti-conservative, even when the distributions of the summary statistic for the cases and controls are identical. To address these problems, we propose two new tests designed to reduce the inflation in the type I error rate caused by these problems. The first algorithm, logiCALL, measures call quality by fully exploring the likelihood profile of intensity measurements, and the second algorithm avoids genotyping by using a likelihood ratio statistic. CONCLUSION: Genotyping can introduce avoidable false positives in GWAS. |
format | Text |
id | pubmed-2679732 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-26797322009-05-11 Genotyping and inflated type I error rate in genome-wide association case/control studies Sampson, Joshua N Zhao, Hongyu BMC Bioinformatics Methodology Article BACKGROUND: One common goal of a case/control genome wide association study (GWAS) is to find SNPs associated with a disease. Traditionally, the first step in such studies is to assign a genotype to each SNP in each subject, based on a statistic summarizing fluorescence measurements. When the distributions of the summary statistics are not well separated by genotype, the act of genotype assignment can lead to more potential problems than acknowledged by the literature. RESULTS: Specifically, we show that the proportions of each called genotype need not equal the true proportions in the population, even as the number of subjects grows infinitely large. The called genotypes for two subjects need not be independent, even when their true genotypes are independent. Consequently, p-values from tests of association can be anti-conservative, even when the distributions of the summary statistic for the cases and controls are identical. To address these problems, we propose two new tests designed to reduce the inflation in the type I error rate caused by these problems. The first algorithm, logiCALL, measures call quality by fully exploring the likelihood profile of intensity measurements, and the second algorithm avoids genotyping by using a likelihood ratio statistic. CONCLUSION: Genotyping can introduce avoidable false positives in GWAS. BioMed Central 2009-02-23 /pmc/articles/PMC2679732/ /pubmed/19236714 http://dx.doi.org/10.1186/1471-2105-10-68 Text en Copyright © 2009 Sampson and Zhao; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Sampson, Joshua N Zhao, Hongyu Genotyping and inflated type I error rate in genome-wide association case/control studies |
title | Genotyping and inflated type I error rate in genome-wide association case/control studies |
title_full | Genotyping and inflated type I error rate in genome-wide association case/control studies |
title_fullStr | Genotyping and inflated type I error rate in genome-wide association case/control studies |
title_full_unstemmed | Genotyping and inflated type I error rate in genome-wide association case/control studies |
title_short | Genotyping and inflated type I error rate in genome-wide association case/control studies |
title_sort | genotyping and inflated type i error rate in genome-wide association case/control studies |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2679732/ https://www.ncbi.nlm.nih.gov/pubmed/19236714 http://dx.doi.org/10.1186/1471-2105-10-68 |
work_keys_str_mv | AT sampsonjoshuan genotypingandinflatedtypeierrorrateingenomewideassociationcasecontrolstudies AT zhaohongyu genotypingandinflatedtypeierrorrateingenomewideassociationcasecontrolstudies |