Cargando…
Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness
The presence of missing single-nucleotide polymorphism (SNP) genotypes is common in genetic studies. For studies with low-density SNPs, the most commonly used approach to dealing with genotype missingness is to simply remove the observations with missing genotypes from the analyses. This naïve metho...
Autores principales: | , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Research Foundation
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376470/ https://www.ncbi.nlm.nih.gov/pubmed/22719749 http://dx.doi.org/10.3389/fgene.2012.00107 |
_version_ | 1782235828161871872 |
---|---|
author | Lin, Wan-Yu Liu, Nianjun |
author_facet | Lin, Wan-Yu Liu, Nianjun |
author_sort | Lin, Wan-Yu |
collection | PubMed |
description | The presence of missing single-nucleotide polymorphism (SNP) genotypes is common in genetic studies. For studies with low-density SNPs, the most commonly used approach to dealing with genotype missingness is to simply remove the observations with missing genotypes from the analyses. This naïve method is straightforward but is valid only when the missingness is random. However, a given assay often has a different capability in genotyping heterozygotes and homozygotes, causing the phenomenon of “differential dropout” in the sense that the missing rates of heterozygotes and homozygotes are different. In practice, differential dropout among genotypes exists in even carefully designed studies, such as the data from the HapMap project and the Wellcome Trust Case Control Consortium. Under the assumption of Hardy–Weinberg equilibrium and no genotyping error, we here propose a statistical method to model the differential dropout among different genotypes. Compared with the naïve method, our method provides more accurate allele frequency estimates when the differential dropout is present. To demonstrate its practical use, we further apply our method to the HapMap data and a scleroderma data set. |
format | Online Article Text |
id | pubmed-3376470 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Frontiers Research Foundation |
record_format | MEDLINE/PubMed |
spelling | pubmed-33764702012-06-20 Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness Lin, Wan-Yu Liu, Nianjun Front Genet Genetics The presence of missing single-nucleotide polymorphism (SNP) genotypes is common in genetic studies. For studies with low-density SNPs, the most commonly used approach to dealing with genotype missingness is to simply remove the observations with missing genotypes from the analyses. This naïve method is straightforward but is valid only when the missingness is random. However, a given assay often has a different capability in genotyping heterozygotes and homozygotes, causing the phenomenon of “differential dropout” in the sense that the missing rates of heterozygotes and homozygotes are different. In practice, differential dropout among genotypes exists in even carefully designed studies, such as the data from the HapMap project and the Wellcome Trust Case Control Consortium. Under the assumption of Hardy–Weinberg equilibrium and no genotyping error, we here propose a statistical method to model the differential dropout among different genotypes. Compared with the naïve method, our method provides more accurate allele frequency estimates when the differential dropout is present. To demonstrate its practical use, we further apply our method to the HapMap data and a scleroderma data set. Frontiers Research Foundation 2012-06-18 /pmc/articles/PMC3376470/ /pubmed/22719749 http://dx.doi.org/10.3389/fgene.2012.00107 Text en Copyright © 2012 Lin and Liu. http://www.frontiersin.org/licenseagreement This is an open-access article distributed under the terms of the Creative Commons Attribution Non Commercial License, which permits non-commercial use, distribution, and reproduction in other forums, provided the original authors and source are credited. |
spellingShingle | Genetics Lin, Wan-Yu Liu, Nianjun Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness |
title | Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness |
title_full | Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness |
title_fullStr | Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness |
title_full_unstemmed | Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness |
title_short | Reducing Bias of Allele Frequency Estimates by Modeling SNP Genotype Data with Informative Missingness |
title_sort | reducing bias of allele frequency estimates by modeling snp genotype data with informative missingness |
topic | Genetics |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3376470/ https://www.ncbi.nlm.nih.gov/pubmed/22719749 http://dx.doi.org/10.3389/fgene.2012.00107 |
work_keys_str_mv | AT linwanyu reducingbiasofallelefrequencyestimatesbymodelingsnpgenotypedatawithinformativemissingness AT liunianjun reducingbiasofallelefrequencyestimatesbymodelingsnpgenotypedatawithinformativemissingness |