Cargando…
Genotype determination for polymorphisms in linkage disequilibrium
BACKGROUND: Genome-wide association studies with single nucleotide polymorphisms (SNPs) show great promise to identify genetic determinants of complex human traits. In current analyses, genotype calling and imputation of missing genotypes are usually considered as two separated tasks. The genotypes...
Autores principales: | , , , , |
---|---|
Formato: | Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2009
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2753842/ https://www.ncbi.nlm.nih.gov/pubmed/19228433 http://dx.doi.org/10.1186/1471-2105-10-63 |
_version_ | 1782172366463303680 |
---|---|
author | Yu, Zhaoxia Garner, Chad Ziogas, Argyrios Anton-Culver, Hoda Schaid, Daniel J |
author_facet | Yu, Zhaoxia Garner, Chad Ziogas, Argyrios Anton-Culver, Hoda Schaid, Daniel J |
author_sort | Yu, Zhaoxia |
collection | PubMed |
description | BACKGROUND: Genome-wide association studies with single nucleotide polymorphisms (SNPs) show great promise to identify genetic determinants of complex human traits. In current analyses, genotype calling and imputation of missing genotypes are usually considered as two separated tasks. The genotypes of SNPs are first determined one at a time from allele signal intensities. Then the missing genotypes, i.e., no-calls caused by not perfectly separated signal clouds, are imputed based on the linkage disequilibrium (LD) between multiple SNPs. Although many statistical methods have been developed to improve either genotype calling or imputation of missing genotypes, treating the two steps independently can lead to loss of genetic information. RESULTS: We propose a novel genotype calling framework. In this framework, we consider the signal intensities and underlying LD structure of SNPs simultaneously by estimating both cluster parameters and haplotype frequencies. As a result, our new method outperforms some existing algorithms in terms of both call rates and genotyping accuracy. Our studies also suggest that jointly analyzing multiple SNPs in LD provides more accurate estimation of haplotypes than haplotype reconstruction methods that only use called genotypes. CONCLUSION: Our study demonstrates that jointly analyzing signal intensities and LD structure of multiple SNPs is a better way to determine genotypes and estimate LD parameters. |
format | Text |
id | pubmed-2753842 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2009 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-27538422009-10-13 Genotype determination for polymorphisms in linkage disequilibrium Yu, Zhaoxia Garner, Chad Ziogas, Argyrios Anton-Culver, Hoda Schaid, Daniel J BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide association studies with single nucleotide polymorphisms (SNPs) show great promise to identify genetic determinants of complex human traits. In current analyses, genotype calling and imputation of missing genotypes are usually considered as two separated tasks. The genotypes of SNPs are first determined one at a time from allele signal intensities. Then the missing genotypes, i.e., no-calls caused by not perfectly separated signal clouds, are imputed based on the linkage disequilibrium (LD) between multiple SNPs. Although many statistical methods have been developed to improve either genotype calling or imputation of missing genotypes, treating the two steps independently can lead to loss of genetic information. RESULTS: We propose a novel genotype calling framework. In this framework, we consider the signal intensities and underlying LD structure of SNPs simultaneously by estimating both cluster parameters and haplotype frequencies. As a result, our new method outperforms some existing algorithms in terms of both call rates and genotyping accuracy. Our studies also suggest that jointly analyzing multiple SNPs in LD provides more accurate estimation of haplotypes than haplotype reconstruction methods that only use called genotypes. CONCLUSION: Our study demonstrates that jointly analyzing signal intensities and LD structure of multiple SNPs is a better way to determine genotypes and estimate LD parameters. BioMed Central 2009-02-20 /pmc/articles/PMC2753842/ /pubmed/19228433 http://dx.doi.org/10.1186/1471-2105-10-63 Text en Copyright © 2009 Yu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Methodology Article Yu, Zhaoxia Garner, Chad Ziogas, Argyrios Anton-Culver, Hoda Schaid, Daniel J Genotype determination for polymorphisms in linkage disequilibrium |
title | Genotype determination for polymorphisms in linkage disequilibrium |
title_full | Genotype determination for polymorphisms in linkage disequilibrium |
title_fullStr | Genotype determination for polymorphisms in linkage disequilibrium |
title_full_unstemmed | Genotype determination for polymorphisms in linkage disequilibrium |
title_short | Genotype determination for polymorphisms in linkage disequilibrium |
title_sort | genotype determination for polymorphisms in linkage disequilibrium |
topic | Methodology Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2753842/ https://www.ncbi.nlm.nih.gov/pubmed/19228433 http://dx.doi.org/10.1186/1471-2105-10-63 |
work_keys_str_mv | AT yuzhaoxia genotypedeterminationforpolymorphismsinlinkagedisequilibrium AT garnerchad genotypedeterminationforpolymorphismsinlinkagedisequilibrium AT ziogasargyrios genotypedeterminationforpolymorphismsinlinkagedisequilibrium AT antonculverhoda genotypedeterminationforpolymorphismsinlinkagedisequilibrium AT schaiddanielj genotypedeterminationforpolymorphismsinlinkagedisequilibrium |