Cargando…

Genotype determination for polymorphisms in linkage disequilibrium

BACKGROUND: Genome-wide association studies with single nucleotide polymorphisms (SNPs) show great promise to identify genetic determinants of complex human traits. In current analyses, genotype calling and imputation of missing genotypes are usually considered as two separated tasks. The genotypes...

Descripción completa

Detalles Bibliográficos
Autores principales: Yu, Zhaoxia, Garner, Chad, Ziogas, Argyrios, Anton-Culver, Hoda, Schaid, Daniel J
Formato: Texto
Lenguaje:English
Publicado: BioMed Central 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2753842/
https://www.ncbi.nlm.nih.gov/pubmed/19228433
http://dx.doi.org/10.1186/1471-2105-10-63
_version_ 1782172366463303680
author Yu, Zhaoxia
Garner, Chad
Ziogas, Argyrios
Anton-Culver, Hoda
Schaid, Daniel J
author_facet Yu, Zhaoxia
Garner, Chad
Ziogas, Argyrios
Anton-Culver, Hoda
Schaid, Daniel J
author_sort Yu, Zhaoxia
collection PubMed
description BACKGROUND: Genome-wide association studies with single nucleotide polymorphisms (SNPs) show great promise to identify genetic determinants of complex human traits. In current analyses, genotype calling and imputation of missing genotypes are usually considered as two separated tasks. The genotypes of SNPs are first determined one at a time from allele signal intensities. Then the missing genotypes, i.e., no-calls caused by not perfectly separated signal clouds, are imputed based on the linkage disequilibrium (LD) between multiple SNPs. Although many statistical methods have been developed to improve either genotype calling or imputation of missing genotypes, treating the two steps independently can lead to loss of genetic information. RESULTS: We propose a novel genotype calling framework. In this framework, we consider the signal intensities and underlying LD structure of SNPs simultaneously by estimating both cluster parameters and haplotype frequencies. As a result, our new method outperforms some existing algorithms in terms of both call rates and genotyping accuracy. Our studies also suggest that jointly analyzing multiple SNPs in LD provides more accurate estimation of haplotypes than haplotype reconstruction methods that only use called genotypes. CONCLUSION: Our study demonstrates that jointly analyzing signal intensities and LD structure of multiple SNPs is a better way to determine genotypes and estimate LD parameters.
format Text
id pubmed-2753842
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-27538422009-10-13 Genotype determination for polymorphisms in linkage disequilibrium Yu, Zhaoxia Garner, Chad Ziogas, Argyrios Anton-Culver, Hoda Schaid, Daniel J BMC Bioinformatics Methodology Article BACKGROUND: Genome-wide association studies with single nucleotide polymorphisms (SNPs) show great promise to identify genetic determinants of complex human traits. In current analyses, genotype calling and imputation of missing genotypes are usually considered as two separated tasks. The genotypes of SNPs are first determined one at a time from allele signal intensities. Then the missing genotypes, i.e., no-calls caused by not perfectly separated signal clouds, are imputed based on the linkage disequilibrium (LD) between multiple SNPs. Although many statistical methods have been developed to improve either genotype calling or imputation of missing genotypes, treating the two steps independently can lead to loss of genetic information. RESULTS: We propose a novel genotype calling framework. In this framework, we consider the signal intensities and underlying LD structure of SNPs simultaneously by estimating both cluster parameters and haplotype frequencies. As a result, our new method outperforms some existing algorithms in terms of both call rates and genotyping accuracy. Our studies also suggest that jointly analyzing multiple SNPs in LD provides more accurate estimation of haplotypes than haplotype reconstruction methods that only use called genotypes. CONCLUSION: Our study demonstrates that jointly analyzing signal intensities and LD structure of multiple SNPs is a better way to determine genotypes and estimate LD parameters. BioMed Central 2009-02-20 /pmc/articles/PMC2753842/ /pubmed/19228433 http://dx.doi.org/10.1186/1471-2105-10-63 Text en Copyright © 2009 Yu et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( (http://creativecommons.org/licenses/by/2.0) ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology Article
Yu, Zhaoxia
Garner, Chad
Ziogas, Argyrios
Anton-Culver, Hoda
Schaid, Daniel J
Genotype determination for polymorphisms in linkage disequilibrium
title Genotype determination for polymorphisms in linkage disequilibrium
title_full Genotype determination for polymorphisms in linkage disequilibrium
title_fullStr Genotype determination for polymorphisms in linkage disequilibrium
title_full_unstemmed Genotype determination for polymorphisms in linkage disequilibrium
title_short Genotype determination for polymorphisms in linkage disequilibrium
title_sort genotype determination for polymorphisms in linkage disequilibrium
topic Methodology Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2753842/
https://www.ncbi.nlm.nih.gov/pubmed/19228433
http://dx.doi.org/10.1186/1471-2105-10-63
work_keys_str_mv AT yuzhaoxia genotypedeterminationforpolymorphismsinlinkagedisequilibrium
AT garnerchad genotypedeterminationforpolymorphismsinlinkagedisequilibrium
AT ziogasargyrios genotypedeterminationforpolymorphismsinlinkagedisequilibrium
AT antonculverhoda genotypedeterminationforpolymorphismsinlinkagedisequilibrium
AT schaiddanielj genotypedeterminationforpolymorphismsinlinkagedisequilibrium