Cargando…

A Coalescent Model for Genotype Imputation

The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference...

Descripción completa

Detalles Bibliográficos
Autores principales: Jewett, Ethan M., Zawistowski, Matthew, Rosenberg, Noah A., Zöllner, Sebastian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416004/
https://www.ncbi.nlm.nih.gov/pubmed/22595242
http://dx.doi.org/10.1534/genetics.111.137984
_version_ 1782240407016439808
author Jewett, Ethan M.
Zawistowski, Matthew
Rosenberg, Noah A.
Zöllner, Sebastian
author_facet Jewett, Ethan M.
Zawistowski, Matthew
Rosenberg, Noah A.
Zöllner, Sebastian
author_sort Jewett, Ethan M.
collection PubMed
description The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference panel affect imputation accuracy theoretically rather than with computationally intensive imputation experiments, we introduce a coalescent model that considers imputation accuracy in terms of population-genetic parameters. Our model allows us to investigate sampling designs in the frequently occurring scenario in which imputation targets and templates are sampled from different populations. In particular, we derive expressions for expected imputation accuracy as a function of reference panel size and divergence time between the reference and target populations. We find that a modestly sized “internal” reference panel from the same population as a target haplotype yields, on average, greater imputation accuracy than a larger “external” panel from a different population, even if the divergence time between the two populations is small. The improvement in accuracy for the internal panel increases with increasing divergence time between the target and reference populations. Thus, in humans, our model predicts that imputation accuracy can be improved by generating small population-specific custom reference panels to augment existing collections such as those of the HapMap or 1000 Genomes Projects. Our approach can be extended to understand additional factors that affect imputation accuracy in complex population-genetic settings, and the results can ultimately facilitate improvements in imputation study designs.
format Online
Article
Text
id pubmed-3416004
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-34160042012-08-16 A Coalescent Model for Genotype Imputation Jewett, Ethan M. Zawistowski, Matthew Rosenberg, Noah A. Zöllner, Sebastian Genetics Investigations The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference panel affect imputation accuracy theoretically rather than with computationally intensive imputation experiments, we introduce a coalescent model that considers imputation accuracy in terms of population-genetic parameters. Our model allows us to investigate sampling designs in the frequently occurring scenario in which imputation targets and templates are sampled from different populations. In particular, we derive expressions for expected imputation accuracy as a function of reference panel size and divergence time between the reference and target populations. We find that a modestly sized “internal” reference panel from the same population as a target haplotype yields, on average, greater imputation accuracy than a larger “external” panel from a different population, even if the divergence time between the two populations is small. The improvement in accuracy for the internal panel increases with increasing divergence time between the target and reference populations. Thus, in humans, our model predicts that imputation accuracy can be improved by generating small population-specific custom reference panels to augment existing collections such as those of the HapMap or 1000 Genomes Projects. Our approach can be extended to understand additional factors that affect imputation accuracy in complex population-genetic settings, and the results can ultimately facilitate improvements in imputation study designs. Genetics Society of America 2012-08 /pmc/articles/PMC3416004/ /pubmed/22595242 http://dx.doi.org/10.1534/genetics.111.137984 Text en Copyright © 2012 by the Genetics Society of America Available freely online through the author-supported open access option.
spellingShingle Investigations
Jewett, Ethan M.
Zawistowski, Matthew
Rosenberg, Noah A.
Zöllner, Sebastian
A Coalescent Model for Genotype Imputation
title A Coalescent Model for Genotype Imputation
title_full A Coalescent Model for Genotype Imputation
title_fullStr A Coalescent Model for Genotype Imputation
title_full_unstemmed A Coalescent Model for Genotype Imputation
title_short A Coalescent Model for Genotype Imputation
title_sort coalescent model for genotype imputation
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416004/
https://www.ncbi.nlm.nih.gov/pubmed/22595242
http://dx.doi.org/10.1534/genetics.111.137984
work_keys_str_mv AT jewettethanm acoalescentmodelforgenotypeimputation
AT zawistowskimatthew acoalescentmodelforgenotypeimputation
AT rosenbergnoaha acoalescentmodelforgenotypeimputation
AT zollnersebastian acoalescentmodelforgenotypeimputation
AT jewettethanm coalescentmodelforgenotypeimputation
AT zawistowskimatthew coalescentmodelforgenotypeimputation
AT rosenbergnoaha coalescentmodelforgenotypeimputation
AT zollnersebastian coalescentmodelforgenotypeimputation