Cargando…
A Coalescent Model for Genotype Imputation
The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416004/ https://www.ncbi.nlm.nih.gov/pubmed/22595242 http://dx.doi.org/10.1534/genetics.111.137984 |
_version_ | 1782240407016439808 |
---|---|
author | Jewett, Ethan M. Zawistowski, Matthew Rosenberg, Noah A. Zöllner, Sebastian |
author_facet | Jewett, Ethan M. Zawistowski, Matthew Rosenberg, Noah A. Zöllner, Sebastian |
author_sort | Jewett, Ethan M. |
collection | PubMed |
description | The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference panel affect imputation accuracy theoretically rather than with computationally intensive imputation experiments, we introduce a coalescent model that considers imputation accuracy in terms of population-genetic parameters. Our model allows us to investigate sampling designs in the frequently occurring scenario in which imputation targets and templates are sampled from different populations. In particular, we derive expressions for expected imputation accuracy as a function of reference panel size and divergence time between the reference and target populations. We find that a modestly sized “internal” reference panel from the same population as a target haplotype yields, on average, greater imputation accuracy than a larger “external” panel from a different population, even if the divergence time between the two populations is small. The improvement in accuracy for the internal panel increases with increasing divergence time between the target and reference populations. Thus, in humans, our model predicts that imputation accuracy can be improved by generating small population-specific custom reference panels to augment existing collections such as those of the HapMap or 1000 Genomes Projects. Our approach can be extended to understand additional factors that affect imputation accuracy in complex population-genetic settings, and the results can ultimately facilitate improvements in imputation study designs. |
format | Online Article Text |
id | pubmed-3416004 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-34160042012-08-16 A Coalescent Model for Genotype Imputation Jewett, Ethan M. Zawistowski, Matthew Rosenberg, Noah A. Zöllner, Sebastian Genetics Investigations The potential for imputed genotypes to enhance an analysis of genetic data depends largely on the accuracy of imputation, which in turn depends on properties of the reference panel of template haplotypes used to perform the imputation. To provide a basis for exploring how properties of the reference panel affect imputation accuracy theoretically rather than with computationally intensive imputation experiments, we introduce a coalescent model that considers imputation accuracy in terms of population-genetic parameters. Our model allows us to investigate sampling designs in the frequently occurring scenario in which imputation targets and templates are sampled from different populations. In particular, we derive expressions for expected imputation accuracy as a function of reference panel size and divergence time between the reference and target populations. We find that a modestly sized “internal” reference panel from the same population as a target haplotype yields, on average, greater imputation accuracy than a larger “external” panel from a different population, even if the divergence time between the two populations is small. The improvement in accuracy for the internal panel increases with increasing divergence time between the target and reference populations. Thus, in humans, our model predicts that imputation accuracy can be improved by generating small population-specific custom reference panels to augment existing collections such as those of the HapMap or 1000 Genomes Projects. Our approach can be extended to understand additional factors that affect imputation accuracy in complex population-genetic settings, and the results can ultimately facilitate improvements in imputation study designs. Genetics Society of America 2012-08 /pmc/articles/PMC3416004/ /pubmed/22595242 http://dx.doi.org/10.1534/genetics.111.137984 Text en Copyright © 2012 by the Genetics Society of America Available freely online through the author-supported open access option. |
spellingShingle | Investigations Jewett, Ethan M. Zawistowski, Matthew Rosenberg, Noah A. Zöllner, Sebastian A Coalescent Model for Genotype Imputation |
title | A Coalescent Model for Genotype Imputation |
title_full | A Coalescent Model for Genotype Imputation |
title_fullStr | A Coalescent Model for Genotype Imputation |
title_full_unstemmed | A Coalescent Model for Genotype Imputation |
title_short | A Coalescent Model for Genotype Imputation |
title_sort | coalescent model for genotype imputation |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3416004/ https://www.ncbi.nlm.nih.gov/pubmed/22595242 http://dx.doi.org/10.1534/genetics.111.137984 |
work_keys_str_mv | AT jewettethanm acoalescentmodelforgenotypeimputation AT zawistowskimatthew acoalescentmodelforgenotypeimputation AT rosenbergnoaha acoalescentmodelforgenotypeimputation AT zollnersebastian acoalescentmodelforgenotypeimputation AT jewettethanm coalescentmodelforgenotypeimputation AT zawistowskimatthew coalescentmodelforgenotypeimputation AT rosenbergnoaha coalescentmodelforgenotypeimputation AT zollnersebastian coalescentmodelforgenotypeimputation |