Cargando…

A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies

Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000...

Descripción completa

Detalles Bibliográficos
Autores principales: Howie, Bryan N., Donnelly, Peter, Marchini, Jonathan
Formato: Texto
Lenguaje:English
Publicado: Public Library of Science 2009
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689936/
https://www.ncbi.nlm.nih.gov/pubmed/19543373
http://dx.doi.org/10.1371/journal.pgen.1000529
_version_ 1782167825219059712
author Howie, Bryan N.
Donnelly, Peter
Marchini, Jonathan
author_facet Howie, Bryan N.
Donnelly, Peter
Marchini, Jonathan
author_sort Howie, Bryan N.
collection PubMed
description Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%–20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions.
format Text
id pubmed-2689936
institution National Center for Biotechnology Information
language English
publishDate 2009
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-26899362009-06-19 A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies Howie, Bryan N. Donnelly, Peter Marchini, Jonathan PLoS Genet Research Article Genotype imputation methods are now being widely used in the analysis of genome-wide association studies. Most imputation analyses to date have used the HapMap as a reference dataset, but new reference panels (such as controls genotyped on multiple SNP chips and densely typed samples from the 1,000 Genomes Project) will soon allow a broader range of SNPs to be imputed with higher accuracy, thereby increasing power. We describe a genotype imputation method (IMPUTE version 2) that is designed to address the challenges presented by these new datasets. The main innovation of our approach is a flexible modelling framework that increases accuracy and combines information across multiple reference panels while remaining computationally feasible. We find that IMPUTE v2 attains higher accuracy than other methods when the HapMap provides the sole reference panel, but that the size of the panel constrains the improvements that can be made. We also find that imputation accuracy can be greatly enhanced by expanding the reference panel to contain thousands of chromosomes and that IMPUTE v2 outperforms other methods in this setting at both rare and common SNPs, with overall error rates that are 15%–20% lower than those of the closest competing method. One particularly challenging aspect of next-generation association studies is to integrate information across multiple reference panels genotyped on different sets of SNPs; we show that our approach to this problem has practical advantages over other suggested solutions. Public Library of Science 2009-06-19 /pmc/articles/PMC2689936/ /pubmed/19543373 http://dx.doi.org/10.1371/journal.pgen.1000529 Text en Howie et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Howie, Bryan N.
Donnelly, Peter
Marchini, Jonathan
A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
title A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
title_full A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
title_fullStr A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
title_full_unstemmed A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
title_short A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies
title_sort flexible and accurate genotype imputation method for the next generation of genome-wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC2689936/
https://www.ncbi.nlm.nih.gov/pubmed/19543373
http://dx.doi.org/10.1371/journal.pgen.1000529
work_keys_str_mv AT howiebryann aflexibleandaccurategenotypeimputationmethodforthenextgenerationofgenomewideassociationstudies
AT donnellypeter aflexibleandaccurategenotypeimputationmethodforthenextgenerationofgenomewideassociationstudies
AT marchinijonathan aflexibleandaccurategenotypeimputationmethodforthenextgenerationofgenomewideassociationstudies
AT howiebryann flexibleandaccurategenotypeimputationmethodforthenextgenerationofgenomewideassociationstudies
AT donnellypeter flexibleandaccurategenotypeimputationmethodforthenextgenerationofgenomewideassociationstudies
AT marchinijonathan flexibleandaccurategenotypeimputationmethodforthenextgenerationofgenomewideassociationstudies