Cargando…

Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes

Genotype imputation has the potential to assess human genetic variation at a lower cost than assaying the variants using laboratory techniques. The performance of imputation for rare variants has not been comprehensively studied. We utilized 8865 human samples with high depth resequencing data for t...

Descripción completa

Detalles Bibliográficos
Autores principales: Li, Li, Li, Yun, Browning, Sharon R., Browning, Brian L., Slater, Andrew J., Kong, Xiangyang, Aponte, Jennifer L., Mooser, Vincent E., Chissoe, Stephanie L., Whittaker, John C., Nelson, Matthew R., Ehm, Margaret Gelder
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2011
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3176314/
https://www.ncbi.nlm.nih.gov/pubmed/21949800
http://dx.doi.org/10.1371/journal.pone.0024945
_version_ 1782212215223353344
author Li, Li
Li, Yun
Browning, Sharon R.
Browning, Brian L.
Slater, Andrew J.
Kong, Xiangyang
Aponte, Jennifer L.
Mooser, Vincent E.
Chissoe, Stephanie L.
Whittaker, John C.
Nelson, Matthew R.
Ehm, Margaret Gelder
author_facet Li, Li
Li, Yun
Browning, Sharon R.
Browning, Brian L.
Slater, Andrew J.
Kong, Xiangyang
Aponte, Jennifer L.
Mooser, Vincent E.
Chissoe, Stephanie L.
Whittaker, John C.
Nelson, Matthew R.
Ehm, Margaret Gelder
author_sort Li, Li
collection PubMed
description Genotype imputation has the potential to assess human genetic variation at a lower cost than assaying the variants using laboratory techniques. The performance of imputation for rare variants has not been comprehensively studied. We utilized 8865 human samples with high depth resequencing data for the exons and flanking regions of 202 genes and Genome-Wide Association Study (GWAS) data to characterize the performance of genotype imputation for rare variants. We evaluated reference sets ranging from 100 to 3713 subjects for imputing into samples typed for the Affymetrix (500K and 6.0) and Illumina 550K GWAS panels. The proportion of variants that could be well imputed (true r(2)>0.7) with a reference panel of 3713 individuals was: 31% (Illumina 550K) or 25% (Affymetrix 500K) with MAF (Minor Allele Frequency) less than or equal 0.001, 48% or 35% with 0.001<MAF< = 0.005, 54% or 38% with 0.005<MAF< = 0.01, 78% or 57% with 0.01<MAF< = 0.05, and 97% or 86% with MAF>0.05. The performance for common SNPs (MAF>0.05) within exons and flanking regions is comparable to imputation of more uniformly distributed SNPs. The performance for rare SNPs (0.01<MAF< = 0.05) was much more dependent on the GWAS panel and the number of reference samples. These results suggest routine use of genotype imputation for extending the assessment of common variants identified in humans via targeted exon resequencing into additional samples with GWAS data, but imputation of very rare variants (MAF< = 0.005) will require reference panels with thousands of subjects.
format Online
Article
Text
id pubmed-3176314
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-31763142011-09-26 Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes Li, Li Li, Yun Browning, Sharon R. Browning, Brian L. Slater, Andrew J. Kong, Xiangyang Aponte, Jennifer L. Mooser, Vincent E. Chissoe, Stephanie L. Whittaker, John C. Nelson, Matthew R. Ehm, Margaret Gelder PLoS One Research Article Genotype imputation has the potential to assess human genetic variation at a lower cost than assaying the variants using laboratory techniques. The performance of imputation for rare variants has not been comprehensively studied. We utilized 8865 human samples with high depth resequencing data for the exons and flanking regions of 202 genes and Genome-Wide Association Study (GWAS) data to characterize the performance of genotype imputation for rare variants. We evaluated reference sets ranging from 100 to 3713 subjects for imputing into samples typed for the Affymetrix (500K and 6.0) and Illumina 550K GWAS panels. The proportion of variants that could be well imputed (true r(2)>0.7) with a reference panel of 3713 individuals was: 31% (Illumina 550K) or 25% (Affymetrix 500K) with MAF (Minor Allele Frequency) less than or equal 0.001, 48% or 35% with 0.001<MAF< = 0.005, 54% or 38% with 0.005<MAF< = 0.01, 78% or 57% with 0.01<MAF< = 0.05, and 97% or 86% with MAF>0.05. The performance for common SNPs (MAF>0.05) within exons and flanking regions is comparable to imputation of more uniformly distributed SNPs. The performance for rare SNPs (0.01<MAF< = 0.05) was much more dependent on the GWAS panel and the number of reference samples. These results suggest routine use of genotype imputation for extending the assessment of common variants identified in humans via targeted exon resequencing into additional samples with GWAS data, but imputation of very rare variants (MAF< = 0.005) will require reference panels with thousands of subjects. Public Library of Science 2011-09-19 /pmc/articles/PMC3176314/ /pubmed/21949800 http://dx.doi.org/10.1371/journal.pone.0024945 Text en Li et al. http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Li, Li
Li, Yun
Browning, Sharon R.
Browning, Brian L.
Slater, Andrew J.
Kong, Xiangyang
Aponte, Jennifer L.
Mooser, Vincent E.
Chissoe, Stephanie L.
Whittaker, John C.
Nelson, Matthew R.
Ehm, Margaret Gelder
Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes
title Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes
title_full Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes
title_fullStr Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes
title_full_unstemmed Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes
title_short Performance of Genotype Imputation for Rare Variants Identified in Exons and Flanking Regions of Genes
title_sort performance of genotype imputation for rare variants identified in exons and flanking regions of genes
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3176314/
https://www.ncbi.nlm.nih.gov/pubmed/21949800
http://dx.doi.org/10.1371/journal.pone.0024945
work_keys_str_mv AT lili performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT liyun performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT browningsharonr performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT browningbrianl performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT slaterandrewj performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT kongxiangyang performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT apontejenniferl performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT mooservincente performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT chissoestephaniel performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT whittakerjohnc performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT nelsonmatthewr performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes
AT ehmmargaretgelder performanceofgenotypeimputationforrarevariantsidentifiedinexonsandflankingregionsofgenes