Cargando…

Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project

Next-generation genotyping microarrays have been designed with insights from large-scale sequencing of exomes and whole genomes. The exome genotyping arrays promise to query the functional regions of the human genome at a fraction of the sequencing cost, thus allowing large number of samples to be g...

Descripción completa

Detalles Bibliográficos
Autores principales: Tantoso, Erwin, Wong, Lai-Ping, Li, Bowen, Saw, Woei-Yuh, Xu, Wenting, Little, Peter, Ong, Rick Twee-Hee, Teo, Yik-Ying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159276/
https://www.ncbi.nlm.nih.gov/pubmed/25203698
http://dx.doi.org/10.1371/journal.pone.0106681
_version_ 1782334188279562240
author Tantoso, Erwin
Wong, Lai-Ping
Li, Bowen
Saw, Woei-Yuh
Xu, Wenting
Little, Peter
Ong, Rick Twee-Hee
Teo, Yik-Ying
author_facet Tantoso, Erwin
Wong, Lai-Ping
Li, Bowen
Saw, Woei-Yuh
Xu, Wenting
Little, Peter
Ong, Rick Twee-Hee
Teo, Yik-Ying
author_sort Tantoso, Erwin
collection PubMed
description Next-generation genotyping microarrays have been designed with insights from large-scale sequencing of exomes and whole genomes. The exome genotyping arrays promise to query the functional regions of the human genome at a fraction of the sequencing cost, thus allowing large number of samples to be genotyped. However, two pertinent questions exist: firstly, how representative is the content of the exome chip for populations not involved in the design of the chip; secondly, can the content of the exome chip be imputed with the reference data from the 1000 Genomes Project (1KGP). By deep whole-genome sequencing two Asian populations that are not part of the 1KGP, comprising 96 Southeast Asian Malays and 36 South Asian Indians for which the same samples have also been genotyped on both the Illumina 2.5 M and exome microarrays, we discovered the exome chip is a poor representation of exonic content in our two populations. However, up to 94.1% of the variants on the exome chip that are polymorphic in our populations can be confidently imputed with existing non-exome-centric microarrays using the 1KGP panel. The coverage further increases if there exists population-specific reference data from whole-genome sequencing. There is thus limited gain in using the exome chip for populations not involved in the microarray design. Instead, for the same cost of genotyping 2,000 samples on the exome chip, performing whole-genome sequencing of at least 35 samples in that population to complement the 1KGP may yield a higher coverage of the exonic content from imputation instead.
format Online
Article
Text
id pubmed-4159276
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41592762014-09-12 Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project Tantoso, Erwin Wong, Lai-Ping Li, Bowen Saw, Woei-Yuh Xu, Wenting Little, Peter Ong, Rick Twee-Hee Teo, Yik-Ying PLoS One Research Article Next-generation genotyping microarrays have been designed with insights from large-scale sequencing of exomes and whole genomes. The exome genotyping arrays promise to query the functional regions of the human genome at a fraction of the sequencing cost, thus allowing large number of samples to be genotyped. However, two pertinent questions exist: firstly, how representative is the content of the exome chip for populations not involved in the design of the chip; secondly, can the content of the exome chip be imputed with the reference data from the 1000 Genomes Project (1KGP). By deep whole-genome sequencing two Asian populations that are not part of the 1KGP, comprising 96 Southeast Asian Malays and 36 South Asian Indians for which the same samples have also been genotyped on both the Illumina 2.5 M and exome microarrays, we discovered the exome chip is a poor representation of exonic content in our two populations. However, up to 94.1% of the variants on the exome chip that are polymorphic in our populations can be confidently imputed with existing non-exome-centric microarrays using the 1KGP panel. The coverage further increases if there exists population-specific reference data from whole-genome sequencing. There is thus limited gain in using the exome chip for populations not involved in the microarray design. Instead, for the same cost of genotyping 2,000 samples on the exome chip, performing whole-genome sequencing of at least 35 samples in that population to complement the 1KGP may yield a higher coverage of the exonic content from imputation instead. Public Library of Science 2014-09-09 /pmc/articles/PMC4159276/ /pubmed/25203698 http://dx.doi.org/10.1371/journal.pone.0106681 Text en © 2014 Tantoso et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Tantoso, Erwin
Wong, Lai-Ping
Li, Bowen
Saw, Woei-Yuh
Xu, Wenting
Little, Peter
Ong, Rick Twee-Hee
Teo, Yik-Ying
Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
title Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
title_full Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
title_fullStr Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
title_full_unstemmed Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
title_short Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
title_sort evaluating the coverage and potential of imputing the exome microarray with next-generation imputation using the 1000 genomes project
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159276/
https://www.ncbi.nlm.nih.gov/pubmed/25203698
http://dx.doi.org/10.1371/journal.pone.0106681
work_keys_str_mv AT tantosoerwin evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT wonglaiping evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT libowen evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT sawwoeiyuh evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT xuwenting evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT littlepeter evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT ongricktweehee evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject
AT teoyikying evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject