Cargando…
Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project
Next-generation genotyping microarrays have been designed with insights from large-scale sequencing of exomes and whole genomes. The exome genotyping arrays promise to query the functional regions of the human genome at a fraction of the sequencing cost, thus allowing large number of samples to be g...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2014
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159276/ https://www.ncbi.nlm.nih.gov/pubmed/25203698 http://dx.doi.org/10.1371/journal.pone.0106681 |
_version_ | 1782334188279562240 |
---|---|
author | Tantoso, Erwin Wong, Lai-Ping Li, Bowen Saw, Woei-Yuh Xu, Wenting Little, Peter Ong, Rick Twee-Hee Teo, Yik-Ying |
author_facet | Tantoso, Erwin Wong, Lai-Ping Li, Bowen Saw, Woei-Yuh Xu, Wenting Little, Peter Ong, Rick Twee-Hee Teo, Yik-Ying |
author_sort | Tantoso, Erwin |
collection | PubMed |
description | Next-generation genotyping microarrays have been designed with insights from large-scale sequencing of exomes and whole genomes. The exome genotyping arrays promise to query the functional regions of the human genome at a fraction of the sequencing cost, thus allowing large number of samples to be genotyped. However, two pertinent questions exist: firstly, how representative is the content of the exome chip for populations not involved in the design of the chip; secondly, can the content of the exome chip be imputed with the reference data from the 1000 Genomes Project (1KGP). By deep whole-genome sequencing two Asian populations that are not part of the 1KGP, comprising 96 Southeast Asian Malays and 36 South Asian Indians for which the same samples have also been genotyped on both the Illumina 2.5 M and exome microarrays, we discovered the exome chip is a poor representation of exonic content in our two populations. However, up to 94.1% of the variants on the exome chip that are polymorphic in our populations can be confidently imputed with existing non-exome-centric microarrays using the 1KGP panel. The coverage further increases if there exists population-specific reference data from whole-genome sequencing. There is thus limited gain in using the exome chip for populations not involved in the microarray design. Instead, for the same cost of genotyping 2,000 samples on the exome chip, performing whole-genome sequencing of at least 35 samples in that population to complement the 1KGP may yield a higher coverage of the exonic content from imputation instead. |
format | Online Article Text |
id | pubmed-4159276 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2014 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-41592762014-09-12 Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project Tantoso, Erwin Wong, Lai-Ping Li, Bowen Saw, Woei-Yuh Xu, Wenting Little, Peter Ong, Rick Twee-Hee Teo, Yik-Ying PLoS One Research Article Next-generation genotyping microarrays have been designed with insights from large-scale sequencing of exomes and whole genomes. The exome genotyping arrays promise to query the functional regions of the human genome at a fraction of the sequencing cost, thus allowing large number of samples to be genotyped. However, two pertinent questions exist: firstly, how representative is the content of the exome chip for populations not involved in the design of the chip; secondly, can the content of the exome chip be imputed with the reference data from the 1000 Genomes Project (1KGP). By deep whole-genome sequencing two Asian populations that are not part of the 1KGP, comprising 96 Southeast Asian Malays and 36 South Asian Indians for which the same samples have also been genotyped on both the Illumina 2.5 M and exome microarrays, we discovered the exome chip is a poor representation of exonic content in our two populations. However, up to 94.1% of the variants on the exome chip that are polymorphic in our populations can be confidently imputed with existing non-exome-centric microarrays using the 1KGP panel. The coverage further increases if there exists population-specific reference data from whole-genome sequencing. There is thus limited gain in using the exome chip for populations not involved in the microarray design. Instead, for the same cost of genotyping 2,000 samples on the exome chip, performing whole-genome sequencing of at least 35 samples in that population to complement the 1KGP may yield a higher coverage of the exonic content from imputation instead. Public Library of Science 2014-09-09 /pmc/articles/PMC4159276/ /pubmed/25203698 http://dx.doi.org/10.1371/journal.pone.0106681 Text en © 2014 Tantoso et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited. |
spellingShingle | Research Article Tantoso, Erwin Wong, Lai-Ping Li, Bowen Saw, Woei-Yuh Xu, Wenting Little, Peter Ong, Rick Twee-Hee Teo, Yik-Ying Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project |
title | Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project |
title_full | Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project |
title_fullStr | Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project |
title_full_unstemmed | Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project |
title_short | Evaluating the Coverage and Potential of Imputing the Exome Microarray with Next-Generation Imputation Using the 1000 Genomes Project |
title_sort | evaluating the coverage and potential of imputing the exome microarray with next-generation imputation using the 1000 genomes project |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4159276/ https://www.ncbi.nlm.nih.gov/pubmed/25203698 http://dx.doi.org/10.1371/journal.pone.0106681 |
work_keys_str_mv | AT tantosoerwin evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT wonglaiping evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT libowen evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT sawwoeiyuh evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT xuwenting evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT littlepeter evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT ongricktweehee evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject AT teoyikying evaluatingthecoverageandpotentialofimputingtheexomemicroarraywithnextgenerationimputationusingthe1000genomesproject |