Cargando…

Rapid genotype imputation from sequence without reference panels

Inexpensive genotyping methods are essential for genetic studies requiring large sample sizes. In human studies, array-based microarrays and high-density haplotype reference panels allow efficient genotype imputation for this purpose. However, these resources are typically unavailable in non-human s...

Descripción completa

Detalles Bibliográficos
Autores principales: Davies, Robert W., Flint, Jonathan, Myers, Simon, Mott, Richard
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2016
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4966640/
https://www.ncbi.nlm.nih.gov/pubmed/27376236
http://dx.doi.org/10.1038/ng.3594
_version_ 1782445407860686848
author Davies, Robert W.
Flint, Jonathan
Myers, Simon
Mott, Richard
author_facet Davies, Robert W.
Flint, Jonathan
Myers, Simon
Mott, Richard
author_sort Davies, Robert W.
collection PubMed
description Inexpensive genotyping methods are essential for genetic studies requiring large sample sizes. In human studies, array-based microarrays and high-density haplotype reference panels allow efficient genotype imputation for this purpose. However, these resources are typically unavailable in non-human settings. Here we describe a method (STITCH) for imputation based only on sequencing read data, without requiring additional reference panels or array data. We demonstrate its applicability even in settings of extremely low sequencing coverage, by accurately imputing 5.7 million SNPs at a mean r(2) of 0.98 in 2,073 outbred laboratory mice (0.15X sequencing coverage). In a sample of 11,670 Han Chinese (1.7X), we achieve accuracy similar to alternative approaches that require a reference panel, demonstrating that this approach can work for genetically diverse populations. Our method enables straightforward progression from low-coverage sequence to imputed genotypes, overcoming barriers that at present restrict the application of genome-wide association study technology outside humans.
format Online
Article
Text
id pubmed-4966640
institution National Center for Biotechnology Information
language English
publishDate 2016
record_format MEDLINE/PubMed
spelling pubmed-49666402017-01-04 Rapid genotype imputation from sequence without reference panels Davies, Robert W. Flint, Jonathan Myers, Simon Mott, Richard Nat Genet Article Inexpensive genotyping methods are essential for genetic studies requiring large sample sizes. In human studies, array-based microarrays and high-density haplotype reference panels allow efficient genotype imputation for this purpose. However, these resources are typically unavailable in non-human settings. Here we describe a method (STITCH) for imputation based only on sequencing read data, without requiring additional reference panels or array data. We demonstrate its applicability even in settings of extremely low sequencing coverage, by accurately imputing 5.7 million SNPs at a mean r(2) of 0.98 in 2,073 outbred laboratory mice (0.15X sequencing coverage). In a sample of 11,670 Han Chinese (1.7X), we achieve accuracy similar to alternative approaches that require a reference panel, demonstrating that this approach can work for genetically diverse populations. Our method enables straightforward progression from low-coverage sequence to imputed genotypes, overcoming barriers that at present restrict the application of genome-wide association study technology outside humans. 2016-07-04 2016-08 /pmc/articles/PMC4966640/ /pubmed/27376236 http://dx.doi.org/10.1038/ng.3594 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Davies, Robert W.
Flint, Jonathan
Myers, Simon
Mott, Richard
Rapid genotype imputation from sequence without reference panels
title Rapid genotype imputation from sequence without reference panels
title_full Rapid genotype imputation from sequence without reference panels
title_fullStr Rapid genotype imputation from sequence without reference panels
title_full_unstemmed Rapid genotype imputation from sequence without reference panels
title_short Rapid genotype imputation from sequence without reference panels
title_sort rapid genotype imputation from sequence without reference panels
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4966640/
https://www.ncbi.nlm.nih.gov/pubmed/27376236
http://dx.doi.org/10.1038/ng.3594
work_keys_str_mv AT daviesrobertw rapidgenotypeimputationfromsequencewithoutreferencepanels
AT flintjonathan rapidgenotypeimputationfromsequencewithoutreferencepanels
AT myerssimon rapidgenotypeimputationfromsequencewithoutreferencepanels
AT mottrichard rapidgenotypeimputationfromsequencewithoutreferencepanels