Cargando…

Rapid genotype imputation from sequence with reference panels

Inexpensive genotyping methods are essential to modern genomics. Here we present QUILT, which performs diploid genotype imputation using low-coverage whole genome sequence data. QUILT employs Gibbs sampling to partition reads into maternal and paternal sets, facilitating rapid haploid imputation usi...

Descripción completa

Detalles Bibliográficos
Autores principales: Davies, Robert W, Kucka, Marek, Su, Dingwen, Shi, Sinan, Flanagan, Maeve, Cunniff, Christopher M, Chan, Yingguang Frank, Myers, Simon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611184/
https://www.ncbi.nlm.nih.gov/pubmed/34083788
http://dx.doi.org/10.1038/s41588-021-00877-0
_version_ 1783605260817793024
author Davies, Robert W
Kucka, Marek
Su, Dingwen
Shi, Sinan
Flanagan, Maeve
Cunniff, Christopher M
Chan, Yingguang Frank
Myers, Simon
author_facet Davies, Robert W
Kucka, Marek
Su, Dingwen
Shi, Sinan
Flanagan, Maeve
Cunniff, Christopher M
Chan, Yingguang Frank
Myers, Simon
author_sort Davies, Robert W
collection PubMed
description Inexpensive genotyping methods are essential to modern genomics. Here we present QUILT, which performs diploid genotype imputation using low-coverage whole genome sequence data. QUILT employs Gibbs sampling to partition reads into maternal and paternal sets, facilitating rapid haploid imputation using large reference panels. We show this partitioning to be accurate over many megabases, enabling highly accurate imputation close to theoretical limits and outperforming existing methods. Moreover, QUILT can impute accurately using diverse technologies, including using long reads from Oxford Nanopore Technologies, and a novel form of low-cost barcoded Illumina sequencing called haplotagging, with the latter showing improved accuracy at low coverages. Relative to DNA genotyping microarrays, QUILT offers improved accuracy at reduced cost, particularly for diverse populations that are traditionally underserved in modern genomic analyses, with accuracy nearly doubling at rare SNPs. Finally, QUILT can accurately impute (4-digit) HLA types, the first such method from low-coverage sequence data.
format Online
Article
Text
id pubmed-7611184
institution National Center for Biotechnology Information
language English
publishDate 2021
record_format MEDLINE/PubMed
spelling pubmed-76111842021-12-03 Rapid genotype imputation from sequence with reference panels Davies, Robert W Kucka, Marek Su, Dingwen Shi, Sinan Flanagan, Maeve Cunniff, Christopher M Chan, Yingguang Frank Myers, Simon Nat Genet Article Inexpensive genotyping methods are essential to modern genomics. Here we present QUILT, which performs diploid genotype imputation using low-coverage whole genome sequence data. QUILT employs Gibbs sampling to partition reads into maternal and paternal sets, facilitating rapid haploid imputation using large reference panels. We show this partitioning to be accurate over many megabases, enabling highly accurate imputation close to theoretical limits and outperforming existing methods. Moreover, QUILT can impute accurately using diverse technologies, including using long reads from Oxford Nanopore Technologies, and a novel form of low-cost barcoded Illumina sequencing called haplotagging, with the latter showing improved accuracy at low coverages. Relative to DNA genotyping microarrays, QUILT offers improved accuracy at reduced cost, particularly for diverse populations that are traditionally underserved in modern genomic analyses, with accuracy nearly doubling at rare SNPs. Finally, QUILT can accurately impute (4-digit) HLA types, the first such method from low-coverage sequence data. 2021-07-01 2021-06-03 /pmc/articles/PMC7611184/ /pubmed/34083788 http://dx.doi.org/10.1038/s41588-021-00877-0 Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
spellingShingle Article
Davies, Robert W
Kucka, Marek
Su, Dingwen
Shi, Sinan
Flanagan, Maeve
Cunniff, Christopher M
Chan, Yingguang Frank
Myers, Simon
Rapid genotype imputation from sequence with reference panels
title Rapid genotype imputation from sequence with reference panels
title_full Rapid genotype imputation from sequence with reference panels
title_fullStr Rapid genotype imputation from sequence with reference panels
title_full_unstemmed Rapid genotype imputation from sequence with reference panels
title_short Rapid genotype imputation from sequence with reference panels
title_sort rapid genotype imputation from sequence with reference panels
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611184/
https://www.ncbi.nlm.nih.gov/pubmed/34083788
http://dx.doi.org/10.1038/s41588-021-00877-0
work_keys_str_mv AT daviesrobertw rapidgenotypeimputationfromsequencewithreferencepanels
AT kuckamarek rapidgenotypeimputationfromsequencewithreferencepanels
AT sudingwen rapidgenotypeimputationfromsequencewithreferencepanels
AT shisinan rapidgenotypeimputationfromsequencewithreferencepanels
AT flanaganmaeve rapidgenotypeimputationfromsequencewithreferencepanels
AT cunniffchristopherm rapidgenotypeimputationfromsequencewithreferencepanels
AT chanyingguangfrank rapidgenotypeimputationfromsequencewithreferencepanels
AT myerssimon rapidgenotypeimputationfromsequencewithreferencepanels