Cargando…
Rapid genotype imputation from sequence with reference panels
Inexpensive genotyping methods are essential to modern genomics. Here we present QUILT, which performs diploid genotype imputation using low-coverage whole genome sequence data. QUILT employs Gibbs sampling to partition reads into maternal and paternal sets, facilitating rapid haploid imputation usi...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611184/ https://www.ncbi.nlm.nih.gov/pubmed/34083788 http://dx.doi.org/10.1038/s41588-021-00877-0 |
_version_ | 1783605260817793024 |
---|---|
author | Davies, Robert W Kucka, Marek Su, Dingwen Shi, Sinan Flanagan, Maeve Cunniff, Christopher M Chan, Yingguang Frank Myers, Simon |
author_facet | Davies, Robert W Kucka, Marek Su, Dingwen Shi, Sinan Flanagan, Maeve Cunniff, Christopher M Chan, Yingguang Frank Myers, Simon |
author_sort | Davies, Robert W |
collection | PubMed |
description | Inexpensive genotyping methods are essential to modern genomics. Here we present QUILT, which performs diploid genotype imputation using low-coverage whole genome sequence data. QUILT employs Gibbs sampling to partition reads into maternal and paternal sets, facilitating rapid haploid imputation using large reference panels. We show this partitioning to be accurate over many megabases, enabling highly accurate imputation close to theoretical limits and outperforming existing methods. Moreover, QUILT can impute accurately using diverse technologies, including using long reads from Oxford Nanopore Technologies, and a novel form of low-cost barcoded Illumina sequencing called haplotagging, with the latter showing improved accuracy at low coverages. Relative to DNA genotyping microarrays, QUILT offers improved accuracy at reduced cost, particularly for diverse populations that are traditionally underserved in modern genomic analyses, with accuracy nearly doubling at rare SNPs. Finally, QUILT can accurately impute (4-digit) HLA types, the first such method from low-coverage sequence data. |
format | Online Article Text |
id | pubmed-7611184 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
record_format | MEDLINE/PubMed |
spelling | pubmed-76111842021-12-03 Rapid genotype imputation from sequence with reference panels Davies, Robert W Kucka, Marek Su, Dingwen Shi, Sinan Flanagan, Maeve Cunniff, Christopher M Chan, Yingguang Frank Myers, Simon Nat Genet Article Inexpensive genotyping methods are essential to modern genomics. Here we present QUILT, which performs diploid genotype imputation using low-coverage whole genome sequence data. QUILT employs Gibbs sampling to partition reads into maternal and paternal sets, facilitating rapid haploid imputation using large reference panels. We show this partitioning to be accurate over many megabases, enabling highly accurate imputation close to theoretical limits and outperforming existing methods. Moreover, QUILT can impute accurately using diverse technologies, including using long reads from Oxford Nanopore Technologies, and a novel form of low-cost barcoded Illumina sequencing called haplotagging, with the latter showing improved accuracy at low coverages. Relative to DNA genotyping microarrays, QUILT offers improved accuracy at reduced cost, particularly for diverse populations that are traditionally underserved in modern genomic analyses, with accuracy nearly doubling at rare SNPs. Finally, QUILT can accurately impute (4-digit) HLA types, the first such method from low-coverage sequence data. 2021-07-01 2021-06-03 /pmc/articles/PMC7611184/ /pubmed/34083788 http://dx.doi.org/10.1038/s41588-021-00877-0 Text en http://www.nature.com/authors/editorial_policies/license.html#termsUsers may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Davies, Robert W Kucka, Marek Su, Dingwen Shi, Sinan Flanagan, Maeve Cunniff, Christopher M Chan, Yingguang Frank Myers, Simon Rapid genotype imputation from sequence with reference panels |
title | Rapid genotype imputation from sequence with reference panels |
title_full | Rapid genotype imputation from sequence with reference panels |
title_fullStr | Rapid genotype imputation from sequence with reference panels |
title_full_unstemmed | Rapid genotype imputation from sequence with reference panels |
title_short | Rapid genotype imputation from sequence with reference panels |
title_sort | rapid genotype imputation from sequence with reference panels |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7611184/ https://www.ncbi.nlm.nih.gov/pubmed/34083788 http://dx.doi.org/10.1038/s41588-021-00877-0 |
work_keys_str_mv | AT daviesrobertw rapidgenotypeimputationfromsequencewithreferencepanels AT kuckamarek rapidgenotypeimputationfromsequencewithreferencepanels AT sudingwen rapidgenotypeimputationfromsequencewithreferencepanels AT shisinan rapidgenotypeimputationfromsequencewithreferencepanels AT flanaganmaeve rapidgenotypeimputationfromsequencewithreferencepanels AT cunniffchristopherm rapidgenotypeimputationfromsequencewithreferencepanels AT chanyingguangfrank rapidgenotypeimputationfromsequencewithreferencepanels AT myerssimon rapidgenotypeimputationfromsequencewithreferencepanels |