Cargando…
Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank
Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets and applied it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. W...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10335929/ https://www.ncbi.nlm.nih.gov/pubmed/37386248 http://dx.doi.org/10.1038/s41588-023-01415-w |
_version_ | 1785071100179251200 |
---|---|
author | Hofmeister, Robin J. Ribeiro, Diogo M. Rubinacci, Simone Delaneau, Olivier |
author_facet | Hofmeister, Robin J. Ribeiro, Diogo M. Rubinacci, Simone Delaneau, Olivier |
author_sort | Hofmeister, Robin J. |
collection | PubMed |
description | Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets and applied it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. We demonstrate that SHAPEIT5 phases rare variants with low switch error rates of below 5% for variants present in just 1 sample out of 100,000. Furthermore, we outline a method for phasing singletons, which, although less precise, constitutes an important step towards future developments. We then demonstrate that the use of UKB as a reference panel improves the accuracy of genotype imputation, which is even more pronounced when phased with SHAPEIT5 compared with other methods. Finally, we screen the UKB data for loss-of-function compound heterozygous events and identify 549 genes where both gene copies are knocked out. These genes complement current knowledge of gene essentiality in the human genome. |
format | Online Article Text |
id | pubmed-10335929 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-103359292023-07-13 Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank Hofmeister, Robin J. Ribeiro, Diogo M. Rubinacci, Simone Delaneau, Olivier Nat Genet Technical Report Phasing involves distinguishing the two parentally inherited copies of each chromosome into haplotypes. Here, we introduce SHAPEIT5, a new phasing method that quickly and accurately processes large sequencing datasets and applied it to UK Biobank (UKB) whole-genome and whole-exome sequencing data. We demonstrate that SHAPEIT5 phases rare variants with low switch error rates of below 5% for variants present in just 1 sample out of 100,000. Furthermore, we outline a method for phasing singletons, which, although less precise, constitutes an important step towards future developments. We then demonstrate that the use of UKB as a reference panel improves the accuracy of genotype imputation, which is even more pronounced when phased with SHAPEIT5 compared with other methods. Finally, we screen the UKB data for loss-of-function compound heterozygous events and identify 549 genes where both gene copies are knocked out. These genes complement current knowledge of gene essentiality in the human genome. Nature Publishing Group US 2023-06-29 2023 /pmc/articles/PMC10335929/ /pubmed/37386248 http://dx.doi.org/10.1038/s41588-023-01415-w Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Technical Report Hofmeister, Robin J. Ribeiro, Diogo M. Rubinacci, Simone Delaneau, Olivier Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank |
title | Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank |
title_full | Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank |
title_fullStr | Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank |
title_full_unstemmed | Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank |
title_short | Accurate rare variant phasing of whole-genome and whole-exome sequencing data in the UK Biobank |
title_sort | accurate rare variant phasing of whole-genome and whole-exome sequencing data in the uk biobank |
topic | Technical Report |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10335929/ https://www.ncbi.nlm.nih.gov/pubmed/37386248 http://dx.doi.org/10.1038/s41588-023-01415-w |
work_keys_str_mv | AT hofmeisterrobinj accuraterarevariantphasingofwholegenomeandwholeexomesequencingdataintheukbiobank AT ribeirodiogom accuraterarevariantphasingofwholegenomeandwholeexomesequencingdataintheukbiobank AT rubinaccisimone accuraterarevariantphasingofwholegenomeandwholeexomesequencingdataintheukbiobank AT delaneauolivier accuraterarevariantphasingofwholegenomeandwholeexomesequencingdataintheukbiobank |