
Fast and accurate long-range phasing in a UK Biobank cohort

Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here, we develop a fast and accurate LRP method, Eagle, that extend...


Bibliographic Details
Main authors: Loh, Po-Ru; Palamara, Pier Francesco; Price, Alkes L
Format: Online Article Text
Language: English
Published: 2016
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4925291/
https://www.ncbi.nlm.nih.gov/pubmed/27270109
http://dx.doi.org/10.1038/ng.3571
collection PubMed
description Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here, we develop a fast and accurate LRP method, Eagle, that extends this paradigm to populations with much smaller proportions of genotyped samples by harnessing long (>4cM) identical-by-descent (IBD) tracts shared among distantly related individuals. We applied Eagle to N≈150,000 samples (0.2% of the British population) from the UK Biobank, and we determined that it is 1–2 orders of magnitude faster than existing methods while achieving similar or better phasing accuracy (switch error rate ≈0.3%, corresponding to perfect phase in a majority of 10Mb segments). We also observed that when used within an imputation pipeline, Eagle pre-phasing improved downstream imputation accuracy compared to pre-phasing in batches using existing methods (as necessary to achieve comparable computational cost).
id pubmed-4925291
institution National Center for Biotechnology Information
record_format MEDLINE/PubMed
spelling pubmed-4925291 2016-12-06 Nat Genet Article 2016-06-06 2016-07 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms
topic Article