Cargando…
Fast and accurate long-range phasing in a UK Biobank cohort
Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here, we develop a fast and accurate LRP method, Eagle, that extend...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
2016
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4925291/ https://www.ncbi.nlm.nih.gov/pubmed/27270109 http://dx.doi.org/10.1038/ng.3571 |
_version_ | 1782439958516072448 |
---|---|
author | Loh, Po-Ru Palamara, Pier Francesco Price, Alkes L |
author_facet | Loh, Po-Ru Palamara, Pier Francesco Price, Alkes L |
author_sort | Loh, Po-Ru |
collection | PubMed |
description | Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here, we develop a fast and accurate LRP method, Eagle, that extends this paradigm to populations with much smaller proportions of genotyped samples by harnessing long (>4cM) identical-by-descent (IBD) tracts shared among distantly related individuals. We applied Eagle to N≈150,000 samples (0.2% of the British population) from the UK Biobank, and we determined that it is 1–2 orders of magnitude faster than existing methods while achieving similar or better phasing accuracy (switch error rate ≈0.3%, corresponding to perfect phase in a majority of 10Mb segments). We also observed that when used within an imputation pipeline, Eagle pre-phasing improved downstream imputation accuracy compared to pre-phasing in batches using existing methods (as necessary to achieve comparable computational cost). |
format | Online Article Text |
id | pubmed-4925291 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2016 |
record_format | MEDLINE/PubMed |
spelling | pubmed-49252912016-12-06 Fast and accurate long-range phasing in a UK Biobank cohort Loh, Po-Ru Palamara, Pier Francesco Price, Alkes L Nat Genet Article Recent work has leveraged the extensive genotyping of the Icelandic population to perform long-range phasing (LRP), enabling accurate imputation and association analysis of rare variants in target samples typed on genotyping arrays. Here, we develop a fast and accurate LRP method, Eagle, that extends this paradigm to populations with much smaller proportions of genotyped samples by harnessing long (>4cM) identical-by-descent (IBD) tracts shared among distantly related individuals. We applied Eagle to N≈150,000 samples (0.2% of the British population) from the UK Biobank, and we determined that it is 1–2 orders of magnitude faster than existing methods while achieving similar or better phasing accuracy (switch error rate ≈0.3%, corresponding to perfect phase in a majority of 10Mb segments). We also observed that when used within an imputation pipeline, Eagle pre-phasing improved downstream imputation accuracy compared to pre-phasing in batches using existing methods (as necessary to achieve comparable computational cost). 2016-06-06 2016-07 /pmc/articles/PMC4925291/ /pubmed/27270109 http://dx.doi.org/10.1038/ng.3571 Text en Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use:http://www.nature.com/authors/editorial_policies/license.html#terms |
spellingShingle | Article Loh, Po-Ru Palamara, Pier Francesco Price, Alkes L Fast and accurate long-range phasing in a UK Biobank cohort |
title | Fast and accurate long-range phasing in a UK Biobank cohort |
title_full | Fast and accurate long-range phasing in a UK Biobank cohort |
title_fullStr | Fast and accurate long-range phasing in a UK Biobank cohort |
title_full_unstemmed | Fast and accurate long-range phasing in a UK Biobank cohort |
title_short | Fast and accurate long-range phasing in a UK Biobank cohort |
title_sort | fast and accurate long-range phasing in a uk biobank cohort |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4925291/ https://www.ncbi.nlm.nih.gov/pubmed/27270109 http://dx.doi.org/10.1038/ng.3571 |
work_keys_str_mv | AT lohporu fastandaccuratelongrangephasinginaukbiobankcohort AT palamarapierfrancesco fastandaccuratelongrangephasinginaukbiobankcohort AT pricealkesl fastandaccuratelongrangephasinginaukbiobankcohort |