Cargando…
Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181934/ https://www.ncbi.nlm.nih.gov/pubmed/37127670 http://dx.doi.org/10.1038/s41588-023-01379-x |
_version_ | 1785041682036686848 |
---|---|
author | Zhang, Brian C. Biddanda, Arjun Gunnarsson, Árni Freyr Cooper, Fergus Palamara, Pier Francesco |
author_facet | Zhang, Brian C. Biddanda, Arjun Gunnarsson, Árni Freyr Cooper, Fergus Palamara, Pier Francesco |
author_sort | Zhang, Brian C. |
collection | PubMed |
description | Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping array data, as well as strategies to utilize genealogies to perform association and other complex trait analyses. We use these methods to build genome-wide genealogies using genotyping data for 337,464 UK Biobank individuals and test for association across seven complex traits. Genealogy-based association detects more rare and ultra-rare signals (N = 134, frequency range 0.0007−0.1%) than genotype imputation using ~65,000 sequenced haplotypes (N = 64). In a subset of 138,039 exome sequencing samples, these associations strongly tag (average r = 0.72) underlying sequencing variants enriched (4.8×) for loss-of-function variation. These results demonstrate that inferred genome-wide genealogies may be leveraged in the analysis of complex traits, complementing approaches that require the availability of large, population-specific sequencing panels. |
format | Online Article Text |
id | pubmed-10181934 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group US |
record_format | MEDLINE/PubMed |
spelling | pubmed-101819342023-05-14 Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits Zhang, Brian C. Biddanda, Arjun Gunnarsson, Árni Freyr Cooper, Fergus Palamara, Pier Francesco Nat Genet Article Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping array data, as well as strategies to utilize genealogies to perform association and other complex trait analyses. We use these methods to build genome-wide genealogies using genotyping data for 337,464 UK Biobank individuals and test for association across seven complex traits. Genealogy-based association detects more rare and ultra-rare signals (N = 134, frequency range 0.0007−0.1%) than genotype imputation using ~65,000 sequenced haplotypes (N = 64). In a subset of 138,039 exome sequencing samples, these associations strongly tag (average r = 0.72) underlying sequencing variants enriched (4.8×) for loss-of-function variation. These results demonstrate that inferred genome-wide genealogies may be leveraged in the analysis of complex traits, complementing approaches that require the availability of large, population-specific sequencing panels. Nature Publishing Group US 2023-05-01 2023 /pmc/articles/PMC10181934/ /pubmed/37127670 http://dx.doi.org/10.1038/s41588-023-01379-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Zhang, Brian C. Biddanda, Arjun Gunnarsson, Árni Freyr Cooper, Fergus Palamara, Pier Francesco Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
title | Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
title_full | Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
title_fullStr | Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
title_full_unstemmed | Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
title_short | Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
title_sort | biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181934/ https://www.ncbi.nlm.nih.gov/pubmed/37127670 http://dx.doi.org/10.1038/s41588-023-01379-x |
work_keys_str_mv | AT zhangbrianc biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits AT biddandaarjun biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits AT gunnarssonarnifreyr biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits AT cooperfergus biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits AT palamarapierfrancesco biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits |