Cargando…

Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits

Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Brian C., Biddanda, Arjun, Gunnarsson, Árni Freyr, Cooper, Fergus, Palamara, Pier Francesco
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181934/
https://www.ncbi.nlm.nih.gov/pubmed/37127670
http://dx.doi.org/10.1038/s41588-023-01379-x
_version_ 1785041682036686848
author Zhang, Brian C.
Biddanda, Arjun
Gunnarsson, Árni Freyr
Cooper, Fergus
Palamara, Pier Francesco
author_facet Zhang, Brian C.
Biddanda, Arjun
Gunnarsson, Árni Freyr
Cooper, Fergus
Palamara, Pier Francesco
author_sort Zhang, Brian C.
collection PubMed
description Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping array data, as well as strategies to utilize genealogies to perform association and other complex trait analyses. We use these methods to build genome-wide genealogies using genotyping data for 337,464 UK Biobank individuals and test for association across seven complex traits. Genealogy-based association detects more rare and ultra-rare signals (N = 134, frequency range 0.0007−0.1%) than genotype imputation using ~65,000 sequenced haplotypes (N = 64). In a subset of 138,039 exome sequencing samples, these associations strongly tag (average r = 0.72) underlying sequencing variants enriched (4.8×) for loss-of-function variation. These results demonstrate that inferred genome-wide genealogies may be leveraged in the analysis of complex traits, complementing approaches that require the availability of large, population-specific sequencing panels.
format Online
Article
Text
id pubmed-10181934
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group US
record_format MEDLINE/PubMed
spelling pubmed-101819342023-05-14 Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits Zhang, Brian C. Biddanda, Arjun Gunnarsson, Árni Freyr Cooper, Fergus Palamara, Pier Francesco Nat Genet Article Genome-wide genealogies compactly represent the evolutionary history of a set of genomes and inferring them from genetic data has the potential to facilitate a wide range of analyses. We introduce a method, ARG-Needle, for accurately inferring biobank-scale genealogies from sequencing or genotyping array data, as well as strategies to utilize genealogies to perform association and other complex trait analyses. We use these methods to build genome-wide genealogies using genotyping data for 337,464 UK Biobank individuals and test for association across seven complex traits. Genealogy-based association detects more rare and ultra-rare signals (N = 134, frequency range 0.0007−0.1%) than genotype imputation using ~65,000 sequenced haplotypes (N = 64). In a subset of 138,039 exome sequencing samples, these associations strongly tag (average r = 0.72) underlying sequencing variants enriched (4.8×) for loss-of-function variation. These results demonstrate that inferred genome-wide genealogies may be leveraged in the analysis of complex traits, complementing approaches that require the availability of large, population-specific sequencing panels. Nature Publishing Group US 2023-05-01 2023 /pmc/articles/PMC10181934/ /pubmed/37127670 http://dx.doi.org/10.1038/s41588-023-01379-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Zhang, Brian C.
Biddanda, Arjun
Gunnarsson, Árni Freyr
Cooper, Fergus
Palamara, Pier Francesco
Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
title Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
title_full Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
title_fullStr Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
title_full_unstemmed Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
title_short Biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
title_sort biobank-scale inference of ancestral recombination graphs enables genealogical analysis of complex traits
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10181934/
https://www.ncbi.nlm.nih.gov/pubmed/37127670
http://dx.doi.org/10.1038/s41588-023-01379-x
work_keys_str_mv AT zhangbrianc biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits
AT biddandaarjun biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits
AT gunnarssonarnifreyr biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits
AT cooperfergus biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits
AT palamarapierfrancesco biobankscaleinferenceofancestralrecombinationgraphsenablesgenealogicalanalysisofcomplextraits