Cargando…

Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics

Emerging large-scale biobanks pairing genotype data with phenotype data present new opportunities to prioritize shared genetic associations across multiple phenotypes for molecular validation. Past research, by our group and others, has shown gene-level tests of association produce biologically inte...

Descripción completa

Detalles Bibliográficos
Autores principales: McGuirl, Melissa R., Smith, Samuel Pattillo, Sandstede, Björn, Ramachandran, Sohini
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Genetics Society of America 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268989/
https://www.ncbi.nlm.nih.gov/pubmed/32245788
http://dx.doi.org/10.1534/genetics.120.303096
_version_ 1783541706193371136
author McGuirl, Melissa R.
Smith, Samuel Pattillo
Sandstede, Björn
Ramachandran, Sohini
author_facet McGuirl, Melissa R.
Smith, Samuel Pattillo
Sandstede, Björn
Ramachandran, Sohini
author_sort McGuirl, Melissa R.
collection PubMed
description Emerging large-scale biobanks pairing genotype data with phenotype data present new opportunities to prioritize shared genetic associations across multiple phenotypes for molecular validation. Past research, by our group and others, has shown gene-level tests of association produce biologically interpretable characterization of the genetic architecture of a given phenotype. Here, we present a new method, Ward clustering to identify Internal Node branch length outliers using Gene Scores (WINGS), for identifying shared genetic architecture among multiple phenotypes. The objective of WINGS is to identify groups of phenotypes, or “clusters,” sharing a core set of genes enriched for mutations in cases. We validate WINGS using extensive simulation studies and then combine gene-level association tests with WINGS to identify shared genetic architecture among 81 case-control and seven quantitative phenotypes in 349,468 European-ancestry individuals from the UK Biobank. We identify eight prioritized phenotype clusters and recover multiple published gene-level associations within prioritized clusters.
format Online
Article
Text
id pubmed-7268989
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Genetics Society of America
record_format MEDLINE/PubMed
spelling pubmed-72689892020-06-30 Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics McGuirl, Melissa R. Smith, Samuel Pattillo Sandstede, Björn Ramachandran, Sohini Genetics Investigations Emerging large-scale biobanks pairing genotype data with phenotype data present new opportunities to prioritize shared genetic associations across multiple phenotypes for molecular validation. Past research, by our group and others, has shown gene-level tests of association produce biologically interpretable characterization of the genetic architecture of a given phenotype. Here, we present a new method, Ward clustering to identify Internal Node branch length outliers using Gene Scores (WINGS), for identifying shared genetic architecture among multiple phenotypes. The objective of WINGS is to identify groups of phenotypes, or “clusters,” sharing a core set of genes enriched for mutations in cases. We validate WINGS using extensive simulation studies and then combine gene-level association tests with WINGS to identify shared genetic architecture among 81 case-control and seven quantitative phenotypes in 349,468 European-ancestry individuals from the UK Biobank. We identify eight prioritized phenotype clusters and recover multiple published gene-level associations within prioritized clusters. Genetics Society of America 2020-06 2020-04-03 /pmc/articles/PMC7268989/ /pubmed/32245788 http://dx.doi.org/10.1534/genetics.120.303096 Text en Copyright © 2020 by the Genetics Society of America Available freely online through the author-supported open access option. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Investigations
McGuirl, Melissa R.
Smith, Samuel Pattillo
Sandstede, Björn
Ramachandran, Sohini
Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
title Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
title_full Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
title_fullStr Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
title_full_unstemmed Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
title_short Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
title_sort detecting shared genetic architecture among multiple phenotypes by hierarchical clustering of gene-level association statistics
topic Investigations
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268989/
https://www.ncbi.nlm.nih.gov/pubmed/32245788
http://dx.doi.org/10.1534/genetics.120.303096
work_keys_str_mv AT mcguirlmelissar detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics
AT smithsamuelpattillo detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics
AT sandstedebjorn detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics
AT ramachandransohini detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics