Cargando…
Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics
Emerging large-scale biobanks pairing genotype data with phenotype data present new opportunities to prioritize shared genetic associations across multiple phenotypes for molecular validation. Past research, by our group and others, has shown gene-level tests of association produce biologically inte...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Genetics Society of America
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268989/ https://www.ncbi.nlm.nih.gov/pubmed/32245788 http://dx.doi.org/10.1534/genetics.120.303096 |
_version_ | 1783541706193371136 |
---|---|
author | McGuirl, Melissa R. Smith, Samuel Pattillo Sandstede, Björn Ramachandran, Sohini |
author_facet | McGuirl, Melissa R. Smith, Samuel Pattillo Sandstede, Björn Ramachandran, Sohini |
author_sort | McGuirl, Melissa R. |
collection | PubMed |
description | Emerging large-scale biobanks pairing genotype data with phenotype data present new opportunities to prioritize shared genetic associations across multiple phenotypes for molecular validation. Past research, by our group and others, has shown gene-level tests of association produce biologically interpretable characterization of the genetic architecture of a given phenotype. Here, we present a new method, Ward clustering to identify Internal Node branch length outliers using Gene Scores (WINGS), for identifying shared genetic architecture among multiple phenotypes. The objective of WINGS is to identify groups of phenotypes, or “clusters,” sharing a core set of genes enriched for mutations in cases. We validate WINGS using extensive simulation studies and then combine gene-level association tests with WINGS to identify shared genetic architecture among 81 case-control and seven quantitative phenotypes in 349,468 European-ancestry individuals from the UK Biobank. We identify eight prioritized phenotype clusters and recover multiple published gene-level associations within prioritized clusters. |
format | Online Article Text |
id | pubmed-7268989 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Genetics Society of America |
record_format | MEDLINE/PubMed |
spelling | pubmed-72689892020-06-30 Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics McGuirl, Melissa R. Smith, Samuel Pattillo Sandstede, Björn Ramachandran, Sohini Genetics Investigations Emerging large-scale biobanks pairing genotype data with phenotype data present new opportunities to prioritize shared genetic associations across multiple phenotypes for molecular validation. Past research, by our group and others, has shown gene-level tests of association produce biologically interpretable characterization of the genetic architecture of a given phenotype. Here, we present a new method, Ward clustering to identify Internal Node branch length outliers using Gene Scores (WINGS), for identifying shared genetic architecture among multiple phenotypes. The objective of WINGS is to identify groups of phenotypes, or “clusters,” sharing a core set of genes enriched for mutations in cases. We validate WINGS using extensive simulation studies and then combine gene-level association tests with WINGS to identify shared genetic architecture among 81 case-control and seven quantitative phenotypes in 349,468 European-ancestry individuals from the UK Biobank. We identify eight prioritized phenotype clusters and recover multiple published gene-level associations within prioritized clusters. Genetics Society of America 2020-06 2020-04-03 /pmc/articles/PMC7268989/ /pubmed/32245788 http://dx.doi.org/10.1534/genetics.120.303096 Text en Copyright © 2020 by the Genetics Society of America Available freely online through the author-supported open access option. This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Investigations McGuirl, Melissa R. Smith, Samuel Pattillo Sandstede, Björn Ramachandran, Sohini Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics |
title | Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics |
title_full | Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics |
title_fullStr | Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics |
title_full_unstemmed | Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics |
title_short | Detecting Shared Genetic Architecture Among Multiple Phenotypes by Hierarchical Clustering of Gene-Level Association Statistics |
title_sort | detecting shared genetic architecture among multiple phenotypes by hierarchical clustering of gene-level association statistics |
topic | Investigations |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7268989/ https://www.ncbi.nlm.nih.gov/pubmed/32245788 http://dx.doi.org/10.1534/genetics.120.303096 |
work_keys_str_mv | AT mcguirlmelissar detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics AT smithsamuelpattillo detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics AT sandstedebjorn detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics AT ramachandransohini detectingsharedgeneticarchitectureamongmultiplephenotypesbyhierarchicalclusteringofgenelevelassociationstatistics |