Cargando…
Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations
Genome wide association studies provide statistical measures of gene–trait associations that reveal how genetic variation influences phenotypes. This study develops an unsupervised dimensionality reduction method called UnTANGLeD (Unsupervised Trait Analysis of Networks from Gene Level Data) which o...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Oxford University Press
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9410900/ https://www.ncbi.nlm.nih.gov/pubmed/35716123 http://dx.doi.org/10.1093/nar/gkac413 |
_version_ | 1784775199273517056 |
---|---|
author | Mizikovsky, Dalia Naval Sanchez, Marina Nefzger, Christian M Cuellar Partida, Gabriel Palpant, Nathan J |
author_facet | Mizikovsky, Dalia Naval Sanchez, Marina Nefzger, Christian M Cuellar Partida, Gabriel Palpant, Nathan J |
author_sort | Mizikovsky, Dalia |
collection | PubMed |
description | Genome wide association studies provide statistical measures of gene–trait associations that reveal how genetic variation influences phenotypes. This study develops an unsupervised dimensionality reduction method called UnTANGLeD (Unsupervised Trait Analysis of Networks from Gene Level Data) which organizes 16,849 genes into discrete gene programs by measuring the statistical association between genetic variants and 1,393 diverse complex traits. UnTANGLeD reveals 173 gene clusters enriched for protein–protein interactions and highly distinct biological processes governing development, signalling, disease, and homeostasis. We identify diverse gene networks with robust interactions but not associated with known biological processes. Analysis of independent disease traits shows that UnTANGLeD gene clusters are conserved across all complex traits, providing a simple and powerful framework to predict novel gene candidates and programs influencing orthogonal disease phenotypes. Collectively, this study demonstrates that gene programs co-ordinately orchestrating cell functions can be identified without reliance on prior knowledge, providing a method for use in functional annotation, hypothesis generation, machine learning and prediction algorithms, and the interpretation of diverse genomic data. |
format | Online Article Text |
id | pubmed-9410900 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Oxford University Press |
record_format | MEDLINE/PubMed |
spelling | pubmed-94109002022-08-26 Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations Mizikovsky, Dalia Naval Sanchez, Marina Nefzger, Christian M Cuellar Partida, Gabriel Palpant, Nathan J Nucleic Acids Res Methods Online Genome wide association studies provide statistical measures of gene–trait associations that reveal how genetic variation influences phenotypes. This study develops an unsupervised dimensionality reduction method called UnTANGLeD (Unsupervised Trait Analysis of Networks from Gene Level Data) which organizes 16,849 genes into discrete gene programs by measuring the statistical association between genetic variants and 1,393 diverse complex traits. UnTANGLeD reveals 173 gene clusters enriched for protein–protein interactions and highly distinct biological processes governing development, signalling, disease, and homeostasis. We identify diverse gene networks with robust interactions but not associated with known biological processes. Analysis of independent disease traits shows that UnTANGLeD gene clusters are conserved across all complex traits, providing a simple and powerful framework to predict novel gene candidates and programs influencing orthogonal disease phenotypes. Collectively, this study demonstrates that gene programs co-ordinately orchestrating cell functions can be identified without reliance on prior knowledge, providing a method for use in functional annotation, hypothesis generation, machine learning and prediction algorithms, and the interpretation of diverse genomic data. Oxford University Press 2022-06-18 /pmc/articles/PMC9410900/ /pubmed/35716123 http://dx.doi.org/10.1093/nar/gkac413 Text en © The Author(s) 2022. Published by Oxford University Press on behalf of Nucleic Acids Research. https://creativecommons.org/licenses/by-nc/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License (https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com |
spellingShingle | Methods Online Mizikovsky, Dalia Naval Sanchez, Marina Nefzger, Christian M Cuellar Partida, Gabriel Palpant, Nathan J Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
title | Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
title_full | Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
title_fullStr | Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
title_full_unstemmed | Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
title_short | Organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
title_sort | organization of gene programs revealed by unsupervised analysis of diverse gene–trait associations |
topic | Methods Online |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9410900/ https://www.ncbi.nlm.nih.gov/pubmed/35716123 http://dx.doi.org/10.1093/nar/gkac413 |
work_keys_str_mv | AT mizikovskydalia organizationofgeneprogramsrevealedbyunsupervisedanalysisofdiversegenetraitassociations AT navalsanchezmarina organizationofgeneprogramsrevealedbyunsupervisedanalysisofdiversegenetraitassociations AT nefzgerchristianm organizationofgeneprogramsrevealedbyunsupervisedanalysisofdiversegenetraitassociations AT cuellarpartidagabriel organizationofgeneprogramsrevealedbyunsupervisedanalysisofdiversegenetraitassociations AT palpantnathanj organizationofgeneprogramsrevealedbyunsupervisedanalysisofdiversegenetraitassociations |