Cargando…

Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies

Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS pa...

Descripción completa

Detalles Bibliográficos
Autores principales: Chang, Diana, Keinan, Alon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4161298/
https://www.ncbi.nlm.nih.gov/pubmed/25211452
http://dx.doi.org/10.1371/journal.pcbi.1003820
_version_ 1782334522641088512
author Chang, Diana
Keinan, Alon
author_facet Chang, Diana
Keinan, Alon
author_sort Chang, Diana
collection PubMed
description Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS paradigm by analyzing the diseases themselves, across GWAS datasets, to explore their “shared pathogenetics”. The method applies principal component analysis (PCA) to gene-level significance scores across all genes and across GWASs, thereby revealing shared pathogenetics between diseases in an unsupervised fashion. Importantly, it adjusts for potential sources of heterogeneity present between GWAS which can confound investigation of shared disease etiology. We applied disPCA to 31 GWASs, including autoimmune diseases, cancers, psychiatric disorders, and neurological disorders. The leading principal components separate these disease classes, as well as inflammatory bowel diseases from other autoimmune diseases. Generally, distinct diseases from the same class tend to be less separated, which is in line with their increased shared etiology. Enrichment analysis of genes contributing to leading principal components revealed pathways that are implicated in the immune system, while also pointing to pathways that have yet to be explored before in this context. Our results point to the potential of disPCA in going beyond epidemiological findings of the co-occurrence of distinct diseases, to highlighting novel genes and pathways that unsupervised learning suggest to be key players in the variability across diseases.
format Online
Article
Text
id pubmed-4161298
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-41612982014-09-17 Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies Chang, Diana Keinan, Alon PLoS Comput Biol Research Article Genome-wide association studies (GWASs) have recently revealed many genetic associations that are shared between different diseases. We propose a method, disPCA, for genome-wide characterization of shared and distinct risk factors between and within disease classes. It flips the conventional GWAS paradigm by analyzing the diseases themselves, across GWAS datasets, to explore their “shared pathogenetics”. The method applies principal component analysis (PCA) to gene-level significance scores across all genes and across GWASs, thereby revealing shared pathogenetics between diseases in an unsupervised fashion. Importantly, it adjusts for potential sources of heterogeneity present between GWAS which can confound investigation of shared disease etiology. We applied disPCA to 31 GWASs, including autoimmune diseases, cancers, psychiatric disorders, and neurological disorders. The leading principal components separate these disease classes, as well as inflammatory bowel diseases from other autoimmune diseases. Generally, distinct diseases from the same class tend to be less separated, which is in line with their increased shared etiology. Enrichment analysis of genes contributing to leading principal components revealed pathways that are implicated in the immune system, while also pointing to pathways that have yet to be explored before in this context. Our results point to the potential of disPCA in going beyond epidemiological findings of the co-occurrence of distinct diseases, to highlighting novel genes and pathways that unsupervised learning suggest to be key players in the variability across diseases. Public Library of Science 2014-09-11 /pmc/articles/PMC4161298/ /pubmed/25211452 http://dx.doi.org/10.1371/journal.pcbi.1003820 Text en © 2014 Chang, Keinan http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Chang, Diana
Keinan, Alon
Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies
title Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies
title_full Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies
title_fullStr Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies
title_full_unstemmed Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies
title_short Principal Component Analysis Characterizes Shared Pathogenetics from Genome-Wide Association Studies
title_sort principal component analysis characterizes shared pathogenetics from genome-wide association studies
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4161298/
https://www.ncbi.nlm.nih.gov/pubmed/25211452
http://dx.doi.org/10.1371/journal.pcbi.1003820
work_keys_str_mv AT changdiana principalcomponentanalysischaracterizessharedpathogeneticsfromgenomewideassociationstudies
AT keinanalon principalcomponentanalysischaracterizessharedpathogeneticsfromgenomewideassociationstudies