Causal graph-based analysis of genome-wide association data in rheumatoid arthritis

Bibliographic Details
Main authors: Alekseyenko, Alexander V, Lytkin, Nikita I, Ai, Jizhou, Ding, Bo, Padyukov, Leonid, Aliferis, Constantin F, Statnikov, Alexander
Format: Online Article Text
Language: English
Published: BioMed Central 2011
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3118953/
https://www.ncbi.nlm.nih.gov/pubmed/21592391
http://dx.doi.org/10.1186/1745-6150-6-25
author Alekseyenko, Alexander V
Lytkin, Nikita I
Ai, Jizhou
Ding, Bo
Padyukov, Leonid
Aliferis, Constantin F
Statnikov, Alexander
collection PubMed
description BACKGROUND: GWAS owe their popularity to the expectation that they will make a major impact on diagnosis, prognosis and management of disease by uncovering the genetics underlying clinical phenotypes. The dominant paradigm in GWAS data analysis so far relies extensively on methods that emphasize the contribution of individual SNPs to statistical association with phenotypes. Multivariate methods, however, can extract more information by considering associations of multiple SNPs simultaneously. Recent advances in other genomics domains pinpoint multivariate causal graph-based inference as a promising principled analysis framework for high-throughput data. Designed to discover biomarkers in the local causal pathway of the phenotype, these methods lead to accurate and highly parsimonious multivariate predictive models. In this paper, we investigate the applicability of the causal graph-based method TIE* to the analysis of GWAS data. To test the utility of TIE*, we focus on anti-CCP positive rheumatoid arthritis (RA) GWAS datasets, where there is a general consensus in the community about the major genetic determinants of the disease. RESULTS: Application of TIE* to the North American Rheumatoid Arthritis Cohort (NARAC) GWAS data yields six SNPs, mostly from the MHC locus. Using these SNPs we develop two predictive models that classify cases and disease-free controls with an area under the ROC curve of 0.81, as verified in independent testing data from the same cohort. The predictive performance of these models generalizes reasonably well to Swedish subjects from the closely related but not identical Epidemiological Investigation of Rheumatoid Arthritis (EIRA) cohort, with an area under the ROC curve of 0.71-0.78. Moreover, the SNPs identified by the TIE* method render many other previously known SNP associations conditionally independent of the phenotype.
CONCLUSIONS: Our experiments demonstrate that application of TIE* captures the maximum amount of genetic information about RA in the data and recapitulates the major consensus findings about the genetic factors of this disease. In addition, TIE* yields reproducible markers and signatures of RA. This suggests that a principled multivariate causal and predictive framework for GWAS analysis empowers the community with a new tool for high-quality and more efficient discovery. REVIEWERS: This article was reviewed by Prof. Anthony Almudevar, Dr. Eugene V. Koonin, and Prof. Marianthi Markatou.
format Online
Article
Text
id pubmed-3118953
institution National Center for Biotechnology Information
language English
publishDate 2011
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-3118953 2011-06-22 Causal graph-based analysis of genome-wide association data in rheumatoid arthritis Alekseyenko, Alexander V Lytkin, Nikita I Ai, Jizhou Ding, Bo Padyukov, Leonid Aliferis, Constantin F Statnikov, Alexander Biol Direct Research BioMed Central 2011-05-18 /pmc/articles/PMC3118953/ /pubmed/21592391 http://dx.doi.org/10.1186/1745-6150-6-25 Text en Copyright ©2011 Alekseyenko et al; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
title Causal graph-based analysis of genome-wide association data in rheumatoid arthritis
topic Research