Cargando…

Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning

Practical application of genomic-based risk stratification to clinical diagnosis is appealing yet performance varies widely depending on the disease and genomic risk score (GRS) method. Celiac disease (CD), a common immune-mediated illness, is strongly genetically determined and requires specific HL...

Descripción completa

Detalles Bibliográficos
Autores principales: Abraham, Gad, Tye-Din, Jason A., Bhalala, Oneil G., Kowalczyk, Adam, Zobel, Justin, Inouye, Michael
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2014
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3923679/
https://www.ncbi.nlm.nih.gov/pubmed/24550740
http://dx.doi.org/10.1371/journal.pgen.1004137
_version_ 1782303639827644416
author Abraham, Gad
Tye-Din, Jason A.
Bhalala, Oneil G.
Kowalczyk, Adam
Zobel, Justin
Inouye, Michael
author_facet Abraham, Gad
Tye-Din, Jason A.
Bhalala, Oneil G.
Kowalczyk, Adam
Zobel, Justin
Inouye, Michael
author_sort Abraham, Gad
collection PubMed
description Practical application of genomic-based risk stratification to clinical diagnosis is appealing yet performance varies widely depending on the disease and genomic risk score (GRS) method. Celiac disease (CD), a common immune-mediated illness, is strongly genetically determined and requires specific HLA haplotypes. HLA testing can exclude diagnosis but has low specificity, providing little information suitable for clinical risk stratification. Using six European cohorts, we provide a proof-of-concept that statistical learning approaches which simultaneously model all SNPs can generate robust and highly accurate predictive models of CD based on genome-wide SNP profiles. The high predictive capacity replicated both in cross-validation within each cohort (AUC of 0.87–0.89) and in independent replication across cohorts (AUC of 0.86–0.9), despite differences in ethnicity. The models explained 30–35% of disease variance and up to ∼43% of heritability. The GRS's utility was assessed in different clinically relevant settings. Comparable to HLA typing, the GRS can be used to identify individuals without CD with ≥99.6% negative predictive value however, unlike HLA typing, fine-scale stratification of individuals into categories of higher-risk for CD can identify those that would benefit from more invasive and costly definitive testing. The GRS is flexible and its performance can be adapted to the clinical situation by adjusting the threshold cut-off. Despite explaining a minority of disease heritability, our findings indicate a genomic risk score provides clinically relevant information to improve upon current diagnostic pathways for CD and support further studies evaluating the clinical utility of this approach in CD and other complex diseases.
format Online
Article
Text
id pubmed-3923679
institution National Center for Biotechnology Information
language English
publishDate 2014
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-39236792014-02-18 Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning Abraham, Gad Tye-Din, Jason A. Bhalala, Oneil G. Kowalczyk, Adam Zobel, Justin Inouye, Michael PLoS Genet Research Article Practical application of genomic-based risk stratification to clinical diagnosis is appealing yet performance varies widely depending on the disease and genomic risk score (GRS) method. Celiac disease (CD), a common immune-mediated illness, is strongly genetically determined and requires specific HLA haplotypes. HLA testing can exclude diagnosis but has low specificity, providing little information suitable for clinical risk stratification. Using six European cohorts, we provide a proof-of-concept that statistical learning approaches which simultaneously model all SNPs can generate robust and highly accurate predictive models of CD based on genome-wide SNP profiles. The high predictive capacity replicated both in cross-validation within each cohort (AUC of 0.87–0.89) and in independent replication across cohorts (AUC of 0.86–0.9), despite differences in ethnicity. The models explained 30–35% of disease variance and up to ∼43% of heritability. The GRS's utility was assessed in different clinically relevant settings. Comparable to HLA typing, the GRS can be used to identify individuals without CD with ≥99.6% negative predictive value however, unlike HLA typing, fine-scale stratification of individuals into categories of higher-risk for CD can identify those that would benefit from more invasive and costly definitive testing. The GRS is flexible and its performance can be adapted to the clinical situation by adjusting the threshold cut-off. Despite explaining a minority of disease heritability, our findings indicate a genomic risk score provides clinically relevant information to improve upon current diagnostic pathways for CD and support further studies evaluating the clinical utility of this approach in CD and other complex diseases. Public Library of Science 2014-02-13 /pmc/articles/PMC3923679/ /pubmed/24550740 http://dx.doi.org/10.1371/journal.pgen.1004137 Text en © 2014 Abraham et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle Research Article
Abraham, Gad
Tye-Din, Jason A.
Bhalala, Oneil G.
Kowalczyk, Adam
Zobel, Justin
Inouye, Michael
Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning
title Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning
title_full Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning
title_fullStr Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning
title_full_unstemmed Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning
title_short Accurate and Robust Genomic Prediction of Celiac Disease Using Statistical Learning
title_sort accurate and robust genomic prediction of celiac disease using statistical learning
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3923679/
https://www.ncbi.nlm.nih.gov/pubmed/24550740
http://dx.doi.org/10.1371/journal.pgen.1004137
work_keys_str_mv AT abrahamgad accurateandrobustgenomicpredictionofceliacdiseaseusingstatisticallearning
AT tyedinjasona accurateandrobustgenomicpredictionofceliacdiseaseusingstatisticallearning
AT bhalalaoneilg accurateandrobustgenomicpredictionofceliacdiseaseusingstatisticallearning
AT kowalczykadam accurateandrobustgenomicpredictionofceliacdiseaseusingstatisticallearning
AT zobeljustin accurateandrobustgenomicpredictionofceliacdiseaseusingstatisticallearning
AT inouyemichael accurateandrobustgenomicpredictionofceliacdiseaseusingstatisticallearning