Cargando…

netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data

The polygenic risk score (PRS) can help to identify individuals’ genetic susceptibility for various diseases by combining patient genetic profiles and identified single-nucleotide polymorphisms (SNPs) from genome-wide association studies. Although multiple diseases will usually afflict patients at o...

Descripción completa

Detalles Bibliográficos
Autores principales: Nam, Yonghyun, Jung, Sang-Hyuk, Verma, Anurag, Sriram, Vivek, Won, Hong-Hee, Yun, Jae-Seung, Kim, Dokyoon
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8682919/
https://www.ncbi.nlm.nih.gov/pubmed/34890160
_version_ 1784617313244282880
author Nam, Yonghyun
Jung, Sang-Hyuk
Verma, Anurag
Sriram, Vivek
Won, Hong-Hee
Yun, Jae-Seung
Kim, Dokyoon
author_facet Nam, Yonghyun
Jung, Sang-Hyuk
Verma, Anurag
Sriram, Vivek
Won, Hong-Hee
Yun, Jae-Seung
Kim, Dokyoon
author_sort Nam, Yonghyun
collection PubMed
description The polygenic risk score (PRS) can help to identify individuals’ genetic susceptibility for various diseases by combining patient genetic profiles and identified single-nucleotide polymorphisms (SNPs) from genome-wide association studies. Although multiple diseases will usually afflict patients at once or in succession, conventional PRSs fail to consider genetic relationships across multiple diseases. Even multi-trait PRSs, which take into account genetic effects for more than one disease at a time, fail to consider a sufficient number of phenotypes to accurately reflect the state of disease comorbidity in a patient, or are biased in terms of the traits that are selected. Thus, we developed novel network-based comorbidity risk scores to quantify associations among multiple phenotypes from phenome-wide association studies (PheWAS). We first constructed a disease-SNP heterogeneous multi-layered network (DS-Net), which consists of a disease network (disease-layer) and SNP network (SNP-layer). The disease-layer describes the population-level interactome from PheWAS data. The SNP-layer was constructed according to linkage disequilibrium. Both layers were attached to transform the information from a population-level interactome to individual-level inferences. Then, graph-based semi-supervised learning was applied to predict possible comorbidity scores on disease-layer for each subject. The SNP-layer serves as receiving individual genotyping data in the scoring process, and the disease-layer serves as the propagated output for an individual’s multiple disease comorbidity scores. The possible comorbidity scores were combined by logistic regression, and it is denoted as netCRS. The DS-Net was constructed from UK Biobank PheWAS data, and the individual genetic profiles were collected from the Penn Medicine Biobank. As a proof-of-concept study, myocardial infarction (MI) was selected to compare netCRS with the PRS with pruning and thresholding (PRS-PT). The combined model (netCRS + PRS-PT + covariates) achieved an AUC improvement of 6.26% compared to the (PRS-PT + covariates) model. In terms of risk stratification, the combined model was able to capture the risk of MI up to approximately eight-fold higher than that of the low-risk group. The netCRS and PRS-PT complement each other in predicting high-risk groups of patients with MI. We expect that using these risk prediction models will allow for the development of prevention strategies and reduction of MI morbidity and mortality.
format Online
Article
Text
id pubmed-8682919
institution National Center for Biotechnology Information
language English
publishDate 2022
record_format MEDLINE/PubMed
spelling pubmed-86829192022-01-01 netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data Nam, Yonghyun Jung, Sang-Hyuk Verma, Anurag Sriram, Vivek Won, Hong-Hee Yun, Jae-Seung Kim, Dokyoon Pac Symp Biocomput Article The polygenic risk score (PRS) can help to identify individuals’ genetic susceptibility for various diseases by combining patient genetic profiles and identified single-nucleotide polymorphisms (SNPs) from genome-wide association studies. Although multiple diseases will usually afflict patients at once or in succession, conventional PRSs fail to consider genetic relationships across multiple diseases. Even multi-trait PRSs, which take into account genetic effects for more than one disease at a time, fail to consider a sufficient number of phenotypes to accurately reflect the state of disease comorbidity in a patient, or are biased in terms of the traits that are selected. Thus, we developed novel network-based comorbidity risk scores to quantify associations among multiple phenotypes from phenome-wide association studies (PheWAS). We first constructed a disease-SNP heterogeneous multi-layered network (DS-Net), which consists of a disease network (disease-layer) and SNP network (SNP-layer). The disease-layer describes the population-level interactome from PheWAS data. The SNP-layer was constructed according to linkage disequilibrium. Both layers were attached to transform the information from a population-level interactome to individual-level inferences. Then, graph-based semi-supervised learning was applied to predict possible comorbidity scores on disease-layer for each subject. The SNP-layer serves as receiving individual genotyping data in the scoring process, and the disease-layer serves as the propagated output for an individual’s multiple disease comorbidity scores. The possible comorbidity scores were combined by logistic regression, and it is denoted as netCRS. The DS-Net was constructed from UK Biobank PheWAS data, and the individual genetic profiles were collected from the Penn Medicine Biobank. As a proof-of-concept study, myocardial infarction (MI) was selected to compare netCRS with the PRS with pruning and thresholding (PRS-PT). The combined model (netCRS + PRS-PT + covariates) achieved an AUC improvement of 6.26% compared to the (PRS-PT + covariates) model. In terms of risk stratification, the combined model was able to capture the risk of MI up to approximately eight-fold higher than that of the low-risk group. The netCRS and PRS-PT complement each other in predicting high-risk groups of patients with MI. We expect that using these risk prediction models will allow for the development of prevention strategies and reduction of MI morbidity and mortality. 2022 /pmc/articles/PMC8682919/ /pubmed/34890160 Text en https://creativecommons.org/licenses/by-nc/4.0/Open Access chapter published by World Scientific Publishing Company and distributed under the terms of the Creative Commons Attribution Non-Commercial (CC BY-NC) 4.0 License.
spellingShingle Article
Nam, Yonghyun
Jung, Sang-Hyuk
Verma, Anurag
Sriram, Vivek
Won, Hong-Hee
Yun, Jae-Seung
Kim, Dokyoon
netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data
title netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data
title_full netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data
title_fullStr netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data
title_full_unstemmed netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data
title_short netCRS: Network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled PheWAS data
title_sort netcrs: network-based comorbidity risk score for prediction of myocardial infarction using biobank-scaled phewas data
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8682919/
https://www.ncbi.nlm.nih.gov/pubmed/34890160
work_keys_str_mv AT namyonghyun netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT jungsanghyuk netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT vermaanurag netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT sriramvivek netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT wonhonghee netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT yunjaeseung netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata
AT kimdokyoon netcrsnetworkbasedcomorbidityriskscoreforpredictionofmyocardialinfarctionusingbiobankscaledphewasdata