Cargando…

Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells

OBJECTIVE: Coronary artery disease (CAD) is a serious global health concern. Current diagnostic methods for CAD involve risk to the patient and are costly, so better diagnostic tools are needed. We defined four classifiers based on gene expression profiles in peripheral blood mononuclear cells and d...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Jie, Wang, Xiaodong, Lin, Junhua, Li, Shaohua, Deng, Guoxiong, Wei, Jinru
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Dove 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8450378/
https://www.ncbi.nlm.nih.gov/pubmed/34552349
http://dx.doi.org/10.2147/IJGM.S329005
_version_ 1784569631078350848
author Liu, Jie
Wang, Xiaodong
Lin, Junhua
Li, Shaohua
Deng, Guoxiong
Wei, Jinru
author_facet Liu, Jie
Wang, Xiaodong
Lin, Junhua
Li, Shaohua
Deng, Guoxiong
Wei, Jinru
author_sort Liu, Jie
collection PubMed
description OBJECTIVE: Coronary artery disease (CAD) is a serious global health concern. Current diagnostic methods for CAD involve risk to the patient and are costly, so better diagnostic tools are needed. We defined four classifiers based on gene expression profiles in peripheral blood mononuclear cells and determined their potential for CAD detection. METHODS: We downloaded a CAD-related data set (GSE113079) from the Gene Expression Omnibus (GEO) database. We identified differentially expressed genes (DEGs) in peripheral blood mononuclear cells between CAD samples and healthy controls. DEGs were analyzed for functional enrichment. To create a robust CAD classifier, DEGs were identified by feature selection using the principal component analysis. Then, least absolute shrinkage and selection operator (LASSO) logistic regression, random forest, and support vector machine (SVM) models were created. Gene set variation analysis (GSVA) score and gene set enrichment analysis (GSEA) were also conducted. The performance of the models was evaluated in terms of the area under receiver operating characteristic curves (AUC). RESULTS: In the training set, we found 135 up-regulated genes and 104 down-regulated genes in CAD patients compared with controls. The DEGs were involved in some pathways associated with CAD, such as pathways involving calcium and interleukin-17 signaling. Twenty genes were identified as optimal features and used to generate the logistic classifier based on LASSO. The AUC for the classifier was 1.00 in the training set and 0.997 in the test set. Using the 20 DEGs, SVM and random forest classifiers were also generated and showed high diagnostic efficacy, with respective AUCs of 0.997 and 1.00 against the training set. A GSVA score was also established using the top 20 significant DEGs, which showed an AUC of 0.971 in the training set and 0.989 in the test set. Furthermore, GSEA showed autophagy and the proteasome to be major pathways involving the DEGs. CONCLUSION: We identified a set of genes specific for CAD whose expression can be measured non-invasively. Using these genes, we defined four diagnostic classifiers using multiple methods.
format Online
Article
Text
id pubmed-8450378
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Dove
record_format MEDLINE/PubMed
spelling pubmed-84503782021-09-21 Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells Liu, Jie Wang, Xiaodong Lin, Junhua Li, Shaohua Deng, Guoxiong Wei, Jinru Int J Gen Med Original Research OBJECTIVE: Coronary artery disease (CAD) is a serious global health concern. Current diagnostic methods for CAD involve risk to the patient and are costly, so better diagnostic tools are needed. We defined four classifiers based on gene expression profiles in peripheral blood mononuclear cells and determined their potential for CAD detection. METHODS: We downloaded a CAD-related data set (GSE113079) from the Gene Expression Omnibus (GEO) database. We identified differentially expressed genes (DEGs) in peripheral blood mononuclear cells between CAD samples and healthy controls. DEGs were analyzed for functional enrichment. To create a robust CAD classifier, DEGs were identified by feature selection using the principal component analysis. Then, least absolute shrinkage and selection operator (LASSO) logistic regression, random forest, and support vector machine (SVM) models were created. Gene set variation analysis (GSVA) score and gene set enrichment analysis (GSEA) were also conducted. The performance of the models was evaluated in terms of the area under receiver operating characteristic curves (AUC). RESULTS: In the training set, we found 135 up-regulated genes and 104 down-regulated genes in CAD patients compared with controls. The DEGs were involved in some pathways associated with CAD, such as pathways involving calcium and interleukin-17 signaling. Twenty genes were identified as optimal features and used to generate the logistic classifier based on LASSO. The AUC for the classifier was 1.00 in the training set and 0.997 in the test set. Using the 20 DEGs, SVM and random forest classifiers were also generated and showed high diagnostic efficacy, with respective AUCs of 0.997 and 1.00 against the training set. A GSVA score was also established using the top 20 significant DEGs, which showed an AUC of 0.971 in the training set and 0.989 in the test set. Furthermore, GSEA showed autophagy and the proteasome to be major pathways involving the DEGs. CONCLUSION: We identified a set of genes specific for CAD whose expression can be measured non-invasively. Using these genes, we defined four diagnostic classifiers using multiple methods. Dove 2021-09-15 /pmc/articles/PMC8450378/ /pubmed/34552349 http://dx.doi.org/10.2147/IJGM.S329005 Text en © 2021 Liu et al. https://creativecommons.org/licenses/by-nc/3.0/This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution – Non Commercial (unported, v3.0) License (http://creativecommons.org/licenses/by-nc/3.0/ (https://creativecommons.org/licenses/by-nc/3.0/) ). By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms (https://www.dovepress.com/terms.php).
spellingShingle Original Research
Liu, Jie
Wang, Xiaodong
Lin, Junhua
Li, Shaohua
Deng, Guoxiong
Wei, Jinru
Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells
title Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells
title_full Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells
title_fullStr Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells
title_full_unstemmed Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells
title_short Classifiers for Predicting Coronary Artery Disease Based on Gene Expression Profiles in Peripheral Blood Mononuclear Cells
title_sort classifiers for predicting coronary artery disease based on gene expression profiles in peripheral blood mononuclear cells
topic Original Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8450378/
https://www.ncbi.nlm.nih.gov/pubmed/34552349
http://dx.doi.org/10.2147/IJGM.S329005
work_keys_str_mv AT liujie classifiersforpredictingcoronaryarterydiseasebasedongeneexpressionprofilesinperipheralbloodmononuclearcells
AT wangxiaodong classifiersforpredictingcoronaryarterydiseasebasedongeneexpressionprofilesinperipheralbloodmononuclearcells
AT linjunhua classifiersforpredictingcoronaryarterydiseasebasedongeneexpressionprofilesinperipheralbloodmononuclearcells
AT lishaohua classifiersforpredictingcoronaryarterydiseasebasedongeneexpressionprofilesinperipheralbloodmononuclearcells
AT dengguoxiong classifiersforpredictingcoronaryarterydiseasebasedongeneexpressionprofilesinperipheralbloodmononuclearcells
AT weijinru classifiersforpredictingcoronaryarterydiseasebasedongeneexpressionprofilesinperipheralbloodmononuclearcells