Cargando…
Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure
The etiologies and pathogenesis of dilated cardiomyopathy (DCM) with heart failure (HF) remain to be defined. Thus, exploring specific diagnosis biomarkers and mechanisms is urgently needed to improve this situation. In this study, three gene expression profiling datasets (GSE29819, GSE21610, GSE178...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760806/ https://www.ncbi.nlm.nih.gov/pubmed/36544902 http://dx.doi.org/10.3389/fcell.2022.1089915 |
_version_ | 1784852562404442112 |
---|---|
author | Zhu, Yihao Yang, Xiaojing Zu, Yao |
author_facet | Zhu, Yihao Yang, Xiaojing Zu, Yao |
author_sort | Zhu, Yihao |
collection | PubMed |
description | The etiologies and pathogenesis of dilated cardiomyopathy (DCM) with heart failure (HF) remain to be defined. Thus, exploring specific diagnosis biomarkers and mechanisms is urgently needed to improve this situation. In this study, three gene expression profiling datasets (GSE29819, GSE21610, GSE17800) and one single-cell RNA sequencing dataset (GSE95140) were obtained from the Gene Expression Omnibus (GEO) database. GSE29819 and GSE21610 were combined into the training group, while GSE17800 was the test group. We used the weighted gene co-expression network analysis (WGCNA) and identified fifteen driver genes highly associated with DCM with HF in the module. We performed the least absolute shrinkage and selection operator (LASSO) on the driver genes and then constructed five machine learning classifiers (random forest, gradient boosting machine, neural network, eXtreme gradient boosting, and support vector machine). Random forest was the best-performing classifier established on five Lasso-selected genes, which was utilized to select out NPPA, OMD, and PRELP for diagnosing DCM with HF. Moreover, we observed the up-regulation mRNA levels and robust diagnostic accuracies of NPPA, OMD, and PRELP in the training group and test group. Single-cell RNA-seq analysis further demonstrated their stable up-regulation expression patterns in various cardiomyocytes of DCM patients. Besides, through gene set enrichment analysis (GSEA), we found TGF-β signaling pathway, correlated with NPPA, OMD, and PRELP, was the underlying mechanism of DCM with HF. Overall, our study revealed NPPA, OMD, and PRELP serving as diagnostic biomarkers for DCM with HF, deepening the understanding of its pathogenesis. |
format | Online Article Text |
id | pubmed-9760806 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-97608062022-12-20 Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure Zhu, Yihao Yang, Xiaojing Zu, Yao Front Cell Dev Biol Cell and Developmental Biology The etiologies and pathogenesis of dilated cardiomyopathy (DCM) with heart failure (HF) remain to be defined. Thus, exploring specific diagnosis biomarkers and mechanisms is urgently needed to improve this situation. In this study, three gene expression profiling datasets (GSE29819, GSE21610, GSE17800) and one single-cell RNA sequencing dataset (GSE95140) were obtained from the Gene Expression Omnibus (GEO) database. GSE29819 and GSE21610 were combined into the training group, while GSE17800 was the test group. We used the weighted gene co-expression network analysis (WGCNA) and identified fifteen driver genes highly associated with DCM with HF in the module. We performed the least absolute shrinkage and selection operator (LASSO) on the driver genes and then constructed five machine learning classifiers (random forest, gradient boosting machine, neural network, eXtreme gradient boosting, and support vector machine). Random forest was the best-performing classifier established on five Lasso-selected genes, which was utilized to select out NPPA, OMD, and PRELP for diagnosing DCM with HF. Moreover, we observed the up-regulation mRNA levels and robust diagnostic accuracies of NPPA, OMD, and PRELP in the training group and test group. Single-cell RNA-seq analysis further demonstrated their stable up-regulation expression patterns in various cardiomyocytes of DCM patients. Besides, through gene set enrichment analysis (GSEA), we found TGF-β signaling pathway, correlated with NPPA, OMD, and PRELP, was the underlying mechanism of DCM with HF. Overall, our study revealed NPPA, OMD, and PRELP serving as diagnostic biomarkers for DCM with HF, deepening the understanding of its pathogenesis. Frontiers Media S.A. 2022-12-05 /pmc/articles/PMC9760806/ /pubmed/36544902 http://dx.doi.org/10.3389/fcell.2022.1089915 Text en Copyright © 2022 Zhu, Yang and Zu. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Cell and Developmental Biology Zhu, Yihao Yang, Xiaojing Zu, Yao Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
title | Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
title_full | Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
title_fullStr | Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
title_full_unstemmed | Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
title_short | Integrated analysis of WGCNA and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
title_sort | integrated analysis of wgcna and machine learning identified diagnostic biomarkers in dilated cardiomyopathy with heart failure |
topic | Cell and Developmental Biology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9760806/ https://www.ncbi.nlm.nih.gov/pubmed/36544902 http://dx.doi.org/10.3389/fcell.2022.1089915 |
work_keys_str_mv | AT zhuyihao integratedanalysisofwgcnaandmachinelearningidentifieddiagnosticbiomarkersindilatedcardiomyopathywithheartfailure AT yangxiaojing integratedanalysisofwgcnaandmachinelearningidentifieddiagnosticbiomarkersindilatedcardiomyopathywithheartfailure AT zuyao integratedanalysisofwgcnaandmachinelearningidentifieddiagnosticbiomarkersindilatedcardiomyopathywithheartfailure |