Cargando…

Machine learning algorithms assisted identification of post-stroke depression associated biological features

OBJECTIVES: Post-stroke depression (PSD) is a common and serious psychiatric complication which hinders functional recovery and social participation of stroke patients. Stroke is characterized by dynamic changes in metabolism and hemodynamics, however, there is still a lack of metabolism-associated...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Xintong, Wang, Xiangyu, Wang, Shuwei, Zhang, Yingjie, Wang, Zeyu, Yang, Qingyan, Wang, Song, Cao, Risheng, Yu, Binbin, Zheng, Yu, Dang, Yini
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10030717/
https://www.ncbi.nlm.nih.gov/pubmed/36968495
http://dx.doi.org/10.3389/fnins.2023.1146620
_version_ 1784910441273622528
author Zhang, Xintong
Wang, Xiangyu
Wang, Shuwei
Zhang, Yingjie
Wang, Zeyu
Yang, Qingyan
Wang, Song
Cao, Risheng
Yu, Binbin
Zheng, Yu
Dang, Yini
author_facet Zhang, Xintong
Wang, Xiangyu
Wang, Shuwei
Zhang, Yingjie
Wang, Zeyu
Yang, Qingyan
Wang, Song
Cao, Risheng
Yu, Binbin
Zheng, Yu
Dang, Yini
author_sort Zhang, Xintong
collection PubMed
description OBJECTIVES: Post-stroke depression (PSD) is a common and serious psychiatric complication which hinders functional recovery and social participation of stroke patients. Stroke is characterized by dynamic changes in metabolism and hemodynamics, however, there is still a lack of metabolism-associated effective and reliable diagnostic markers and therapeutic targets for PSD. Our study was dedicated to the discovery of metabolism related diagnostic and therapeutic biomarkers for PSD. METHODS: Expression profiles of GSE140275, GSE122709, and GSE180470 were obtained from GEO database. Differentially expressed genes (DEGs) were detected in GSE140275 and GSE122709. Functional enrichment analysis was performed for DEGs in GSE140275. Weighted gene co-expression network analysis (WGCNA) was constructed in GSE122709 to identify key module genes. Moreover, correlation analysis was performed to obtain metabolism related genes. Interaction analysis of key module genes, metabolism related genes, and DEGs in GSE122709 was performed to obtain candidate hub genes. Two machine learning algorithms, least absolute shrinkage and selection operator (LASSO) and random forest, were used to identify signature genes. Expression of signature genes was validated in GSE140275, GSE122709, and GSE180470. Gene set enrichment analysis (GSEA) was applied on signature genes. Based on signature genes, a nomogram model was constructed in our PSD cohort (27 PSD patients vs. 54 controls). ROC curves were performed for the estimation of its diagnostic value. Finally, correlation analysis between expression of signature genes and several clinical traits was performed. RESULTS: Functional enrichment analysis indicated that DEGs in GSE140275 enriched in metabolism pathway. A total of 8,188 metabolism associated genes were identified by correlation analysis. WGCNA analysis was constructed to obtain 3,471 key module genes. A total of 557 candidate hub genes were identified by interaction analysis. Furthermore, two signature genes (SDHD and FERMT3) were selected using LASSO and random forest analysis. GSEA analysis found that two signature genes had major roles in depression. Subsequently, PSD cohort was collected for constructing a PSD diagnosis. Nomogram model showed good reliability and validity. AUC values of receiver operating characteristic (ROC) curve of SDHD and FERMT3 were 0.896 and 0.964. ROC curves showed that two signature genes played a significant role in diagnosis of PSD. Correlation analysis found that SDHD (r = 0.653, P < 0.001) and FERM3 (r = 0.728, P < 0.001) were positively related to the Hamilton Depression Rating Scale 17-item (HAMD) score. CONCLUSION: A total of 557 metabolism associated candidate hub genes were obtained by interaction with DEGs in GSE122709, key modules genes, and metabolism related genes. Based on machine learning algorithms, two signature genes (SDHD and FERMT3) were identified, they were proved to be valuable therapeutic and diagnostic biomarkers for PSD. Early diagnosis and prevention of PSD were made possible by our findings.
format Online
Article
Text
id pubmed-10030717
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-100307172023-03-23 Machine learning algorithms assisted identification of post-stroke depression associated biological features Zhang, Xintong Wang, Xiangyu Wang, Shuwei Zhang, Yingjie Wang, Zeyu Yang, Qingyan Wang, Song Cao, Risheng Yu, Binbin Zheng, Yu Dang, Yini Front Neurosci Neuroscience OBJECTIVES: Post-stroke depression (PSD) is a common and serious psychiatric complication which hinders functional recovery and social participation of stroke patients. Stroke is characterized by dynamic changes in metabolism and hemodynamics, however, there is still a lack of metabolism-associated effective and reliable diagnostic markers and therapeutic targets for PSD. Our study was dedicated to the discovery of metabolism related diagnostic and therapeutic biomarkers for PSD. METHODS: Expression profiles of GSE140275, GSE122709, and GSE180470 were obtained from GEO database. Differentially expressed genes (DEGs) were detected in GSE140275 and GSE122709. Functional enrichment analysis was performed for DEGs in GSE140275. Weighted gene co-expression network analysis (WGCNA) was constructed in GSE122709 to identify key module genes. Moreover, correlation analysis was performed to obtain metabolism related genes. Interaction analysis of key module genes, metabolism related genes, and DEGs in GSE122709 was performed to obtain candidate hub genes. Two machine learning algorithms, least absolute shrinkage and selection operator (LASSO) and random forest, were used to identify signature genes. Expression of signature genes was validated in GSE140275, GSE122709, and GSE180470. Gene set enrichment analysis (GSEA) was applied on signature genes. Based on signature genes, a nomogram model was constructed in our PSD cohort (27 PSD patients vs. 54 controls). ROC curves were performed for the estimation of its diagnostic value. Finally, correlation analysis between expression of signature genes and several clinical traits was performed. RESULTS: Functional enrichment analysis indicated that DEGs in GSE140275 enriched in metabolism pathway. A total of 8,188 metabolism associated genes were identified by correlation analysis. WGCNA analysis was constructed to obtain 3,471 key module genes. A total of 557 candidate hub genes were identified by interaction analysis. Furthermore, two signature genes (SDHD and FERMT3) were selected using LASSO and random forest analysis. GSEA analysis found that two signature genes had major roles in depression. Subsequently, PSD cohort was collected for constructing a PSD diagnosis. Nomogram model showed good reliability and validity. AUC values of receiver operating characteristic (ROC) curve of SDHD and FERMT3 were 0.896 and 0.964. ROC curves showed that two signature genes played a significant role in diagnosis of PSD. Correlation analysis found that SDHD (r = 0.653, P < 0.001) and FERM3 (r = 0.728, P < 0.001) were positively related to the Hamilton Depression Rating Scale 17-item (HAMD) score. CONCLUSION: A total of 557 metabolism associated candidate hub genes were obtained by interaction with DEGs in GSE122709, key modules genes, and metabolism related genes. Based on machine learning algorithms, two signature genes (SDHD and FERMT3) were identified, they were proved to be valuable therapeutic and diagnostic biomarkers for PSD. Early diagnosis and prevention of PSD were made possible by our findings. Frontiers Media S.A. 2023-03-08 /pmc/articles/PMC10030717/ /pubmed/36968495 http://dx.doi.org/10.3389/fnins.2023.1146620 Text en Copyright © 2023 Zhang, Wang, Wang, Zhang, Wang, Yang, Wang, Cao, Yu, Zheng and Dang. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Zhang, Xintong
Wang, Xiangyu
Wang, Shuwei
Zhang, Yingjie
Wang, Zeyu
Yang, Qingyan
Wang, Song
Cao, Risheng
Yu, Binbin
Zheng, Yu
Dang, Yini
Machine learning algorithms assisted identification of post-stroke depression associated biological features
title Machine learning algorithms assisted identification of post-stroke depression associated biological features
title_full Machine learning algorithms assisted identification of post-stroke depression associated biological features
title_fullStr Machine learning algorithms assisted identification of post-stroke depression associated biological features
title_full_unstemmed Machine learning algorithms assisted identification of post-stroke depression associated biological features
title_short Machine learning algorithms assisted identification of post-stroke depression associated biological features
title_sort machine learning algorithms assisted identification of post-stroke depression associated biological features
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10030717/
https://www.ncbi.nlm.nih.gov/pubmed/36968495
http://dx.doi.org/10.3389/fnins.2023.1146620
work_keys_str_mv AT zhangxintong machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT wangxiangyu machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT wangshuwei machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT zhangyingjie machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT wangzeyu machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT yangqingyan machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT wangsong machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT caorisheng machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT yubinbin machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT zhengyu machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures
AT dangyini machinelearningalgorithmsassistedidentificationofpoststrokedepressionassociatedbiologicalfeatures