Cargando…

Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression

While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search k...

Descripción completa

Detalles Bibliográficos
Autores principales: Sun, Hongyu, Yang, Jin, Li, Xiaohui, Lyu, Yi, Xu, Zhaomeng, He, Hui, Tong, Xiaomin, Ji, Tingyu, Ding, Shihan, Zhou, Chaoli, Han, Pengyong, Zheng, Jinping
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536257/
https://www.ncbi.nlm.nih.gov/pubmed/36213445
http://dx.doi.org/10.3389/fncom.2022.1001546
_version_ 1784802945277100032
author Sun, Hongyu
Yang, Jin
Li, Xiaohui
Lyu, Yi
Xu, Zhaomeng
He, Hui
Tong, Xiaomin
Ji, Tingyu
Ding, Shihan
Zhou, Chaoli
Han, Pengyong
Zheng, Jinping
author_facet Sun, Hongyu
Yang, Jin
Li, Xiaohui
Lyu, Yi
Xu, Zhaomeng
He, Hui
Tong, Xiaomin
Ji, Tingyu
Ding, Shihan
Zhou, Chaoli
Han, Pengyong
Zheng, Jinping
author_sort Sun, Hongyu
collection PubMed
description While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search keywords, Homo sapiens for species selection, and established a sample size of > 20 for each data set, and each data set contains Including the normal group and AD group. The datasets GSE15222 and GSE138260 were combined as a training group to build a model, and GSE122063 was used as a test group to verify the model's accuracy. The genes with differential expression found in the combined datasets were used for analysis through Gene Ontology (GO) and The Kyoto Encyclopedia of Genes and Genome Pathways (KEGG). Then, AD-related module genes were identified using the combined dataset through a weighted gene co-expression network analysis (WGCNA). Both the differential and AD-related module genes were intersected to obtain AD key genes. These genes were first filtered through LASSO regression and then AD-related feature genes were obtained for subsequent immune-related analysis. A comprehensive analysis of three AD-related datasets in the GEO database revealed 111 common differential AD genes. In the GO analysis, the more prominent terms were cognition and learning or memory. The KEGG analysis showed that these differential genes were enriched not only in In the KEGG analysis, but also in three other pathways: neuroactive ligand-receptor interaction, cAMP signaling pathway, and Calcium signaling pathway. Three AD-related feature genes (SST, MLIP, HSPB3) were finally identified. The area under the ROC curve of these AD-related feature genes was greater than 0.7 in both the training and the test groups. Finally, an immune-related analysis of these genes was performed. The finding of AD-related feature genes (SST, MLIP, HSPB3) could help predict the onset and progression of the disease. Overall, our study may provide significant guidance for further exploration of potential biomarkers for the diagnosis and prediction of AD.
format Online
Article
Text
id pubmed-9536257
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-95362572022-10-07 Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression Sun, Hongyu Yang, Jin Li, Xiaohui Lyu, Yi Xu, Zhaomeng He, Hui Tong, Xiaomin Ji, Tingyu Ding, Shihan Zhou, Chaoli Han, Pengyong Zheng, Jinping Front Comput Neurosci Neuroscience While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search keywords, Homo sapiens for species selection, and established a sample size of > 20 for each data set, and each data set contains Including the normal group and AD group. The datasets GSE15222 and GSE138260 were combined as a training group to build a model, and GSE122063 was used as a test group to verify the model's accuracy. The genes with differential expression found in the combined datasets were used for analysis through Gene Ontology (GO) and The Kyoto Encyclopedia of Genes and Genome Pathways (KEGG). Then, AD-related module genes were identified using the combined dataset through a weighted gene co-expression network analysis (WGCNA). Both the differential and AD-related module genes were intersected to obtain AD key genes. These genes were first filtered through LASSO regression and then AD-related feature genes were obtained for subsequent immune-related analysis. A comprehensive analysis of three AD-related datasets in the GEO database revealed 111 common differential AD genes. In the GO analysis, the more prominent terms were cognition and learning or memory. The KEGG analysis showed that these differential genes were enriched not only in In the KEGG analysis, but also in three other pathways: neuroactive ligand-receptor interaction, cAMP signaling pathway, and Calcium signaling pathway. Three AD-related feature genes (SST, MLIP, HSPB3) were finally identified. The area under the ROC curve of these AD-related feature genes was greater than 0.7 in both the training and the test groups. Finally, an immune-related analysis of these genes was performed. The finding of AD-related feature genes (SST, MLIP, HSPB3) could help predict the onset and progression of the disease. Overall, our study may provide significant guidance for further exploration of potential biomarkers for the diagnosis and prediction of AD. Frontiers Media S.A. 2022-09-21 /pmc/articles/PMC9536257/ /pubmed/36213445 http://dx.doi.org/10.3389/fncom.2022.1001546 Text en Copyright © 2022 Sun, Yang, Li, Lyu, Xu, He, Tong, Ji, Ding, Zhou, Han and Zheng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Neuroscience
Sun, Hongyu
Yang, Jin
Li, Xiaohui
Lyu, Yi
Xu, Zhaomeng
He, Hui
Tong, Xiaomin
Ji, Tingyu
Ding, Shihan
Zhou, Chaoli
Han, Pengyong
Zheng, Jinping
Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
title Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
title_full Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
title_fullStr Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
title_full_unstemmed Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
title_short Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
title_sort identification of feature genes and pathways for alzheimer's disease via wgcna and lasso regression
topic Neuroscience
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536257/
https://www.ncbi.nlm.nih.gov/pubmed/36213445
http://dx.doi.org/10.3389/fncom.2022.1001546
work_keys_str_mv AT sunhongyu identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT yangjin identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT lixiaohui identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT lyuyi identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT xuzhaomeng identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT hehui identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT tongxiaomin identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT jitingyu identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT dingshihan identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT zhouchaoli identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT hanpengyong identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression
AT zhengjinping identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression