Cargando…
Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression
While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search k...
Autores principales: | , , , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536257/ https://www.ncbi.nlm.nih.gov/pubmed/36213445 http://dx.doi.org/10.3389/fncom.2022.1001546 |
_version_ | 1784802945277100032 |
---|---|
author | Sun, Hongyu Yang, Jin Li, Xiaohui Lyu, Yi Xu, Zhaomeng He, Hui Tong, Xiaomin Ji, Tingyu Ding, Shihan Zhou, Chaoli Han, Pengyong Zheng, Jinping |
author_facet | Sun, Hongyu Yang, Jin Li, Xiaohui Lyu, Yi Xu, Zhaomeng He, Hui Tong, Xiaomin Ji, Tingyu Ding, Shihan Zhou, Chaoli Han, Pengyong Zheng, Jinping |
author_sort | Sun, Hongyu |
collection | PubMed |
description | While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search keywords, Homo sapiens for species selection, and established a sample size of > 20 for each data set, and each data set contains Including the normal group and AD group. The datasets GSE15222 and GSE138260 were combined as a training group to build a model, and GSE122063 was used as a test group to verify the model's accuracy. The genes with differential expression found in the combined datasets were used for analysis through Gene Ontology (GO) and The Kyoto Encyclopedia of Genes and Genome Pathways (KEGG). Then, AD-related module genes were identified using the combined dataset through a weighted gene co-expression network analysis (WGCNA). Both the differential and AD-related module genes were intersected to obtain AD key genes. These genes were first filtered through LASSO regression and then AD-related feature genes were obtained for subsequent immune-related analysis. A comprehensive analysis of three AD-related datasets in the GEO database revealed 111 common differential AD genes. In the GO analysis, the more prominent terms were cognition and learning or memory. The KEGG analysis showed that these differential genes were enriched not only in In the KEGG analysis, but also in three other pathways: neuroactive ligand-receptor interaction, cAMP signaling pathway, and Calcium signaling pathway. Three AD-related feature genes (SST, MLIP, HSPB3) were finally identified. The area under the ROC curve of these AD-related feature genes was greater than 0.7 in both the training and the test groups. Finally, an immune-related analysis of these genes was performed. The finding of AD-related feature genes (SST, MLIP, HSPB3) could help predict the onset and progression of the disease. Overall, our study may provide significant guidance for further exploration of potential biomarkers for the diagnosis and prediction of AD. |
format | Online Article Text |
id | pubmed-9536257 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-95362572022-10-07 Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression Sun, Hongyu Yang, Jin Li, Xiaohui Lyu, Yi Xu, Zhaomeng He, Hui Tong, Xiaomin Ji, Tingyu Ding, Shihan Zhou, Chaoli Han, Pengyong Zheng, Jinping Front Comput Neurosci Neuroscience While Alzheimer's disease (AD) can cause a severe economic burden, the specific pathogenesis involved is yet to be elucidated. To identify feature genes associated with AD, we downloaded data from three GEO databases: GSE122063, GSE15222, and GSE138260. In the filtering, we used AD for search keywords, Homo sapiens for species selection, and established a sample size of > 20 for each data set, and each data set contains Including the normal group and AD group. The datasets GSE15222 and GSE138260 were combined as a training group to build a model, and GSE122063 was used as a test group to verify the model's accuracy. The genes with differential expression found in the combined datasets were used for analysis through Gene Ontology (GO) and The Kyoto Encyclopedia of Genes and Genome Pathways (KEGG). Then, AD-related module genes were identified using the combined dataset through a weighted gene co-expression network analysis (WGCNA). Both the differential and AD-related module genes were intersected to obtain AD key genes. These genes were first filtered through LASSO regression and then AD-related feature genes were obtained for subsequent immune-related analysis. A comprehensive analysis of three AD-related datasets in the GEO database revealed 111 common differential AD genes. In the GO analysis, the more prominent terms were cognition and learning or memory. The KEGG analysis showed that these differential genes were enriched not only in In the KEGG analysis, but also in three other pathways: neuroactive ligand-receptor interaction, cAMP signaling pathway, and Calcium signaling pathway. Three AD-related feature genes (SST, MLIP, HSPB3) were finally identified. The area under the ROC curve of these AD-related feature genes was greater than 0.7 in both the training and the test groups. Finally, an immune-related analysis of these genes was performed. The finding of AD-related feature genes (SST, MLIP, HSPB3) could help predict the onset and progression of the disease. Overall, our study may provide significant guidance for further exploration of potential biomarkers for the diagnosis and prediction of AD. Frontiers Media S.A. 2022-09-21 /pmc/articles/PMC9536257/ /pubmed/36213445 http://dx.doi.org/10.3389/fncom.2022.1001546 Text en Copyright © 2022 Sun, Yang, Li, Lyu, Xu, He, Tong, Ji, Ding, Zhou, Han and Zheng. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Neuroscience Sun, Hongyu Yang, Jin Li, Xiaohui Lyu, Yi Xu, Zhaomeng He, Hui Tong, Xiaomin Ji, Tingyu Ding, Shihan Zhou, Chaoli Han, Pengyong Zheng, Jinping Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression |
title | Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression |
title_full | Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression |
title_fullStr | Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression |
title_full_unstemmed | Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression |
title_short | Identification of feature genes and pathways for Alzheimer's disease via WGCNA and LASSO regression |
title_sort | identification of feature genes and pathways for alzheimer's disease via wgcna and lasso regression |
topic | Neuroscience |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9536257/ https://www.ncbi.nlm.nih.gov/pubmed/36213445 http://dx.doi.org/10.3389/fncom.2022.1001546 |
work_keys_str_mv | AT sunhongyu identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT yangjin identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT lixiaohui identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT lyuyi identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT xuzhaomeng identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT hehui identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT tongxiaomin identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT jitingyu identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT dingshihan identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT zhouchaoli identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT hanpengyong identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression AT zhengjinping identificationoffeaturegenesandpathwaysforalzheimersdiseaseviawgcnaandlassoregression |