Cargando…

Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma

BACKGROUND: A risk assessment model for prognostic prediction of colon adenocarcinoma (COAD) was established based on weighted gene co-expression network analysis (WGCNA). METHODS: From the Cancer Genome Atlas (TCGA) database, RNA-seq data and clinical data of COAD patients were retrieved. After scr...

Descripción completa

Detalles Bibliográficos
Autores principales: Yang, Mian, He, Haibin, Peng, Tao, Lu, Yi, Yu, Jiazi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8983226/
https://www.ncbi.nlm.nih.gov/pubmed/35392038
http://dx.doi.org/10.1155/2022/8598046
_version_ 1784681939084509184
author Yang, Mian
He, Haibin
Peng, Tao
Lu, Yi
Yu, Jiazi
author_facet Yang, Mian
He, Haibin
Peng, Tao
Lu, Yi
Yu, Jiazi
author_sort Yang, Mian
collection PubMed
description BACKGROUND: A risk assessment model for prognostic prediction of colon adenocarcinoma (COAD) was established based on weighted gene co-expression network analysis (WGCNA). METHODS: From the Cancer Genome Atlas (TCGA) database, RNA-seq data and clinical data of COAD patients were retrieved. After screening of differentially expressed genes (DEGs), WGCNA was performed to identify gene modules and screen those associated with COAD progression. Then, via protein-protein interaction (PPI) network construction of module genes, hub genes were obtained, which were then subjected to the least absolute shrinkage and selection operator (LASSO) and Cox regression to build a hub gene-based prognostic scoring model. The receiver operating characteristic curve (ROC curve) was plotted for the optimal cutoff (OCO) of the risk score, based on which, patients were assigned to high or low-risk groups. Areas under the ROC curve (AUCs) were calculated, and model performance was visualized using Kaplan–Meier (KM) survival curves and verified in the external dataset GSE29621. Finally, the model's independent prognostic value was evaluated by univariate and multivariate Cox regression analyses, and a nomogram was built. RESULTS: Totally 2840 DEGs were screened from COAD dataset of TCGA, including 1401 upregulated ones and 1439 downregulated ones, which were divided into 10 modules by WGCNA. The eigenvalue of the black module was found to have a high correlation with COAD progression. PPI interaction networks were constructed for genes in the black module, and 34 hub genes were obtained by using the MCODE plug-in. A LASSO-Cox regression approach was utilized to analyze the hub genes, and a prognostic risk score model based on the signatures of 9 genes (CHEK1, DEPDC1B, FANCI, MCM10, NCAPG, PARPBP, PLK4, RAD51AP1, and RFC4) was constructed. KM analysis identified shorter overall lower survival in the high-risk group. The model was verified to have favorable predictive ability through training set and validation set. The nomogram, composed of tumor node metastasis (TNM) staging and risk score, was of good predictability. CONCLUSIONS: The COAD prognostic risk model constructed upon the signatures of 9 genes (CHEK1, DEPDC1B, FANCI, MCM10, NCAPG, PARPBP, PLK4, RAD51AP1, and RFC4) can effectively predict the survival status of COAD patients.
format Online
Article
Text
id pubmed-8983226
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-89832262022-04-06 Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma Yang, Mian He, Haibin Peng, Tao Lu, Yi Yu, Jiazi Comput Intell Neurosci Research Article BACKGROUND: A risk assessment model for prognostic prediction of colon adenocarcinoma (COAD) was established based on weighted gene co-expression network analysis (WGCNA). METHODS: From the Cancer Genome Atlas (TCGA) database, RNA-seq data and clinical data of COAD patients were retrieved. After screening of differentially expressed genes (DEGs), WGCNA was performed to identify gene modules and screen those associated with COAD progression. Then, via protein-protein interaction (PPI) network construction of module genes, hub genes were obtained, which were then subjected to the least absolute shrinkage and selection operator (LASSO) and Cox regression to build a hub gene-based prognostic scoring model. The receiver operating characteristic curve (ROC curve) was plotted for the optimal cutoff (OCO) of the risk score, based on which, patients were assigned to high or low-risk groups. Areas under the ROC curve (AUCs) were calculated, and model performance was visualized using Kaplan–Meier (KM) survival curves and verified in the external dataset GSE29621. Finally, the model's independent prognostic value was evaluated by univariate and multivariate Cox regression analyses, and a nomogram was built. RESULTS: Totally 2840 DEGs were screened from COAD dataset of TCGA, including 1401 upregulated ones and 1439 downregulated ones, which were divided into 10 modules by WGCNA. The eigenvalue of the black module was found to have a high correlation with COAD progression. PPI interaction networks were constructed for genes in the black module, and 34 hub genes were obtained by using the MCODE plug-in. A LASSO-Cox regression approach was utilized to analyze the hub genes, and a prognostic risk score model based on the signatures of 9 genes (CHEK1, DEPDC1B, FANCI, MCM10, NCAPG, PARPBP, PLK4, RAD51AP1, and RFC4) was constructed. KM analysis identified shorter overall lower survival in the high-risk group. The model was verified to have favorable predictive ability through training set and validation set. The nomogram, composed of tumor node metastasis (TNM) staging and risk score, was of good predictability. CONCLUSIONS: The COAD prognostic risk model constructed upon the signatures of 9 genes (CHEK1, DEPDC1B, FANCI, MCM10, NCAPG, PARPBP, PLK4, RAD51AP1, and RFC4) can effectively predict the survival status of COAD patients. Hindawi 2022-03-29 /pmc/articles/PMC8983226/ /pubmed/35392038 http://dx.doi.org/10.1155/2022/8598046 Text en Copyright © 2022 Mian Yang et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Yang, Mian
He, Haibin
Peng, Tao
Lu, Yi
Yu, Jiazi
Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma
title Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma
title_full Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma
title_fullStr Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma
title_full_unstemmed Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma
title_short Identification of 9 Gene Signatures by WGCNA to Predict Prognosis for Colon Adenocarcinoma
title_sort identification of 9 gene signatures by wgcna to predict prognosis for colon adenocarcinoma
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8983226/
https://www.ncbi.nlm.nih.gov/pubmed/35392038
http://dx.doi.org/10.1155/2022/8598046
work_keys_str_mv AT yangmian identificationof9genesignaturesbywgcnatopredictprognosisforcolonadenocarcinoma
AT hehaibin identificationof9genesignaturesbywgcnatopredictprognosisforcolonadenocarcinoma
AT pengtao identificationof9genesignaturesbywgcnatopredictprognosisforcolonadenocarcinoma
AT luyi identificationof9genesignaturesbywgcnatopredictprognosisforcolonadenocarcinoma
AT yujiazi identificationof9genesignaturesbywgcnatopredictprognosisforcolonadenocarcinoma