Cargando…

A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm

BACKGROUND: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed. MATERIALS AND METHODS: Gene expression data of L...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Shaohua, Zhou, Jie, Liu, Kai, Chen, Zhoumiao, He, Zhengfu
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7669350/
https://www.ncbi.nlm.nih.gov/pubmed/33224985
http://dx.doi.org/10.1155/2020/9124792
_version_ 1783610550639394816
author Xu, Shaohua
Zhou, Jie
Liu, Kai
Chen, Zhoumiao
He, Zhengfu
author_facet Xu, Shaohua
Zhou, Jie
Liu, Kai
Chen, Zhoumiao
He, Zhengfu
author_sort Xu, Shaohua
collection PubMed
description BACKGROUND: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed. MATERIALS AND METHODS: Gene expression data of LUAD were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Differentially expressed genes (DEGs) were calculated by comparing the recurrent and primary tissues. Prognostic genes associated with the recurrence-free survival (RFS) of LUAD patients were identified using univariate analysis. LASSO Cox regression and multivariate Cox analysis were applied to extract key genes and establish the prediction model. RESULTS: We detected 37 DEGs between primary and recurrent LUAD tumors. Using univariate analysis, 31 DEGs were found to be significantly associated with RFS. We established the RFS prediction model including thirteen genes using the LASSO Cox regression. In the training cohort, we classified patients into high- and low-risk groups and found that patients in the high-risk group suffered from worse RFS compared to those in the low-risk group (P < 0.01). Concordant results were confirmed in the internal and external validation cohort. The efficiency of the prediction model was also confirmed under different clinical subgroups. The high-risk group was significantly identified as the risk factor of recurrence in LUAD by the multivariate Cox analysis (HR = 13.37, P = 0.01). Compared to clinicopathological features, our prediction model possessed higher accuracy to identify patients with high risk of recurrence (AUC = 96.3%). Finally, we found that the G2M checkpoint pathway was enriched both in recurrent tumors and primary tumors of high-risk patients. CONCLUSIONS: Our recurrence-specific gene-based prognostic prediction model provides extra information about the risk of recurrence in LUAD, which is conducive for clinicians to conduct individualized therapy in clinic.
format Online
Article
Text
id pubmed-7669350
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-76693502020-11-19 A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm Xu, Shaohua Zhou, Jie Liu, Kai Chen, Zhoumiao He, Zhengfu Biomed Res Int Research Article BACKGROUND: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed. MATERIALS AND METHODS: Gene expression data of LUAD were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Differentially expressed genes (DEGs) were calculated by comparing the recurrent and primary tissues. Prognostic genes associated with the recurrence-free survival (RFS) of LUAD patients were identified using univariate analysis. LASSO Cox regression and multivariate Cox analysis were applied to extract key genes and establish the prediction model. RESULTS: We detected 37 DEGs between primary and recurrent LUAD tumors. Using univariate analysis, 31 DEGs were found to be significantly associated with RFS. We established the RFS prediction model including thirteen genes using the LASSO Cox regression. In the training cohort, we classified patients into high- and low-risk groups and found that patients in the high-risk group suffered from worse RFS compared to those in the low-risk group (P < 0.01). Concordant results were confirmed in the internal and external validation cohort. The efficiency of the prediction model was also confirmed under different clinical subgroups. The high-risk group was significantly identified as the risk factor of recurrence in LUAD by the multivariate Cox analysis (HR = 13.37, P = 0.01). Compared to clinicopathological features, our prediction model possessed higher accuracy to identify patients with high risk of recurrence (AUC = 96.3%). Finally, we found that the G2M checkpoint pathway was enriched both in recurrent tumors and primary tumors of high-risk patients. CONCLUSIONS: Our recurrence-specific gene-based prognostic prediction model provides extra information about the risk of recurrence in LUAD, which is conducive for clinicians to conduct individualized therapy in clinic. Hindawi 2020-11-07 /pmc/articles/PMC7669350/ /pubmed/33224985 http://dx.doi.org/10.1155/2020/9124792 Text en Copyright © 2020 Shaohua Xu et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Xu, Shaohua
Zhou, Jie
Liu, Kai
Chen, Zhoumiao
He, Zhengfu
A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
title A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
title_full A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
title_fullStr A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
title_full_unstemmed A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
title_short A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
title_sort recurrence-specific gene-based prognosis prediction model for lung adenocarcinoma through machine learning algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7669350/
https://www.ncbi.nlm.nih.gov/pubmed/33224985
http://dx.doi.org/10.1155/2020/9124792
work_keys_str_mv AT xushaohua arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT zhoujie arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT liukai arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT chenzhoumiao arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT hezhengfu arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT xushaohua recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT zhoujie recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT liukai recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT chenzhoumiao recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm
AT hezhengfu recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm