Cargando…
A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm
BACKGROUND: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed. MATERIALS AND METHODS: Gene expression data of L...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7669350/ https://www.ncbi.nlm.nih.gov/pubmed/33224985 http://dx.doi.org/10.1155/2020/9124792 |
_version_ | 1783610550639394816 |
---|---|
author | Xu, Shaohua Zhou, Jie Liu, Kai Chen, Zhoumiao He, Zhengfu |
author_facet | Xu, Shaohua Zhou, Jie Liu, Kai Chen, Zhoumiao He, Zhengfu |
author_sort | Xu, Shaohua |
collection | PubMed |
description | BACKGROUND: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed. MATERIALS AND METHODS: Gene expression data of LUAD were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Differentially expressed genes (DEGs) were calculated by comparing the recurrent and primary tissues. Prognostic genes associated with the recurrence-free survival (RFS) of LUAD patients were identified using univariate analysis. LASSO Cox regression and multivariate Cox analysis were applied to extract key genes and establish the prediction model. RESULTS: We detected 37 DEGs between primary and recurrent LUAD tumors. Using univariate analysis, 31 DEGs were found to be significantly associated with RFS. We established the RFS prediction model including thirteen genes using the LASSO Cox regression. In the training cohort, we classified patients into high- and low-risk groups and found that patients in the high-risk group suffered from worse RFS compared to those in the low-risk group (P < 0.01). Concordant results were confirmed in the internal and external validation cohort. The efficiency of the prediction model was also confirmed under different clinical subgroups. The high-risk group was significantly identified as the risk factor of recurrence in LUAD by the multivariate Cox analysis (HR = 13.37, P = 0.01). Compared to clinicopathological features, our prediction model possessed higher accuracy to identify patients with high risk of recurrence (AUC = 96.3%). Finally, we found that the G2M checkpoint pathway was enriched both in recurrent tumors and primary tumors of high-risk patients. CONCLUSIONS: Our recurrence-specific gene-based prognostic prediction model provides extra information about the risk of recurrence in LUAD, which is conducive for clinicians to conduct individualized therapy in clinic. |
format | Online Article Text |
id | pubmed-7669350 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-76693502020-11-19 A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm Xu, Shaohua Zhou, Jie Liu, Kai Chen, Zhoumiao He, Zhengfu Biomed Res Int Research Article BACKGROUND: After curative surgical resection, about 30-75% lung adenocarcinoma (LUAD) patients suffer from recurrence with dismal survival outcomes. Identification of patients with high risk of recurrence to impose intense therapy is urgently needed. MATERIALS AND METHODS: Gene expression data of LUAD were obtained from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) databases. Differentially expressed genes (DEGs) were calculated by comparing the recurrent and primary tissues. Prognostic genes associated with the recurrence-free survival (RFS) of LUAD patients were identified using univariate analysis. LASSO Cox regression and multivariate Cox analysis were applied to extract key genes and establish the prediction model. RESULTS: We detected 37 DEGs between primary and recurrent LUAD tumors. Using univariate analysis, 31 DEGs were found to be significantly associated with RFS. We established the RFS prediction model including thirteen genes using the LASSO Cox regression. In the training cohort, we classified patients into high- and low-risk groups and found that patients in the high-risk group suffered from worse RFS compared to those in the low-risk group (P < 0.01). Concordant results were confirmed in the internal and external validation cohort. The efficiency of the prediction model was also confirmed under different clinical subgroups. The high-risk group was significantly identified as the risk factor of recurrence in LUAD by the multivariate Cox analysis (HR = 13.37, P = 0.01). Compared to clinicopathological features, our prediction model possessed higher accuracy to identify patients with high risk of recurrence (AUC = 96.3%). Finally, we found that the G2M checkpoint pathway was enriched both in recurrent tumors and primary tumors of high-risk patients. CONCLUSIONS: Our recurrence-specific gene-based prognostic prediction model provides extra information about the risk of recurrence in LUAD, which is conducive for clinicians to conduct individualized therapy in clinic. Hindawi 2020-11-07 /pmc/articles/PMC7669350/ /pubmed/33224985 http://dx.doi.org/10.1155/2020/9124792 Text en Copyright © 2020 Shaohua Xu et al. https://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Xu, Shaohua Zhou, Jie Liu, Kai Chen, Zhoumiao He, Zhengfu A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm |
title | A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm |
title_full | A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm |
title_fullStr | A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm |
title_full_unstemmed | A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm |
title_short | A Recurrence-Specific Gene-Based Prognosis Prediction Model for Lung Adenocarcinoma through Machine Learning Algorithm |
title_sort | recurrence-specific gene-based prognosis prediction model for lung adenocarcinoma through machine learning algorithm |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7669350/ https://www.ncbi.nlm.nih.gov/pubmed/33224985 http://dx.doi.org/10.1155/2020/9124792 |
work_keys_str_mv | AT xushaohua arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT zhoujie arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT liukai arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT chenzhoumiao arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT hezhengfu arecurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT xushaohua recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT zhoujie recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT liukai recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT chenzhoumiao recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm AT hezhengfu recurrencespecificgenebasedprognosispredictionmodelforlungadenocarcinomathroughmachinelearningalgorithm |