Cargando…

Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm

BACKGROUND: Lymph node metastasis is an important route of lung cancer metastasis and can significantly affect the survival of lung cancer. METHODS: All the analysis was conducted out in the R software. Expression profile and clinical information of lung adenocarcinoma (LUAD) patients were downloade...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhou, Qian, Wang, Xianghui, Qian, Haiyun, Ma, Shengwei, Lei, Chenggang, Cui, Fenghe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581663/
https://www.ncbi.nlm.nih.gov/pubmed/36277017
http://dx.doi.org/10.1155/2022/1968829
_version_ 1784812675861053440
author Zhou, Qian
Wang, Xianghui
Qian, Haiyun
Ma, Shengwei
Lei, Chenggang
Cui, Fenghe
author_facet Zhou, Qian
Wang, Xianghui
Qian, Haiyun
Ma, Shengwei
Lei, Chenggang
Cui, Fenghe
author_sort Zhou, Qian
collection PubMed
description BACKGROUND: Lymph node metastasis is an important route of lung cancer metastasis and can significantly affect the survival of lung cancer. METHODS: All the analysis was conducted out in the R software. Expression profile and clinical information of lung adenocarcinoma (LUAD) patients were downloaded from The Cancer Genome Atlas database. RESULTS: In our study, we firstly identified the characteristic genes of lymph node metastasis in LUAD through two machine learning algorithms, least absolute shrinkage and selection operator (LASSO) logistic regression, and SVM-RFE algorithms. Ten characteristic genes were finally identified, including CRHR2, ITIH1, PRSS48, MAS1L, CYP4Z1, LMO1, TCP10L2, KRT78, IGFBP1, and PITX3. Next, we performed univariate Cox regression, LASSO regression, and multivariate Cox regression sequentially to construct a prognosis model based on MAS1L, TCP10L2, and CRHR2, which had a good prognosis prediction efficiency in both training and validation cohorts. Univariate and multivariate analysis indicated that our model is a risk factor independent of other clinical features. Pathway enrichment analysis showed that in the high-risk patients, the pathway of MYC target, unfolded protein response, interferon alpha response, DNA repair, reactive oxygen species pathway, and glycolysis were significantly enriched. Among three model genes, MAS1L aroused our interest and therefore was selected for further analysis. KM survival curves showed that the patients with higher MAS1L might have better disease-free survival and progression-free survival. Further, pathway enrichment, genomic instability, immune infiltration, and drug sensitivity analysis were performed to in-deep explore the role of MAS1L in LUAD. CONCLUSIONS: Results showed that the signature based on MAS1L, TCP10L2, and CRHR2 is a useful tool to predict prognosis and lung cancer lymph node metastasis.
format Online
Article
Text
id pubmed-9581663
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-95816632022-10-20 Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm Zhou, Qian Wang, Xianghui Qian, Haiyun Ma, Shengwei Lei, Chenggang Cui, Fenghe Comput Math Methods Med Research Article BACKGROUND: Lymph node metastasis is an important route of lung cancer metastasis and can significantly affect the survival of lung cancer. METHODS: All the analysis was conducted out in the R software. Expression profile and clinical information of lung adenocarcinoma (LUAD) patients were downloaded from The Cancer Genome Atlas database. RESULTS: In our study, we firstly identified the characteristic genes of lymph node metastasis in LUAD through two machine learning algorithms, least absolute shrinkage and selection operator (LASSO) logistic regression, and SVM-RFE algorithms. Ten characteristic genes were finally identified, including CRHR2, ITIH1, PRSS48, MAS1L, CYP4Z1, LMO1, TCP10L2, KRT78, IGFBP1, and PITX3. Next, we performed univariate Cox regression, LASSO regression, and multivariate Cox regression sequentially to construct a prognosis model based on MAS1L, TCP10L2, and CRHR2, which had a good prognosis prediction efficiency in both training and validation cohorts. Univariate and multivariate analysis indicated that our model is a risk factor independent of other clinical features. Pathway enrichment analysis showed that in the high-risk patients, the pathway of MYC target, unfolded protein response, interferon alpha response, DNA repair, reactive oxygen species pathway, and glycolysis were significantly enriched. Among three model genes, MAS1L aroused our interest and therefore was selected for further analysis. KM survival curves showed that the patients with higher MAS1L might have better disease-free survival and progression-free survival. Further, pathway enrichment, genomic instability, immune infiltration, and drug sensitivity analysis were performed to in-deep explore the role of MAS1L in LUAD. CONCLUSIONS: Results showed that the signature based on MAS1L, TCP10L2, and CRHR2 is a useful tool to predict prognosis and lung cancer lymph node metastasis. Hindawi 2022-10-12 /pmc/articles/PMC9581663/ /pubmed/36277017 http://dx.doi.org/10.1155/2022/1968829 Text en Copyright © 2022 Qian Zhou et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhou, Qian
Wang, Xianghui
Qian, Haiyun
Ma, Shengwei
Lei, Chenggang
Cui, Fenghe
Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm
title Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm
title_full Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm
title_fullStr Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm
title_full_unstemmed Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm
title_short Identification of the Characteristic Genes and their Roles in Lung Adenocarcinoma Lymph Node Metastasis through Machine Learning Algorithm
title_sort identification of the characteristic genes and their roles in lung adenocarcinoma lymph node metastasis through machine learning algorithm
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9581663/
https://www.ncbi.nlm.nih.gov/pubmed/36277017
http://dx.doi.org/10.1155/2022/1968829
work_keys_str_mv AT zhouqian identificationofthecharacteristicgenesandtheirrolesinlungadenocarcinomalymphnodemetastasisthroughmachinelearningalgorithm
AT wangxianghui identificationofthecharacteristicgenesandtheirrolesinlungadenocarcinomalymphnodemetastasisthroughmachinelearningalgorithm
AT qianhaiyun identificationofthecharacteristicgenesandtheirrolesinlungadenocarcinomalymphnodemetastasisthroughmachinelearningalgorithm
AT mashengwei identificationofthecharacteristicgenesandtheirrolesinlungadenocarcinomalymphnodemetastasisthroughmachinelearningalgorithm
AT leichenggang identificationofthecharacteristicgenesandtheirrolesinlungadenocarcinomalymphnodemetastasisthroughmachinelearningalgorithm
AT cuifenghe identificationofthecharacteristicgenesandtheirrolesinlungadenocarcinomalymphnodemetastasisthroughmachinelearningalgorithm