Cargando…

Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis

BACKGROUND: Lung adenocarcinoma (LUAD) as a frequent type of lung cancer has a 5-year overall survival rate of lower than 20% among patients with advanced lung cancer. This study aims to construct a risk model to guide immunotherapy in LUAD patients effectively. MATERIALS AND METHODS: LUAD Bulk RNA-...

Descripción completa

Detalles Bibliográficos
Autores principales: Shu, Jianfeng, Jiang, Jinni, Zhao, Guofang
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10424935/
https://www.ncbi.nlm.nih.gov/pubmed/37583701
http://dx.doi.org/10.3389/fimmu.2023.1177847
_version_ 1785089767091732480
author Shu, Jianfeng
Jiang, Jinni
Zhao, Guofang
author_facet Shu, Jianfeng
Jiang, Jinni
Zhao, Guofang
author_sort Shu, Jianfeng
collection PubMed
description BACKGROUND: Lung adenocarcinoma (LUAD) as a frequent type of lung cancer has a 5-year overall survival rate of lower than 20% among patients with advanced lung cancer. This study aims to construct a risk model to guide immunotherapy in LUAD patients effectively. MATERIALS AND METHODS: LUAD Bulk RNA-seq data for the construction of a model, single-cell RNA sequencing (scRNA-seq) data (GSE203360) for cell cluster analysis, and microarray data (GSE31210) for validation were collected from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. We used the Seurat R package to filter and process scRNA-seq data. Sample clustering was performed in the ConsensusClusterPlus R package. Differentially expressed genes (DEGs) between two groups were mined by the Limma R package. MCP-counter, CIBERSORT, ssGSEA, and ESTIMATE were employed to evaluate immune characteristics. Stepwise multivariate analysis, Univariate Cox analysis, and Lasso regression analysis were conducted to identify key prognostic genes and were used to construct the risk model. Key prognostic gene expressions were explored by RT-qPCR and Western blot assay. RESULTS: A total of 27 immune cell marker genes associated with prognosis were identified for subtyping LUAD samples into clusters C3, C2, and C1. C1 had the longest overall survival and highest immune infiltration among them, followed by C2 and C3. Oncogenic pathways such as VEGF, EFGR, and MAPK were more activated in C3 compared to the other two clusters. Based on the DEGs among clusters, we confirmed seven key prognostic genes including CPA3, S100P, PTTG1, LOXL2, MELTF, PKP2, and TMPRSS11E. Two risk groups defined by the seven-gene risk model presented distinct responses to immunotherapy and chemotherapy, immune infiltration, and prognosis. The mRNA and protein level of CPA3 was decreased, while the remaining six gene levels were increased in clinical tumor tissues. CONCLUSION: Immune cell markers are effective in clustering LUAD samples into different subtypes, and they play important roles in regulating the immune microenvironment and cancer development. In addition, the seven-gene risk model may serve as a guide for assisting in personalized treatment in LUAD patients.
format Online
Article
Text
id pubmed-10424935
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-104249352023-08-15 Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis Shu, Jianfeng Jiang, Jinni Zhao, Guofang Front Immunol Immunology BACKGROUND: Lung adenocarcinoma (LUAD) as a frequent type of lung cancer has a 5-year overall survival rate of lower than 20% among patients with advanced lung cancer. This study aims to construct a risk model to guide immunotherapy in LUAD patients effectively. MATERIALS AND METHODS: LUAD Bulk RNA-seq data for the construction of a model, single-cell RNA sequencing (scRNA-seq) data (GSE203360) for cell cluster analysis, and microarray data (GSE31210) for validation were collected from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. We used the Seurat R package to filter and process scRNA-seq data. Sample clustering was performed in the ConsensusClusterPlus R package. Differentially expressed genes (DEGs) between two groups were mined by the Limma R package. MCP-counter, CIBERSORT, ssGSEA, and ESTIMATE were employed to evaluate immune characteristics. Stepwise multivariate analysis, Univariate Cox analysis, and Lasso regression analysis were conducted to identify key prognostic genes and were used to construct the risk model. Key prognostic gene expressions were explored by RT-qPCR and Western blot assay. RESULTS: A total of 27 immune cell marker genes associated with prognosis were identified for subtyping LUAD samples into clusters C3, C2, and C1. C1 had the longest overall survival and highest immune infiltration among them, followed by C2 and C3. Oncogenic pathways such as VEGF, EFGR, and MAPK were more activated in C3 compared to the other two clusters. Based on the DEGs among clusters, we confirmed seven key prognostic genes including CPA3, S100P, PTTG1, LOXL2, MELTF, PKP2, and TMPRSS11E. Two risk groups defined by the seven-gene risk model presented distinct responses to immunotherapy and chemotherapy, immune infiltration, and prognosis. The mRNA and protein level of CPA3 was decreased, while the remaining six gene levels were increased in clinical tumor tissues. CONCLUSION: Immune cell markers are effective in clustering LUAD samples into different subtypes, and they play important roles in regulating the immune microenvironment and cancer development. In addition, the seven-gene risk model may serve as a guide for assisting in personalized treatment in LUAD patients. Frontiers Media S.A. 2023-07-31 /pmc/articles/PMC10424935/ /pubmed/37583701 http://dx.doi.org/10.3389/fimmu.2023.1177847 Text en Copyright © 2023 Shu, Jiang and Zhao https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Immunology
Shu, Jianfeng
Jiang, Jinni
Zhao, Guofang
Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
title Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
title_full Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
title_fullStr Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
title_full_unstemmed Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
title_short Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
title_sort identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
topic Immunology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10424935/
https://www.ncbi.nlm.nih.gov/pubmed/37583701
http://dx.doi.org/10.3389/fimmu.2023.1177847
work_keys_str_mv AT shujianfeng identificationofnovelgenesignatureforlungadenocarcinomabymachinelearningtopredictimmunotherapyandprognosis
AT jiangjinni identificationofnovelgenesignatureforlungadenocarcinomabymachinelearningtopredictimmunotherapyandprognosis
AT zhaoguofang identificationofnovelgenesignatureforlungadenocarcinomabymachinelearningtopredictimmunotherapyandprognosis