Cargando…
Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis
BACKGROUND: Lung adenocarcinoma (LUAD) as a frequent type of lung cancer has a 5-year overall survival rate of lower than 20% among patients with advanced lung cancer. This study aims to construct a risk model to guide immunotherapy in LUAD patients effectively. MATERIALS AND METHODS: LUAD Bulk RNA-...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10424935/ https://www.ncbi.nlm.nih.gov/pubmed/37583701 http://dx.doi.org/10.3389/fimmu.2023.1177847 |
_version_ | 1785089767091732480 |
---|---|
author | Shu, Jianfeng Jiang, Jinni Zhao, Guofang |
author_facet | Shu, Jianfeng Jiang, Jinni Zhao, Guofang |
author_sort | Shu, Jianfeng |
collection | PubMed |
description | BACKGROUND: Lung adenocarcinoma (LUAD) as a frequent type of lung cancer has a 5-year overall survival rate of lower than 20% among patients with advanced lung cancer. This study aims to construct a risk model to guide immunotherapy in LUAD patients effectively. MATERIALS AND METHODS: LUAD Bulk RNA-seq data for the construction of a model, single-cell RNA sequencing (scRNA-seq) data (GSE203360) for cell cluster analysis, and microarray data (GSE31210) for validation were collected from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. We used the Seurat R package to filter and process scRNA-seq data. Sample clustering was performed in the ConsensusClusterPlus R package. Differentially expressed genes (DEGs) between two groups were mined by the Limma R package. MCP-counter, CIBERSORT, ssGSEA, and ESTIMATE were employed to evaluate immune characteristics. Stepwise multivariate analysis, Univariate Cox analysis, and Lasso regression analysis were conducted to identify key prognostic genes and were used to construct the risk model. Key prognostic gene expressions were explored by RT-qPCR and Western blot assay. RESULTS: A total of 27 immune cell marker genes associated with prognosis were identified for subtyping LUAD samples into clusters C3, C2, and C1. C1 had the longest overall survival and highest immune infiltration among them, followed by C2 and C3. Oncogenic pathways such as VEGF, EFGR, and MAPK were more activated in C3 compared to the other two clusters. Based on the DEGs among clusters, we confirmed seven key prognostic genes including CPA3, S100P, PTTG1, LOXL2, MELTF, PKP2, and TMPRSS11E. Two risk groups defined by the seven-gene risk model presented distinct responses to immunotherapy and chemotherapy, immune infiltration, and prognosis. The mRNA and protein level of CPA3 was decreased, while the remaining six gene levels were increased in clinical tumor tissues. CONCLUSION: Immune cell markers are effective in clustering LUAD samples into different subtypes, and they play important roles in regulating the immune microenvironment and cancer development. In addition, the seven-gene risk model may serve as a guide for assisting in personalized treatment in LUAD patients. |
format | Online Article Text |
id | pubmed-10424935 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-104249352023-08-15 Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis Shu, Jianfeng Jiang, Jinni Zhao, Guofang Front Immunol Immunology BACKGROUND: Lung adenocarcinoma (LUAD) as a frequent type of lung cancer has a 5-year overall survival rate of lower than 20% among patients with advanced lung cancer. This study aims to construct a risk model to guide immunotherapy in LUAD patients effectively. MATERIALS AND METHODS: LUAD Bulk RNA-seq data for the construction of a model, single-cell RNA sequencing (scRNA-seq) data (GSE203360) for cell cluster analysis, and microarray data (GSE31210) for validation were collected from The Cancer Genome Atlas (TCGA) and Gene Expression Omnibus (GEO) database. We used the Seurat R package to filter and process scRNA-seq data. Sample clustering was performed in the ConsensusClusterPlus R package. Differentially expressed genes (DEGs) between two groups were mined by the Limma R package. MCP-counter, CIBERSORT, ssGSEA, and ESTIMATE were employed to evaluate immune characteristics. Stepwise multivariate analysis, Univariate Cox analysis, and Lasso regression analysis were conducted to identify key prognostic genes and were used to construct the risk model. Key prognostic gene expressions were explored by RT-qPCR and Western blot assay. RESULTS: A total of 27 immune cell marker genes associated with prognosis were identified for subtyping LUAD samples into clusters C3, C2, and C1. C1 had the longest overall survival and highest immune infiltration among them, followed by C2 and C3. Oncogenic pathways such as VEGF, EFGR, and MAPK were more activated in C3 compared to the other two clusters. Based on the DEGs among clusters, we confirmed seven key prognostic genes including CPA3, S100P, PTTG1, LOXL2, MELTF, PKP2, and TMPRSS11E. Two risk groups defined by the seven-gene risk model presented distinct responses to immunotherapy and chemotherapy, immune infiltration, and prognosis. The mRNA and protein level of CPA3 was decreased, while the remaining six gene levels were increased in clinical tumor tissues. CONCLUSION: Immune cell markers are effective in clustering LUAD samples into different subtypes, and they play important roles in regulating the immune microenvironment and cancer development. In addition, the seven-gene risk model may serve as a guide for assisting in personalized treatment in LUAD patients. Frontiers Media S.A. 2023-07-31 /pmc/articles/PMC10424935/ /pubmed/37583701 http://dx.doi.org/10.3389/fimmu.2023.1177847 Text en Copyright © 2023 Shu, Jiang and Zhao https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Immunology Shu, Jianfeng Jiang, Jinni Zhao, Guofang Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
title | Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
title_full | Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
title_fullStr | Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
title_full_unstemmed | Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
title_short | Identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
title_sort | identification of novel gene signature for lung adenocarcinoma by machine learning to predict immunotherapy and prognosis |
topic | Immunology |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10424935/ https://www.ncbi.nlm.nih.gov/pubmed/37583701 http://dx.doi.org/10.3389/fimmu.2023.1177847 |
work_keys_str_mv | AT shujianfeng identificationofnovelgenesignatureforlungadenocarcinomabymachinelearningtopredictimmunotherapyandprognosis AT jiangjinni identificationofnovelgenesignatureforlungadenocarcinomabymachinelearningtopredictimmunotherapyandprognosis AT zhaoguofang identificationofnovelgenesignatureforlungadenocarcinomabymachinelearningtopredictimmunotherapyandprognosis |