Cargando…

Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning

The implication of the Estimation of Stromal and Immune cells in Malignant tumor tissues using expression data (ESTIMATE) method to determine the tumor microenvironment (TME) and tumor immune score including tumor purity represents an efficient method to identify and assess biomarkers for immunother...

Descripción completa

Detalles Bibliográficos
Autores principales: Feng, Haiming, Zhao, Ye, Yan, Weijian, Wei, Xiaoping, Lin, Junping, Jiang, Peng, Wang, Cheng, Li, Bin
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8916235/
https://www.ncbi.nlm.nih.gov/pubmed/35280857
http://dx.doi.org/10.3389/fmed.2022.843749
_version_ 1784668239932948480
author Feng, Haiming
Zhao, Ye
Yan, Weijian
Wei, Xiaoping
Lin, Junping
Jiang, Peng
Wang, Cheng
Li, Bin
author_facet Feng, Haiming
Zhao, Ye
Yan, Weijian
Wei, Xiaoping
Lin, Junping
Jiang, Peng
Wang, Cheng
Li, Bin
author_sort Feng, Haiming
collection PubMed
description The implication of the Estimation of Stromal and Immune cells in Malignant tumor tissues using expression data (ESTIMATE) method to determine the tumor microenvironment (TME) and tumor immune score including tumor purity represents an efficient method to identify and assess biomarkers for immunotherapy response in precision medicine. In this study we utilized a machine learning algorithm to analyze the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus database (GEO) lung adenocarcinoma (LUAD) transcriptome data to evaluate the association between TME and tumor purity. Furthermore, we investigated whether fewer TME components or a few dominant genes can infer tumor purity. The results indicated that the 29 immune infiltrating components determined by the ssGSEA method could screen the 5 TME components [chemokine C-C-Motif receptor (CCR), T-helper-cells, Check-point, Treg, and tumor-infiltrating lymphocytes (TIL)] that significantly contributed the most to tumor purity prediction through regression tree and random forest regression methods. The findings revealed that higher activity of these five immune infiltrating components significantly lowered the tumor purity. Moreover, 5 TME components contributed significantly to the improvement of Mean Square Error (MES); therefore, we selected these five sets' genes and analyzed survival data to establish a prognostic model. We screened out 11 prognostic-related genes and constructed a risk model comprising 11 genes with good predictive value for patients' prognosis. Furthermore, we obtained four genes (GIMAP6, CD80, IL16, and CCR2) that had predictive advantages for tumor purity using random forest classification and random forest regression. The comprehensive score of genes for tumor purity prediction (CSGTPP) was obtained by least absolute shrinkage and selection operator (LASSO) regression indicated that four genes could be successfully used to classify high and low CSGTPP samples and that tumor purity was negatively correlated with CSGTPP. Survival analysis revealed that the higher the CSGTPP, the better the prognosis of patients. The association between a cluster of differentiation 274 (CD274) and CSGTPP revealed a higher expression of CD274 in the high CSGTPP group. Collectively, we speculated that CSGTPP could serve as a predictor of the response to immunotherapy and a promising indicator of immunotherapy effect.
format Online
Article
Text
id pubmed-8916235
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-89162352022-03-12 Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning Feng, Haiming Zhao, Ye Yan, Weijian Wei, Xiaoping Lin, Junping Jiang, Peng Wang, Cheng Li, Bin Front Med (Lausanne) Medicine The implication of the Estimation of Stromal and Immune cells in Malignant tumor tissues using expression data (ESTIMATE) method to determine the tumor microenvironment (TME) and tumor immune score including tumor purity represents an efficient method to identify and assess biomarkers for immunotherapy response in precision medicine. In this study we utilized a machine learning algorithm to analyze the Cancer Genome Atlas (TCGA) and Gene Expression Omnibus database (GEO) lung adenocarcinoma (LUAD) transcriptome data to evaluate the association between TME and tumor purity. Furthermore, we investigated whether fewer TME components or a few dominant genes can infer tumor purity. The results indicated that the 29 immune infiltrating components determined by the ssGSEA method could screen the 5 TME components [chemokine C-C-Motif receptor (CCR), T-helper-cells, Check-point, Treg, and tumor-infiltrating lymphocytes (TIL)] that significantly contributed the most to tumor purity prediction through regression tree and random forest regression methods. The findings revealed that higher activity of these five immune infiltrating components significantly lowered the tumor purity. Moreover, 5 TME components contributed significantly to the improvement of Mean Square Error (MES); therefore, we selected these five sets' genes and analyzed survival data to establish a prognostic model. We screened out 11 prognostic-related genes and constructed a risk model comprising 11 genes with good predictive value for patients' prognosis. Furthermore, we obtained four genes (GIMAP6, CD80, IL16, and CCR2) that had predictive advantages for tumor purity using random forest classification and random forest regression. The comprehensive score of genes for tumor purity prediction (CSGTPP) was obtained by least absolute shrinkage and selection operator (LASSO) regression indicated that four genes could be successfully used to classify high and low CSGTPP samples and that tumor purity was negatively correlated with CSGTPP. Survival analysis revealed that the higher the CSGTPP, the better the prognosis of patients. The association between a cluster of differentiation 274 (CD274) and CSGTPP revealed a higher expression of CD274 in the high CSGTPP group. Collectively, we speculated that CSGTPP could serve as a predictor of the response to immunotherapy and a promising indicator of immunotherapy effect. Frontiers Media S.A. 2022-02-25 /pmc/articles/PMC8916235/ /pubmed/35280857 http://dx.doi.org/10.3389/fmed.2022.843749 Text en Copyright © 2022 Feng, Zhao, Yan, Wei, Lin, Jiang, Wang and Li. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Medicine
Feng, Haiming
Zhao, Ye
Yan, Weijian
Wei, Xiaoping
Lin, Junping
Jiang, Peng
Wang, Cheng
Li, Bin
Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
title Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
title_full Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
title_fullStr Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
title_full_unstemmed Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
title_short Identification of Signature Genes and Characterizations of Tumor Immune Microenvironment and Tumor Purity in Lung Adenocarcinoma Based on Machine Learning
title_sort identification of signature genes and characterizations of tumor immune microenvironment and tumor purity in lung adenocarcinoma based on machine learning
topic Medicine
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8916235/
https://www.ncbi.nlm.nih.gov/pubmed/35280857
http://dx.doi.org/10.3389/fmed.2022.843749
work_keys_str_mv AT fenghaiming identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT zhaoye identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT yanweijian identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT weixiaoping identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT linjunping identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT jiangpeng identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT wangcheng identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning
AT libin identificationofsignaturegenesandcharacterizationsoftumorimmunemicroenvironmentandtumorpurityinlungadenocarcinomabasedonmachinelearning