Cargando…

Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma

Background: Single-cell RNA sequencing is necessary to understand tumor heterogeneity, and the cell type heterogeneity of lung adenocarcinoma (LUAD) has not been fully studied. Method: We first reduced the dimensionality of the GSE149655 single-cell data. Then, we statistically analysed the subpopul...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Yong, Wang, Yao, Liang, Leilei, Song, Nan
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9486955/
https://www.ncbi.nlm.nih.gov/pubmed/36147484
http://dx.doi.org/10.3389/fgene.2022.975542
_version_ 1784792390149603328
author Xu, Yong
Wang, Yao
Liang, Leilei
Song, Nan
author_facet Xu, Yong
Wang, Yao
Liang, Leilei
Song, Nan
author_sort Xu, Yong
collection PubMed
description Background: Single-cell RNA sequencing is necessary to understand tumor heterogeneity, and the cell type heterogeneity of lung adenocarcinoma (LUAD) has not been fully studied. Method: We first reduced the dimensionality of the GSE149655 single-cell data. Then, we statistically analysed the subpopulations obtained by cell annotation to find the subpopulations highly enriched in tumor tissues. Monocle was used to predict the development trajectory of five subpopulations; beam was used to find the regulatory genes of five branches; qval was used to screen the key genes; and cellchart was used to analyse cell communication. Next, we used the differentially expressed genes of TCGA-LUAD to screen for overlapping genes and established a prognostic risk model through univariate and multivariate analyses. To identify the independence of the model in clinical application, univariate and multivariate Cox regression were used to analyse the relevant HR, 95% CI of HR and p value. Finally, the novel biomarker genes were verified by qPCR and immunohistochemistry. Results: The single-cell dataset GSE149655 was subjected to quality control, filtration and dimensionality reduction. Finally, 23 subpopulations were screened, and 11-cell subgroups were annotated in 23 subpopulations. Through the statistical analysis of 11 subgroups, five important subgroups were selected, including lung epithelial cells, macrophages, neuroendocrine cells, secret cells and T cells. From the analysis of cell trajectory and cell communication, it is found that the interaction of five subpopulations is very complex and that the communication between them is dense. We believe that these five subpopulations play a very important role in the occurrence and development of LUAD. Downloading the TCGA data, we screened the marker genes of these five subpopulations, which are also the differentially expressed genes in tumorigenesis, with a total of 462 genes, and constructed 10 gene prognostic risk models based on related genes. The 10-gene signature has strong robustness and can achieve stable prediction efficiency in datasets from different platforms. Two new molecular markers related to LUAD, HLA-DRB5 and CCDC50, were verified by qPCR and immunohistochemistry. The results showed that HLA-DRB5 expression was negatively correlated with the risk of LUAD, and CCDC50 expression was positively correlated with the risk of LUAD. Conclusion: Therefore, we identified a prognostic risk model including CCL20, CP, HLA-DRB5, RHOV, CYP4B1, BASP1, ACSL4, GNG7, CCDC50 and SPATS2 as risk biomarkers and verified their predictive value for the prognosis of LUAD, which could serve as a new therapeutic target.
format Online
Article
Text
id pubmed-9486955
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-94869552022-09-21 Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma Xu, Yong Wang, Yao Liang, Leilei Song, Nan Front Genet Genetics Background: Single-cell RNA sequencing is necessary to understand tumor heterogeneity, and the cell type heterogeneity of lung adenocarcinoma (LUAD) has not been fully studied. Method: We first reduced the dimensionality of the GSE149655 single-cell data. Then, we statistically analysed the subpopulations obtained by cell annotation to find the subpopulations highly enriched in tumor tissues. Monocle was used to predict the development trajectory of five subpopulations; beam was used to find the regulatory genes of five branches; qval was used to screen the key genes; and cellchart was used to analyse cell communication. Next, we used the differentially expressed genes of TCGA-LUAD to screen for overlapping genes and established a prognostic risk model through univariate and multivariate analyses. To identify the independence of the model in clinical application, univariate and multivariate Cox regression were used to analyse the relevant HR, 95% CI of HR and p value. Finally, the novel biomarker genes were verified by qPCR and immunohistochemistry. Results: The single-cell dataset GSE149655 was subjected to quality control, filtration and dimensionality reduction. Finally, 23 subpopulations were screened, and 11-cell subgroups were annotated in 23 subpopulations. Through the statistical analysis of 11 subgroups, five important subgroups were selected, including lung epithelial cells, macrophages, neuroendocrine cells, secret cells and T cells. From the analysis of cell trajectory and cell communication, it is found that the interaction of five subpopulations is very complex and that the communication between them is dense. We believe that these five subpopulations play a very important role in the occurrence and development of LUAD. Downloading the TCGA data, we screened the marker genes of these five subpopulations, which are also the differentially expressed genes in tumorigenesis, with a total of 462 genes, and constructed 10 gene prognostic risk models based on related genes. The 10-gene signature has strong robustness and can achieve stable prediction efficiency in datasets from different platforms. Two new molecular markers related to LUAD, HLA-DRB5 and CCDC50, were verified by qPCR and immunohistochemistry. The results showed that HLA-DRB5 expression was negatively correlated with the risk of LUAD, and CCDC50 expression was positively correlated with the risk of LUAD. Conclusion: Therefore, we identified a prognostic risk model including CCL20, CP, HLA-DRB5, RHOV, CYP4B1, BASP1, ACSL4, GNG7, CCDC50 and SPATS2 as risk biomarkers and verified their predictive value for the prognosis of LUAD, which could serve as a new therapeutic target. Frontiers Media S.A. 2022-08-15 /pmc/articles/PMC9486955/ /pubmed/36147484 http://dx.doi.org/10.3389/fgene.2022.975542 Text en Copyright © 2022 Xu, Wang, Liang and Song. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Genetics
Xu, Yong
Wang, Yao
Liang, Leilei
Song, Nan
Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
title Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
title_full Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
title_fullStr Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
title_full_unstemmed Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
title_short Single-cell RNA sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
title_sort single-cell rna sequencing analysis to explore immune cell heterogeneity and novel biomarkers for the prognosis of lung adenocarcinoma
topic Genetics
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9486955/
https://www.ncbi.nlm.nih.gov/pubmed/36147484
http://dx.doi.org/10.3389/fgene.2022.975542
work_keys_str_mv AT xuyong singlecellrnasequencinganalysistoexploreimmunecellheterogeneityandnovelbiomarkersfortheprognosisoflungadenocarcinoma
AT wangyao singlecellrnasequencinganalysistoexploreimmunecellheterogeneityandnovelbiomarkersfortheprognosisoflungadenocarcinoma
AT liangleilei singlecellrnasequencinganalysistoexploreimmunecellheterogeneityandnovelbiomarkersfortheprognosisoflungadenocarcinoma
AT songnan singlecellrnasequencinganalysistoexploreimmunecellheterogeneityandnovelbiomarkersfortheprognosisoflungadenocarcinoma