Cargando…

Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer

BACKGROUND: Single-cell sequencing technology provides the capability to analyze changes in specific cell types during the progression of disease. However, previous single-cell sequencing studies on gastric cancer (GC) have largely focused on immune cells and stromal cells, and further elucidation i...

Descripción completa

Detalles Bibliográficos
Autores principales: Wen, Fei, Guan, Xin, Qu, Hai-Xia, Jiang, Xiang-Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Baishideng Publishing Group Inc 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10401466/
https://www.ncbi.nlm.nih.gov/pubmed/37546563
http://dx.doi.org/10.4251/wjgo.v15.i7.1215
_version_ 1785084670966235136
author Wen, Fei
Guan, Xin
Qu, Hai-Xia
Jiang, Xiang-Jun
author_facet Wen, Fei
Guan, Xin
Qu, Hai-Xia
Jiang, Xiang-Jun
author_sort Wen, Fei
collection PubMed
description BACKGROUND: Single-cell sequencing technology provides the capability to analyze changes in specific cell types during the progression of disease. However, previous single-cell sequencing studies on gastric cancer (GC) have largely focused on immune cells and stromal cells, and further elucidation is required regarding the alterations that occur in gastric epithelial cells during the development of GC. AIM: To create a GC prediction model based on single-cell and bulk RNA sequencing (bulk RNA-seq) data. METHODS: In this study, we conducted a comprehensive analysis by integrating three single-cell RNA sequencing (scRNA-seq) datasets and ten bulk RNA-seq datasets. Our analysis mainly focused on determining cell proportions and identifying differentially expressed genes (DEGs). Specifically, we performed differential expression analysis among epithelial cells in GC tissues and normal gastric tissues (NAGs) and utilized both single-cell and bulk RNA-seq data to establish a prediction model for GC. We further validated the accuracy of the GC prediction model in bulk RNA-seq data. We also used Kaplan–Meier plots to verify the correlation between genes in the prediction model and the prognosis of GC. RESULTS: By analyzing scRNA-seq data from a total of 70707 cells from GC tissue, NAG, and chronic gastric tissue, 10 cell types were identified, and DEGs in GC and normal epithelial cells were screened. After determining the DEGs in GC and normal gastric samples identified by bulk RNA-seq data, a GC predictive classifier was constructed using the Least absolute shrinkage and selection operator (LASSO) and random forest methods. The LASSO classifier showed good performance in both validation and model verification using The Cancer Genome Atlas and Genotype-Tissue Expression (GTEx) datasets [area under the curve (AUC)_min = 0.988, AUC_1se = 0.994], and the random forest model also achieved good results with the validation set (AUC = 0.92). Genes TIMP1, PLOD3, CKS2, TYMP, TNFRSF10B, CPNE1, GDF15, BCAP31, and CLDN7 were identified to have high importance values in multiple GC predictive models, and KM-PLOTTER analysis showed their relevance to GC prognosis, suggesting their potential for use in GC diagnosis and treatment. CONCLUSION: A predictive classifier was established based on the analysis of RNA-seq data, and the genes in it are expected to serve as auxiliary markers in the clinical diagnosis of GC.
format Online
Article
Text
id pubmed-10401466
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Baishideng Publishing Group Inc
record_format MEDLINE/PubMed
spelling pubmed-104014662023-08-05 Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer Wen, Fei Guan, Xin Qu, Hai-Xia Jiang, Xiang-Jun World J Gastrointest Oncol Clinical and Translational Research BACKGROUND: Single-cell sequencing technology provides the capability to analyze changes in specific cell types during the progression of disease. However, previous single-cell sequencing studies on gastric cancer (GC) have largely focused on immune cells and stromal cells, and further elucidation is required regarding the alterations that occur in gastric epithelial cells during the development of GC. AIM: To create a GC prediction model based on single-cell and bulk RNA sequencing (bulk RNA-seq) data. METHODS: In this study, we conducted a comprehensive analysis by integrating three single-cell RNA sequencing (scRNA-seq) datasets and ten bulk RNA-seq datasets. Our analysis mainly focused on determining cell proportions and identifying differentially expressed genes (DEGs). Specifically, we performed differential expression analysis among epithelial cells in GC tissues and normal gastric tissues (NAGs) and utilized both single-cell and bulk RNA-seq data to establish a prediction model for GC. We further validated the accuracy of the GC prediction model in bulk RNA-seq data. We also used Kaplan–Meier plots to verify the correlation between genes in the prediction model and the prognosis of GC. RESULTS: By analyzing scRNA-seq data from a total of 70707 cells from GC tissue, NAG, and chronic gastric tissue, 10 cell types were identified, and DEGs in GC and normal epithelial cells were screened. After determining the DEGs in GC and normal gastric samples identified by bulk RNA-seq data, a GC predictive classifier was constructed using the Least absolute shrinkage and selection operator (LASSO) and random forest methods. The LASSO classifier showed good performance in both validation and model verification using The Cancer Genome Atlas and Genotype-Tissue Expression (GTEx) datasets [area under the curve (AUC)_min = 0.988, AUC_1se = 0.994], and the random forest model also achieved good results with the validation set (AUC = 0.92). Genes TIMP1, PLOD3, CKS2, TYMP, TNFRSF10B, CPNE1, GDF15, BCAP31, and CLDN7 were identified to have high importance values in multiple GC predictive models, and KM-PLOTTER analysis showed their relevance to GC prognosis, suggesting their potential for use in GC diagnosis and treatment. CONCLUSION: A predictive classifier was established based on the analysis of RNA-seq data, and the genes in it are expected to serve as auxiliary markers in the clinical diagnosis of GC. Baishideng Publishing Group Inc 2023-07-15 2023-07-15 /pmc/articles/PMC10401466/ /pubmed/37546563 http://dx.doi.org/10.4251/wjgo.v15.i7.1215 Text en ©The Author(s) 2023. Published by Baishideng Publishing Group Inc. All rights reserved. https://creativecommons.org/licenses/by-nc/4.0/This article is an open-access article that was selected by an in-house editor and fully peer-reviewed by external reviewers. It is distributed in accordance with the Creative Commons Attribution NonCommercial (CC BY-NC 4.0) license, which permits others to distribute, remix, adapt, build upon this work non-commercially, and license their derivative works on different terms, provided the original work is properly cited and the use is non-commercial.
spellingShingle Clinical and Translational Research
Wen, Fei
Guan, Xin
Qu, Hai-Xia
Jiang, Xiang-Jun
Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer
title Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer
title_full Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer
title_fullStr Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer
title_full_unstemmed Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer
title_short Integrated analysis of single-cell and bulk RNA-seq establishes a novel signature for prediction in gastric cancer
title_sort integrated analysis of single-cell and bulk rna-seq establishes a novel signature for prediction in gastric cancer
topic Clinical and Translational Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10401466/
https://www.ncbi.nlm.nih.gov/pubmed/37546563
http://dx.doi.org/10.4251/wjgo.v15.i7.1215
work_keys_str_mv AT wenfei integratedanalysisofsinglecellandbulkrnaseqestablishesanovelsignatureforpredictioningastriccancer
AT guanxin integratedanalysisofsinglecellandbulkrnaseqestablishesanovelsignatureforpredictioningastriccancer
AT quhaixia integratedanalysisofsinglecellandbulkrnaseqestablishesanovelsignatureforpredictioningastriccancer
AT jiangxiangjun integratedanalysisofsinglecellandbulkrnaseqestablishesanovelsignatureforpredictioningastriccancer