Cargando…

Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model

Colorectal cancer represents a significant health threat, yet a standardized method for early clinical assessment and prognosis remains elusive. This study sought to address this gap by using the Seurat package to analyze a single-cell sequencing dataset (GSE178318) of colorectal cancer, thereby ide...

Descripción completa

Detalles Bibliográficos
Autores principales: Lu, Di, Li, Xiaofang, Yuan, Yuan, Li, Yaqi, Wang, Jiannan, Zhang, Qian, Yang, Zhiyu, Gao, Shanjun, Zhang, Xiulei, Zhou, Bingxi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer US 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10499771/
https://www.ncbi.nlm.nih.gov/pubmed/37702857
http://dx.doi.org/10.1007/s12672-023-00789-x
_version_ 1785105779423969280
author Lu, Di
Li, Xiaofang
Yuan, Yuan
Li, Yaqi
Wang, Jiannan
Zhang, Qian
Yang, Zhiyu
Gao, Shanjun
Zhang, Xiulei
Zhou, Bingxi
author_facet Lu, Di
Li, Xiaofang
Yuan, Yuan
Li, Yaqi
Wang, Jiannan
Zhang, Qian
Yang, Zhiyu
Gao, Shanjun
Zhang, Xiulei
Zhou, Bingxi
author_sort Lu, Di
collection PubMed
description Colorectal cancer represents a significant health threat, yet a standardized method for early clinical assessment and prognosis remains elusive. This study sought to address this gap by using the Seurat package to analyze a single-cell sequencing dataset (GSE178318) of colorectal cancer, thereby identifying distinctive marker genes characterizing various cell subpopulations. Through CIBERSORT analysis of colorectal cancer data within The Cancer Genome Atlas (TCGA) database, significant differences existed in both cell subpopulations and prognostic values. Employing WGCNA, we pinpointed modules exhibiting strong correlations with these subpopulations, subsequently utilizing the survival package coxph to isolate genes within these modules. Further stratification of TCGA dataset based on these selected genes brought to light notable variations between subtypes. The prognostic relevance of these differentially expressed genes was rigorously assessed through survival analysis, with LASSO regression employed for modeling prognostic factors. Our resulting model, anchored by a 10-gene signature originating from these differentially expressed genes and LASSO regression, proved adept at accurately predicting clinical prognoses, even when tested against external datasets. Specifically, natural killer cells from the C7 subpopulation were found to bear significant associations with colorectal cancer survival and prognosis, as observed within the TCGA database. These findings underscore the promise of an integrated 10-gene signature prognostic risk assessment model, harmonizing single-cell sequencing insights with TCGA data, for effectively estimating the risk associated with colorectal cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12672-023-00789-x.
format Online
Article
Text
id pubmed-10499771
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Springer US
record_format MEDLINE/PubMed
spelling pubmed-104997712023-09-15 Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model Lu, Di Li, Xiaofang Yuan, Yuan Li, Yaqi Wang, Jiannan Zhang, Qian Yang, Zhiyu Gao, Shanjun Zhang, Xiulei Zhou, Bingxi Discov Oncol Research Colorectal cancer represents a significant health threat, yet a standardized method for early clinical assessment and prognosis remains elusive. This study sought to address this gap by using the Seurat package to analyze a single-cell sequencing dataset (GSE178318) of colorectal cancer, thereby identifying distinctive marker genes characterizing various cell subpopulations. Through CIBERSORT analysis of colorectal cancer data within The Cancer Genome Atlas (TCGA) database, significant differences existed in both cell subpopulations and prognostic values. Employing WGCNA, we pinpointed modules exhibiting strong correlations with these subpopulations, subsequently utilizing the survival package coxph to isolate genes within these modules. Further stratification of TCGA dataset based on these selected genes brought to light notable variations between subtypes. The prognostic relevance of these differentially expressed genes was rigorously assessed through survival analysis, with LASSO regression employed for modeling prognostic factors. Our resulting model, anchored by a 10-gene signature originating from these differentially expressed genes and LASSO regression, proved adept at accurately predicting clinical prognoses, even when tested against external datasets. Specifically, natural killer cells from the C7 subpopulation were found to bear significant associations with colorectal cancer survival and prognosis, as observed within the TCGA database. These findings underscore the promise of an integrated 10-gene signature prognostic risk assessment model, harmonizing single-cell sequencing insights with TCGA data, for effectively estimating the risk associated with colorectal cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12672-023-00789-x. Springer US 2023-09-13 /pmc/articles/PMC10499771/ /pubmed/37702857 http://dx.doi.org/10.1007/s12672-023-00789-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Lu, Di
Li, Xiaofang
Yuan, Yuan
Li, Yaqi
Wang, Jiannan
Zhang, Qian
Yang, Zhiyu
Gao, Shanjun
Zhang, Xiulei
Zhou, Bingxi
Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
title Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
title_full Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
title_fullStr Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
title_full_unstemmed Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
title_short Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
title_sort integrating tcga and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10499771/
https://www.ncbi.nlm.nih.gov/pubmed/37702857
http://dx.doi.org/10.1007/s12672-023-00789-x
work_keys_str_mv AT ludi integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT lixiaofang integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT yuanyuan integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT liyaqi integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT wangjiannan integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT zhangqian integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT yangzhiyu integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT gaoshanjun integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT zhangxiulei integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel
AT zhoubingxi integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel