Cargando…
Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model
Colorectal cancer represents a significant health threat, yet a standardized method for early clinical assessment and prognosis remains elusive. This study sought to address this gap by using the Seurat package to analyze a single-cell sequencing dataset (GSE178318) of colorectal cancer, thereby ide...
Autores principales: | , , , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Springer US
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10499771/ https://www.ncbi.nlm.nih.gov/pubmed/37702857 http://dx.doi.org/10.1007/s12672-023-00789-x |
_version_ | 1785105779423969280 |
---|---|
author | Lu, Di Li, Xiaofang Yuan, Yuan Li, Yaqi Wang, Jiannan Zhang, Qian Yang, Zhiyu Gao, Shanjun Zhang, Xiulei Zhou, Bingxi |
author_facet | Lu, Di Li, Xiaofang Yuan, Yuan Li, Yaqi Wang, Jiannan Zhang, Qian Yang, Zhiyu Gao, Shanjun Zhang, Xiulei Zhou, Bingxi |
author_sort | Lu, Di |
collection | PubMed |
description | Colorectal cancer represents a significant health threat, yet a standardized method for early clinical assessment and prognosis remains elusive. This study sought to address this gap by using the Seurat package to analyze a single-cell sequencing dataset (GSE178318) of colorectal cancer, thereby identifying distinctive marker genes characterizing various cell subpopulations. Through CIBERSORT analysis of colorectal cancer data within The Cancer Genome Atlas (TCGA) database, significant differences existed in both cell subpopulations and prognostic values. Employing WGCNA, we pinpointed modules exhibiting strong correlations with these subpopulations, subsequently utilizing the survival package coxph to isolate genes within these modules. Further stratification of TCGA dataset based on these selected genes brought to light notable variations between subtypes. The prognostic relevance of these differentially expressed genes was rigorously assessed through survival analysis, with LASSO regression employed for modeling prognostic factors. Our resulting model, anchored by a 10-gene signature originating from these differentially expressed genes and LASSO regression, proved adept at accurately predicting clinical prognoses, even when tested against external datasets. Specifically, natural killer cells from the C7 subpopulation were found to bear significant associations with colorectal cancer survival and prognosis, as observed within the TCGA database. These findings underscore the promise of an integrated 10-gene signature prognostic risk assessment model, harmonizing single-cell sequencing insights with TCGA data, for effectively estimating the risk associated with colorectal cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12672-023-00789-x. |
format | Online Article Text |
id | pubmed-10499771 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Springer US |
record_format | MEDLINE/PubMed |
spelling | pubmed-104997712023-09-15 Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model Lu, Di Li, Xiaofang Yuan, Yuan Li, Yaqi Wang, Jiannan Zhang, Qian Yang, Zhiyu Gao, Shanjun Zhang, Xiulei Zhou, Bingxi Discov Oncol Research Colorectal cancer represents a significant health threat, yet a standardized method for early clinical assessment and prognosis remains elusive. This study sought to address this gap by using the Seurat package to analyze a single-cell sequencing dataset (GSE178318) of colorectal cancer, thereby identifying distinctive marker genes characterizing various cell subpopulations. Through CIBERSORT analysis of colorectal cancer data within The Cancer Genome Atlas (TCGA) database, significant differences existed in both cell subpopulations and prognostic values. Employing WGCNA, we pinpointed modules exhibiting strong correlations with these subpopulations, subsequently utilizing the survival package coxph to isolate genes within these modules. Further stratification of TCGA dataset based on these selected genes brought to light notable variations between subtypes. The prognostic relevance of these differentially expressed genes was rigorously assessed through survival analysis, with LASSO regression employed for modeling prognostic factors. Our resulting model, anchored by a 10-gene signature originating from these differentially expressed genes and LASSO regression, proved adept at accurately predicting clinical prognoses, even when tested against external datasets. Specifically, natural killer cells from the C7 subpopulation were found to bear significant associations with colorectal cancer survival and prognosis, as observed within the TCGA database. These findings underscore the promise of an integrated 10-gene signature prognostic risk assessment model, harmonizing single-cell sequencing insights with TCGA data, for effectively estimating the risk associated with colorectal cancer. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1007/s12672-023-00789-x. Springer US 2023-09-13 /pmc/articles/PMC10499771/ /pubmed/37702857 http://dx.doi.org/10.1007/s12672-023-00789-x Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Research Lu, Di Li, Xiaofang Yuan, Yuan Li, Yaqi Wang, Jiannan Zhang, Qian Yang, Zhiyu Gao, Shanjun Zhang, Xiulei Zhou, Bingxi Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
title | Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
title_full | Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
title_fullStr | Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
title_full_unstemmed | Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
title_short | Integrating TCGA and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
title_sort | integrating tcga and single-cell sequencing data for colorectal cancer: a 10-gene prognostic risk assessment model |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10499771/ https://www.ncbi.nlm.nih.gov/pubmed/37702857 http://dx.doi.org/10.1007/s12672-023-00789-x |
work_keys_str_mv | AT ludi integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT lixiaofang integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT yuanyuan integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT liyaqi integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT wangjiannan integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT zhangqian integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT yangzhiyu integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT gaoshanjun integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT zhangxiulei integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel AT zhoubingxi integratingtcgaandsinglecellsequencingdataforcolorectalcancera10geneprognosticriskassessmentmodel |