Cargando…

Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining

AIMS: This study aimed to investigate the distant metastasis pattern from newly diagnosed colorectal cancer (CRC) and also construct and validate a prognostic nomogram to predict both overall survival (OS) and cancer-specific survival (CSS) of CRC patients with distant metastases. METHODS: Primary C...

Descripción completa

Detalles Bibliográficos
Autores principales: Liu, Chuan, Wang, Ting, Yang, Jiahui, Zhang, Jixiang, Wei, Shuchun, Guo, Yingyun, Yu, Rong, Tan, Zongbiao, Wang, Shuo, Dong, Weiguo
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9074728/
https://www.ncbi.nlm.nih.gov/pubmed/35530362
http://dx.doi.org/10.3389/fonc.2022.878805
_version_ 1784701533429956608
author Liu, Chuan
Wang, Ting
Yang, Jiahui
Zhang, Jixiang
Wei, Shuchun
Guo, Yingyun
Yu, Rong
Tan, Zongbiao
Wang, Shuo
Dong, Weiguo
author_facet Liu, Chuan
Wang, Ting
Yang, Jiahui
Zhang, Jixiang
Wei, Shuchun
Guo, Yingyun
Yu, Rong
Tan, Zongbiao
Wang, Shuo
Dong, Weiguo
author_sort Liu, Chuan
collection PubMed
description AIMS: This study aimed to investigate the distant metastasis pattern from newly diagnosed colorectal cancer (CRC) and also construct and validate a prognostic nomogram to predict both overall survival (OS) and cancer-specific survival (CSS) of CRC patients with distant metastases. METHODS: Primary CRC patients who were initially diagnosed from 2010 to 2016 in the SEER database were included in the analysis. The independent risk factors affecting the OS, CSS, all-cause mortality, and CRC-specific mortality of the patients were screened by the Cox regression and Fine–Gray competitive risk model. The nomogram models were constructed to predict the OS and CSS of the patients. The reliability and accuracy of the prediction model were evaluated by consistency index (C-index) and calibration curve. The gene chip GSE41258 was downloaded from the GEO database, and differentially expressed genes (DEGs) were screened by the GEO2R online tool (p < 0.05, |logFC|>1.5). The Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway and Gene Ontology (GO) annotation and String website were used for enrichment analysis and protein–protein interaction (PPI) analysis of DEGs, respectively, and Cytoscape software was used to construct PPI network and screen function modules and hub genes. RESULTS: A total of 57,835 CRC patients, including 47,823 without distant metastases and 10,012 (17.31%) with metastases, were identified. Older age, unmarried status, poorly differentiated or undifferentiated grade, right colon site, larger tumor size, N2 stage, more metastatic sites, and elevated carcinoembryonic antigen (CEA) might lead to poorer prognosis (all p < 0.01). The independent risk factors of OS and CSS were included to construct a prognosis prediction model for predicting OS and CSS in CRC patients with distant metastasis. C-index and calibration curve of the training group and validation group showed that the models had acceptable predictive performance and high calibration degree. Furthermore, by comparing CRC tissues with and without liver metastasis, 158 DEGs and top 10 hub genes were screened. Hub genes were mainly concentrated in liver function and coagulation function. CONCLUSION: The big data in the public database were counted and transformed into a prognostic evaluation tool that could be applied to the clinic, which has certain clinical significance for the formulation of the treatment plan and prognostic evaluation of CRC patients with distant metastasis.
format Online
Article
Text
id pubmed-9074728
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-90747282022-05-07 Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining Liu, Chuan Wang, Ting Yang, Jiahui Zhang, Jixiang Wei, Shuchun Guo, Yingyun Yu, Rong Tan, Zongbiao Wang, Shuo Dong, Weiguo Front Oncol Oncology AIMS: This study aimed to investigate the distant metastasis pattern from newly diagnosed colorectal cancer (CRC) and also construct and validate a prognostic nomogram to predict both overall survival (OS) and cancer-specific survival (CSS) of CRC patients with distant metastases. METHODS: Primary CRC patients who were initially diagnosed from 2010 to 2016 in the SEER database were included in the analysis. The independent risk factors affecting the OS, CSS, all-cause mortality, and CRC-specific mortality of the patients were screened by the Cox regression and Fine–Gray competitive risk model. The nomogram models were constructed to predict the OS and CSS of the patients. The reliability and accuracy of the prediction model were evaluated by consistency index (C-index) and calibration curve. The gene chip GSE41258 was downloaded from the GEO database, and differentially expressed genes (DEGs) were screened by the GEO2R online tool (p < 0.05, |logFC|>1.5). The Kyoto Encyclopedia of Genes and Genomes (KEGG) Pathway and Gene Ontology (GO) annotation and String website were used for enrichment analysis and protein–protein interaction (PPI) analysis of DEGs, respectively, and Cytoscape software was used to construct PPI network and screen function modules and hub genes. RESULTS: A total of 57,835 CRC patients, including 47,823 without distant metastases and 10,012 (17.31%) with metastases, were identified. Older age, unmarried status, poorly differentiated or undifferentiated grade, right colon site, larger tumor size, N2 stage, more metastatic sites, and elevated carcinoembryonic antigen (CEA) might lead to poorer prognosis (all p < 0.01). The independent risk factors of OS and CSS were included to construct a prognosis prediction model for predicting OS and CSS in CRC patients with distant metastasis. C-index and calibration curve of the training group and validation group showed that the models had acceptable predictive performance and high calibration degree. Furthermore, by comparing CRC tissues with and without liver metastasis, 158 DEGs and top 10 hub genes were screened. Hub genes were mainly concentrated in liver function and coagulation function. CONCLUSION: The big data in the public database were counted and transformed into a prognostic evaluation tool that could be applied to the clinic, which has certain clinical significance for the formulation of the treatment plan and prognostic evaluation of CRC patients with distant metastasis. Frontiers Media S.A. 2022-04-22 /pmc/articles/PMC9074728/ /pubmed/35530362 http://dx.doi.org/10.3389/fonc.2022.878805 Text en Copyright © 2022 Liu, Wang, Yang, Zhang, Wei, Guo, Yu, Tan, Wang and Dong https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Oncology
Liu, Chuan
Wang, Ting
Yang, Jiahui
Zhang, Jixiang
Wei, Shuchun
Guo, Yingyun
Yu, Rong
Tan, Zongbiao
Wang, Shuo
Dong, Weiguo
Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining
title Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining
title_full Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining
title_fullStr Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining
title_full_unstemmed Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining
title_short Distant Metastasis Pattern and Prognostic Prediction Model of Colorectal Cancer Patients Based on Big Data Mining
title_sort distant metastasis pattern and prognostic prediction model of colorectal cancer patients based on big data mining
topic Oncology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9074728/
https://www.ncbi.nlm.nih.gov/pubmed/35530362
http://dx.doi.org/10.3389/fonc.2022.878805
work_keys_str_mv AT liuchuan distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT wangting distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT yangjiahui distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT zhangjixiang distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT weishuchun distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT guoyingyun distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT yurong distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT tanzongbiao distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT wangshuo distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining
AT dongweiguo distantmetastasispatternandprognosticpredictionmodelofcolorectalcancerpatientsbasedonbigdatamining