Cargando…

Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer

PURPOSE: The purpose of this study was to investigate the clinical and non-clinical characteristics that may affect the early death rate of patients with metastatic colorectal carcinoma (mCRC) and develop accurate prognostic predictive models for mCRC. METHOD: Medical records of 35,639 patients with...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhang, Yalong, Zhang, Zunni, Wei, Liuxiang, Wei, Shujing
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Frontiers Media S.A. 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9810140/
https://www.ncbi.nlm.nih.gov/pubmed/36605237
http://dx.doi.org/10.3389/fpubh.2022.1008137
_version_ 1784863248207577088
author Zhang, Yalong
Zhang, Zunni
Wei, Liuxiang
Wei, Shujing
author_facet Zhang, Yalong
Zhang, Zunni
Wei, Liuxiang
Wei, Shujing
author_sort Zhang, Yalong
collection PubMed
description PURPOSE: The purpose of this study was to investigate the clinical and non-clinical characteristics that may affect the early death rate of patients with metastatic colorectal carcinoma (mCRC) and develop accurate prognostic predictive models for mCRC. METHOD: Medical records of 35,639 patients with mCRC diagnosed from 2010 to 2019 were obtained from the SEER database. All the patients were randomly divided into a training cohort and a validation cohort in a ratio of 7:3. X-tile software was utilized to identify the optimal cutoff point for age and tumor size. Univariate and multivariate logistic regression models were used to determine the independent predictors associated with overall early death and cancer-specific early death caused by mCRC. Simultaneously, predictive and dynamic nomograms were constructed. Moreover, logistic regression, random forest, CatBoost, LightGBM, and XGBoost were used to establish machine learning (ML) models. In addition, receiver operating characteristic curves (ROCs) and calibration plots were obtained to estimate the accuracy of the models. Decision curve analysis (DCA) was employed to determine the clinical benefits of ML models. RESULTS: The optimal cutoff points for age were 58 and 77 years and those for tumor size of 45 and 76. A total of 15 independent risk factors, namely, age, marital status, race, tumor localization, histologic type, grade, N-stage, tumor size, surgery, radiation, chemotherapy, bone metastasis, brain metastasis, liver metastasis, and lung metastasis, were significantly associated with the overall early death rate of patients with mCRC and the cancer-specific early death rate of patients with mCRC, following which nomograms were constructed. The ML models revealed that the random forest model accurately predicted outcomes, followed by logistic regression, CatBoost, XGBoost, and LightGBM models. Compared with other algorithms, the random forest model provided more clinical benefits than other models and can be used to make clinical decisions in overall early death and specific early death caused by mCRC. CONCLUSION: ML algorithms combined with nomograms may play an important role in distinguishing early deaths owing to mCRC and potentially help clinicians make clinical decisions and follow-up strategies.
format Online
Article
Text
id pubmed-9810140
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Frontiers Media S.A.
record_format MEDLINE/PubMed
spelling pubmed-98101402023-01-04 Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer Zhang, Yalong Zhang, Zunni Wei, Liuxiang Wei, Shujing Front Public Health Public Health PURPOSE: The purpose of this study was to investigate the clinical and non-clinical characteristics that may affect the early death rate of patients with metastatic colorectal carcinoma (mCRC) and develop accurate prognostic predictive models for mCRC. METHOD: Medical records of 35,639 patients with mCRC diagnosed from 2010 to 2019 were obtained from the SEER database. All the patients were randomly divided into a training cohort and a validation cohort in a ratio of 7:3. X-tile software was utilized to identify the optimal cutoff point for age and tumor size. Univariate and multivariate logistic regression models were used to determine the independent predictors associated with overall early death and cancer-specific early death caused by mCRC. Simultaneously, predictive and dynamic nomograms were constructed. Moreover, logistic regression, random forest, CatBoost, LightGBM, and XGBoost were used to establish machine learning (ML) models. In addition, receiver operating characteristic curves (ROCs) and calibration plots were obtained to estimate the accuracy of the models. Decision curve analysis (DCA) was employed to determine the clinical benefits of ML models. RESULTS: The optimal cutoff points for age were 58 and 77 years and those for tumor size of 45 and 76. A total of 15 independent risk factors, namely, age, marital status, race, tumor localization, histologic type, grade, N-stage, tumor size, surgery, radiation, chemotherapy, bone metastasis, brain metastasis, liver metastasis, and lung metastasis, were significantly associated with the overall early death rate of patients with mCRC and the cancer-specific early death rate of patients with mCRC, following which nomograms were constructed. The ML models revealed that the random forest model accurately predicted outcomes, followed by logistic regression, CatBoost, XGBoost, and LightGBM models. Compared with other algorithms, the random forest model provided more clinical benefits than other models and can be used to make clinical decisions in overall early death and specific early death caused by mCRC. CONCLUSION: ML algorithms combined with nomograms may play an important role in distinguishing early deaths owing to mCRC and potentially help clinicians make clinical decisions and follow-up strategies. Frontiers Media S.A. 2022-12-20 /pmc/articles/PMC9810140/ /pubmed/36605237 http://dx.doi.org/10.3389/fpubh.2022.1008137 Text en Copyright © 2022 Zhang, Zhang, Wei and Wei. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.
spellingShingle Public Health
Zhang, Yalong
Zhang, Zunni
Wei, Liuxiang
Wei, Shujing
Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
title Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
title_full Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
title_fullStr Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
title_full_unstemmed Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
title_short Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
title_sort construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
topic Public Health
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9810140/
https://www.ncbi.nlm.nih.gov/pubmed/36605237
http://dx.doi.org/10.3389/fpubh.2022.1008137
work_keys_str_mv AT zhangyalong constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer
AT zhangzunni constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer
AT weiliuxiang constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer
AT weishujing constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer