Cargando…
Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer
PURPOSE: The purpose of this study was to investigate the clinical and non-clinical characteristics that may affect the early death rate of patients with metastatic colorectal carcinoma (mCRC) and develop accurate prognostic predictive models for mCRC. METHOD: Medical records of 35,639 patients with...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Frontiers Media S.A.
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9810140/ https://www.ncbi.nlm.nih.gov/pubmed/36605237 http://dx.doi.org/10.3389/fpubh.2022.1008137 |
_version_ | 1784863248207577088 |
---|---|
author | Zhang, Yalong Zhang, Zunni Wei, Liuxiang Wei, Shujing |
author_facet | Zhang, Yalong Zhang, Zunni Wei, Liuxiang Wei, Shujing |
author_sort | Zhang, Yalong |
collection | PubMed |
description | PURPOSE: The purpose of this study was to investigate the clinical and non-clinical characteristics that may affect the early death rate of patients with metastatic colorectal carcinoma (mCRC) and develop accurate prognostic predictive models for mCRC. METHOD: Medical records of 35,639 patients with mCRC diagnosed from 2010 to 2019 were obtained from the SEER database. All the patients were randomly divided into a training cohort and a validation cohort in a ratio of 7:3. X-tile software was utilized to identify the optimal cutoff point for age and tumor size. Univariate and multivariate logistic regression models were used to determine the independent predictors associated with overall early death and cancer-specific early death caused by mCRC. Simultaneously, predictive and dynamic nomograms were constructed. Moreover, logistic regression, random forest, CatBoost, LightGBM, and XGBoost were used to establish machine learning (ML) models. In addition, receiver operating characteristic curves (ROCs) and calibration plots were obtained to estimate the accuracy of the models. Decision curve analysis (DCA) was employed to determine the clinical benefits of ML models. RESULTS: The optimal cutoff points for age were 58 and 77 years and those for tumor size of 45 and 76. A total of 15 independent risk factors, namely, age, marital status, race, tumor localization, histologic type, grade, N-stage, tumor size, surgery, radiation, chemotherapy, bone metastasis, brain metastasis, liver metastasis, and lung metastasis, were significantly associated with the overall early death rate of patients with mCRC and the cancer-specific early death rate of patients with mCRC, following which nomograms were constructed. The ML models revealed that the random forest model accurately predicted outcomes, followed by logistic regression, CatBoost, XGBoost, and LightGBM models. Compared with other algorithms, the random forest model provided more clinical benefits than other models and can be used to make clinical decisions in overall early death and specific early death caused by mCRC. CONCLUSION: ML algorithms combined with nomograms may play an important role in distinguishing early deaths owing to mCRC and potentially help clinicians make clinical decisions and follow-up strategies. |
format | Online Article Text |
id | pubmed-9810140 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Frontiers Media S.A. |
record_format | MEDLINE/PubMed |
spelling | pubmed-98101402023-01-04 Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer Zhang, Yalong Zhang, Zunni Wei, Liuxiang Wei, Shujing Front Public Health Public Health PURPOSE: The purpose of this study was to investigate the clinical and non-clinical characteristics that may affect the early death rate of patients with metastatic colorectal carcinoma (mCRC) and develop accurate prognostic predictive models for mCRC. METHOD: Medical records of 35,639 patients with mCRC diagnosed from 2010 to 2019 were obtained from the SEER database. All the patients were randomly divided into a training cohort and a validation cohort in a ratio of 7:3. X-tile software was utilized to identify the optimal cutoff point for age and tumor size. Univariate and multivariate logistic regression models were used to determine the independent predictors associated with overall early death and cancer-specific early death caused by mCRC. Simultaneously, predictive and dynamic nomograms were constructed. Moreover, logistic regression, random forest, CatBoost, LightGBM, and XGBoost were used to establish machine learning (ML) models. In addition, receiver operating characteristic curves (ROCs) and calibration plots were obtained to estimate the accuracy of the models. Decision curve analysis (DCA) was employed to determine the clinical benefits of ML models. RESULTS: The optimal cutoff points for age were 58 and 77 years and those for tumor size of 45 and 76. A total of 15 independent risk factors, namely, age, marital status, race, tumor localization, histologic type, grade, N-stage, tumor size, surgery, radiation, chemotherapy, bone metastasis, brain metastasis, liver metastasis, and lung metastasis, were significantly associated with the overall early death rate of patients with mCRC and the cancer-specific early death rate of patients with mCRC, following which nomograms were constructed. The ML models revealed that the random forest model accurately predicted outcomes, followed by logistic regression, CatBoost, XGBoost, and LightGBM models. Compared with other algorithms, the random forest model provided more clinical benefits than other models and can be used to make clinical decisions in overall early death and specific early death caused by mCRC. CONCLUSION: ML algorithms combined with nomograms may play an important role in distinguishing early deaths owing to mCRC and potentially help clinicians make clinical decisions and follow-up strategies. Frontiers Media S.A. 2022-12-20 /pmc/articles/PMC9810140/ /pubmed/36605237 http://dx.doi.org/10.3389/fpubh.2022.1008137 Text en Copyright © 2022 Zhang, Zhang, Wei and Wei. https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms. |
spellingShingle | Public Health Zhang, Yalong Zhang, Zunni Wei, Liuxiang Wei, Shujing Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
title | Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
title_full | Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
title_fullStr | Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
title_full_unstemmed | Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
title_short | Construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
title_sort | construction and validation of nomograms combined with novel machine learning algorithms to predict early death of patients with metastatic colorectal cancer |
topic | Public Health |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9810140/ https://www.ncbi.nlm.nih.gov/pubmed/36605237 http://dx.doi.org/10.3389/fpubh.2022.1008137 |
work_keys_str_mv | AT zhangyalong constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer AT zhangzunni constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer AT weiliuxiang constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer AT weishujing constructionandvalidationofnomogramscombinedwithnovelmachinelearningalgorithmstopredictearlydeathofpatientswithmetastaticcolorectalcancer |