Cargando…

Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection

The aim of this study is to explore the feasibility of using machine learning (ML) technology to predict postoperative recurrence risk among stage IV colorectal cancer patients. Four basic ML algorithms were used for prediction—logistic regression, decision tree, GradientBoosting and lightGBM. The r...

Descripción completa

Detalles Bibliográficos
Autores principales: Xu, Yucan, Ju, Lingsha, Tong, Jianhua, Zhou, Cheng-Mao, Yang, Jian-Jun
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220939/
https://www.ncbi.nlm.nih.gov/pubmed/32054897
http://dx.doi.org/10.1038/s41598-020-59115-y
_version_ 1783533266155864064
author Xu, Yucan
Ju, Lingsha
Tong, Jianhua
Zhou, Cheng-Mao
Yang, Jian-Jun
author_facet Xu, Yucan
Ju, Lingsha
Tong, Jianhua
Zhou, Cheng-Mao
Yang, Jian-Jun
author_sort Xu, Yucan
collection PubMed
description The aim of this study is to explore the feasibility of using machine learning (ML) technology to predict postoperative recurrence risk among stage IV colorectal cancer patients. Four basic ML algorithms were used for prediction—logistic regression, decision tree, GradientBoosting and lightGBM. The research samples were randomly divided into a training group and a testing group at a ratio of 8:2. 999 patients with stage 4 colorectal cancer were included in this study. In the training group, the GradientBoosting model’s AUC value was the highest, at 0.881. The Logistic model’s AUC value was the lowest, at 0.734. The GradientBoosting model had the highest F1_score (0.912). In the test group, the AUC Logistic model had the lowest AUC value (0.692). The GradientBoosting model’s AUC value was 0.734, which can still predict cancer progress. However, the gbm model had the highest AUC value (0.761), and the gbm model had the highest F1_score (0.974). The GradientBoosting model and the gbm model performed better than the other two algorithms. The weight matrix diagram of the GradientBoosting algorithm shows that chemotherapy, age, LogCEA, CEA and anesthesia time were the five most influential risk factors for tumor recurrence. The four machine learning algorithms can each predict the risk of tumor recurrence in patients with stage IV colorectal cancer after surgery. Among them, GradientBoosting and gbm performed best. Moreover, the GradientBoosting weight matrix shows that the five most influential variables accounting for postoperative tumor recurrence are chemotherapy, age, LogCEA, CEA and anesthesia time.
format Online
Article
Text
id pubmed-7220939
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-72209392020-05-20 Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection Xu, Yucan Ju, Lingsha Tong, Jianhua Zhou, Cheng-Mao Yang, Jian-Jun Sci Rep Article The aim of this study is to explore the feasibility of using machine learning (ML) technology to predict postoperative recurrence risk among stage IV colorectal cancer patients. Four basic ML algorithms were used for prediction—logistic regression, decision tree, GradientBoosting and lightGBM. The research samples were randomly divided into a training group and a testing group at a ratio of 8:2. 999 patients with stage 4 colorectal cancer were included in this study. In the training group, the GradientBoosting model’s AUC value was the highest, at 0.881. The Logistic model’s AUC value was the lowest, at 0.734. The GradientBoosting model had the highest F1_score (0.912). In the test group, the AUC Logistic model had the lowest AUC value (0.692). The GradientBoosting model’s AUC value was 0.734, which can still predict cancer progress. However, the gbm model had the highest AUC value (0.761), and the gbm model had the highest F1_score (0.974). The GradientBoosting model and the gbm model performed better than the other two algorithms. The weight matrix diagram of the GradientBoosting algorithm shows that chemotherapy, age, LogCEA, CEA and anesthesia time were the five most influential risk factors for tumor recurrence. The four machine learning algorithms can each predict the risk of tumor recurrence in patients with stage IV colorectal cancer after surgery. Among them, GradientBoosting and gbm performed best. Moreover, the GradientBoosting weight matrix shows that the five most influential variables accounting for postoperative tumor recurrence are chemotherapy, age, LogCEA, CEA and anesthesia time. Nature Publishing Group UK 2020-02-13 /pmc/articles/PMC7220939/ /pubmed/32054897 http://dx.doi.org/10.1038/s41598-020-59115-y Text en © The Author(s) 2020 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Xu, Yucan
Ju, Lingsha
Tong, Jianhua
Zhou, Cheng-Mao
Yang, Jian-Jun
Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection
title Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection
title_full Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection
title_fullStr Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection
title_full_unstemmed Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection
title_short Machine Learning Algorithms for Predicting the Recurrence of Stage IV Colorectal Cancer After Tumor Resection
title_sort machine learning algorithms for predicting the recurrence of stage iv colorectal cancer after tumor resection
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7220939/
https://www.ncbi.nlm.nih.gov/pubmed/32054897
http://dx.doi.org/10.1038/s41598-020-59115-y
work_keys_str_mv AT xuyucan machinelearningalgorithmsforpredictingtherecurrenceofstageivcolorectalcanceraftertumorresection
AT julingsha machinelearningalgorithmsforpredictingtherecurrenceofstageivcolorectalcanceraftertumorresection
AT tongjianhua machinelearningalgorithmsforpredictingtherecurrenceofstageivcolorectalcanceraftertumorresection
AT zhouchengmao machinelearningalgorithmsforpredictingtherecurrenceofstageivcolorectalcanceraftertumorresection
AT yangjianjun machinelearningalgorithmsforpredictingtherecurrenceofstageivcolorectalcanceraftertumorresection