Cargando…

Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine

INTRODUCTION: It is essential to predict the survival status of patients based on their prognosis. This can assist physicians in evaluating treatment decisions. Random forest is an excellent machine learning algorithm even without any modification. We propose a new random forest weighting method and...

Descripción completa

Detalles Bibliográficos
Autores principales:	Xu, Cheng, Wang, Jing, Zheng, Tianlong, Cao, Yue, Ye, Fan
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Termedia Publishing House 2021
Materias:	Clinical Research
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9479734/ https://www.ncbi.nlm.nih.gov/pubmed/36160349 http://dx.doi.org/10.5114/aoms/135594

_version_	1784790857612787712
author	Xu, Cheng Wang, Jing Zheng, Tianlong Cao, Yue Ye, Fan
author_facet	Xu, Cheng Wang, Jing Zheng, Tianlong Cao, Yue Ye, Fan
author_sort	Xu, Cheng
collection	PubMed
description	INTRODUCTION: It is essential to predict the survival status of patients based on their prognosis. This can assist physicians in evaluating treatment decisions. Random forest is an excellent machine learning algorithm even without any modification. We propose a new random forest weighting method and apply it to the gastric cancer patient data from the Surveillance, Epidemiology, and End Results (SEER) program. We evaluated the generalization ability of this weighted random forest algorithm on 10 public medical datasets. Furthermore, for the same weighting mode, the difference between using out-of-bag (OOB) data and all training sets as the weighting basis is explored. MATERIAL AND METHODS: 110 697 cases of gastric cancer patients diagnosed between 1975 and 2016 obtained from the SEER database were included in the experiment. In addition, 10 public medical datasets were used for the generalization ability evaluation of this weighted random forest algorithm. RESULTS: Through experimental verification, on the SEER gastric cancer patient data, the weighted random forest algorithm improves the accuracy by 0.79% compared with the original random forest. In AUC, macro-averaging increased by 2.32% and micro-averaging increased by 0.51% on average. Among the 10 public datasets, the random forest weighted in accuracy has the best performance on 6 datasets, with an average increase of 1.44% in accuracy and an average increase of 1.2% in AUC. CONCLUSIONS: Compared with the original random forest, the weighted random forest model shows a significant improvement in performance, and the effect of using all training data as the weighting basis is better than using OOB data.
format	Online Article Text
id	pubmed-9479734
institution	National Center for Biotechnology Information
language	English
publishDate	2021
publisher	Termedia Publishing House
record_format	MEDLINE/PubMed
spelling	pubmed-94797342022-09-22 Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine Xu, Cheng Wang, Jing Zheng, Tianlong Cao, Yue Ye, Fan Arch Med Sci Clinical Research INTRODUCTION: It is essential to predict the survival status of patients based on their prognosis. This can assist physicians in evaluating treatment decisions. Random forest is an excellent machine learning algorithm even without any modification. We propose a new random forest weighting method and apply it to the gastric cancer patient data from the Surveillance, Epidemiology, and End Results (SEER) program. We evaluated the generalization ability of this weighted random forest algorithm on 10 public medical datasets. Furthermore, for the same weighting mode, the difference between using out-of-bag (OOB) data and all training sets as the weighting basis is explored. MATERIAL AND METHODS: 110 697 cases of gastric cancer patients diagnosed between 1975 and 2016 obtained from the SEER database were included in the experiment. In addition, 10 public medical datasets were used for the generalization ability evaluation of this weighted random forest algorithm. RESULTS: Through experimental verification, on the SEER gastric cancer patient data, the weighted random forest algorithm improves the accuracy by 0.79% compared with the original random forest. In AUC, macro-averaging increased by 2.32% and micro-averaging increased by 0.51% on average. Among the 10 public datasets, the random forest weighted in accuracy has the best performance on 6 datasets, with an average increase of 1.44% in accuracy and an average increase of 1.2% in AUC. CONCLUSIONS: Compared with the original random forest, the weighted random forest model shows a significant improvement in performance, and the effect of using all training data as the weighting basis is better than using OOB data. Termedia Publishing House 2021-04-10 /pmc/articles/PMC9479734/ /pubmed/36160349 http://dx.doi.org/10.5114/aoms/135594 Text en Copyright: © 2022 Termedia & Banach https://creativecommons.org/licenses/by-nc-sa/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0) License, allowing third parties to copy and redistribute the material in any medium or format and to remix, transform, and build upon the material, provided the original work is properly cited and states its license.
spellingShingle	Clinical Research Xu, Cheng Wang, Jing Zheng, Tianlong Cao, Yue Ye, Fan Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
title	Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
title_full	Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
title_fullStr	Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
title_full_unstemmed	Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
title_short	Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
title_sort	prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine
topic	Clinical Research
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9479734/ https://www.ncbi.nlm.nih.gov/pubmed/36160349 http://dx.doi.org/10.5114/aoms/135594
work_keys_str_mv	AT xucheng predictionofprognosisandsurvivalofpatientswithgastriccancerbyaweightedimprovedrandomforestmodelanapplicationofmachinelearninginmedicine AT wangjing predictionofprognosisandsurvivalofpatientswithgastriccancerbyaweightedimprovedrandomforestmodelanapplicationofmachinelearninginmedicine AT zhengtianlong predictionofprognosisandsurvivalofpatientswithgastriccancerbyaweightedimprovedrandomforestmodelanapplicationofmachinelearninginmedicine AT caoyue predictionofprognosisandsurvivalofpatientswithgastriccancerbyaweightedimprovedrandomforestmodelanapplicationofmachinelearninginmedicine AT yefan predictionofprognosisandsurvivalofpatientswithgastriccancerbyaweightedimprovedrandomforestmodelanapplicationofmachinelearninginmedicine

Prediction of prognosis and survival of patients with gastric cancer by a weighted improved random forest model: an application of machine learning in medicine

Ejemplares similares