Cargando…

Predicting metastasis in gastric cancer patients: machine learning-based approaches

Gastric cancer (GC), with a 5-year survival rate of less than 40%, is known as the fourth principal reason of cancer-related mortality over the world. This study aims to develop predictive models using different machine learning (ML) classifiers based on both demographic and clinical variables to pr...

Descripción completa

Detalles Bibliográficos
Autores principales: Talebi, Atefeh, Celis-Morales, Carlos A., Borumandnia, Nasrin, Abbasi, Somayeh, Pourhoseingholi, Mohamad Amin, Akbari, Abolfazl, Yousefi, Javad
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011363/
https://www.ncbi.nlm.nih.gov/pubmed/36914697
http://dx.doi.org/10.1038/s41598-023-31272-w
_version_ 1784906375364608000
author Talebi, Atefeh
Celis-Morales, Carlos A.
Borumandnia, Nasrin
Abbasi, Somayeh
Pourhoseingholi, Mohamad Amin
Akbari, Abolfazl
Yousefi, Javad
author_facet Talebi, Atefeh
Celis-Morales, Carlos A.
Borumandnia, Nasrin
Abbasi, Somayeh
Pourhoseingholi, Mohamad Amin
Akbari, Abolfazl
Yousefi, Javad
author_sort Talebi, Atefeh
collection PubMed
description Gastric cancer (GC), with a 5-year survival rate of less than 40%, is known as the fourth principal reason of cancer-related mortality over the world. This study aims to develop predictive models using different machine learning (ML) classifiers based on both demographic and clinical variables to predict metastasis status of patients with GC. The data applied in this study including 733 of GC patients, divided into a train and test groups at a ratio of 8:2, diagnosed at Taleghani tertiary hospital. In order to predict metastasis in GC, ML-based algorithms, including Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), Decision Tree (RT) and Logistic Regression (LR), with 5-fold cross validation were performed. To assess the model performance, F1 score, precision, sensitivity, specificity, area under the curve (AUC) of receiver operating characteristic (ROC) curve and precision-recall AUC (PR-AUC) were obtained. 262 (36%) experienced metastasis among 733 patients with GC. Although all models have optimal performance, the indices of SVM model seems to be more appropiate (training set: AUC: 0.94, Sensitivity: 0.94; testing set: AUC: 0.85, Sensitivity: 0.92). Then, NN has the higher AUC among ML approaches (training set: AUC: 0.98; testing set: AUC: 0.86). The RF of ML-based models, which determine size of tumor and age as two essential variables, is considered as the third efficient model, because of higher specificity and AUC (84% and 87%). Based on the demographic and clinical characteristics, ML approaches can predict the metastasis status in GC patients. According to AUC, sensitivity and specificity in both SVM and NN can be regarded as better algorithms among 6 applied ML-based methods.
format Online
Article
Text
id pubmed-10011363
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-100113632023-03-15 Predicting metastasis in gastric cancer patients: machine learning-based approaches Talebi, Atefeh Celis-Morales, Carlos A. Borumandnia, Nasrin Abbasi, Somayeh Pourhoseingholi, Mohamad Amin Akbari, Abolfazl Yousefi, Javad Sci Rep Article Gastric cancer (GC), with a 5-year survival rate of less than 40%, is known as the fourth principal reason of cancer-related mortality over the world. This study aims to develop predictive models using different machine learning (ML) classifiers based on both demographic and clinical variables to predict metastasis status of patients with GC. The data applied in this study including 733 of GC patients, divided into a train and test groups at a ratio of 8:2, diagnosed at Taleghani tertiary hospital. In order to predict metastasis in GC, ML-based algorithms, including Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), Decision Tree (RT) and Logistic Regression (LR), with 5-fold cross validation were performed. To assess the model performance, F1 score, precision, sensitivity, specificity, area under the curve (AUC) of receiver operating characteristic (ROC) curve and precision-recall AUC (PR-AUC) were obtained. 262 (36%) experienced metastasis among 733 patients with GC. Although all models have optimal performance, the indices of SVM model seems to be more appropiate (training set: AUC: 0.94, Sensitivity: 0.94; testing set: AUC: 0.85, Sensitivity: 0.92). Then, NN has the higher AUC among ML approaches (training set: AUC: 0.98; testing set: AUC: 0.86). The RF of ML-based models, which determine size of tumor and age as two essential variables, is considered as the third efficient model, because of higher specificity and AUC (84% and 87%). Based on the demographic and clinical characteristics, ML approaches can predict the metastasis status in GC patients. According to AUC, sensitivity and specificity in both SVM and NN can be regarded as better algorithms among 6 applied ML-based methods. Nature Publishing Group UK 2023-03-13 /pmc/articles/PMC10011363/ /pubmed/36914697 http://dx.doi.org/10.1038/s41598-023-31272-w Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Article
Talebi, Atefeh
Celis-Morales, Carlos A.
Borumandnia, Nasrin
Abbasi, Somayeh
Pourhoseingholi, Mohamad Amin
Akbari, Abolfazl
Yousefi, Javad
Predicting metastasis in gastric cancer patients: machine learning-based approaches
title Predicting metastasis in gastric cancer patients: machine learning-based approaches
title_full Predicting metastasis in gastric cancer patients: machine learning-based approaches
title_fullStr Predicting metastasis in gastric cancer patients: machine learning-based approaches
title_full_unstemmed Predicting metastasis in gastric cancer patients: machine learning-based approaches
title_short Predicting metastasis in gastric cancer patients: machine learning-based approaches
title_sort predicting metastasis in gastric cancer patients: machine learning-based approaches
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011363/
https://www.ncbi.nlm.nih.gov/pubmed/36914697
http://dx.doi.org/10.1038/s41598-023-31272-w
work_keys_str_mv AT talebiatefeh predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches
AT celismoralescarlosa predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches
AT borumandnianasrin predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches
AT abbasisomayeh predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches
AT pourhoseingholimohamadamin predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches
AT akbariabolfazl predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches
AT yousefijavad predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches