Cargando…
Predicting metastasis in gastric cancer patients: machine learning-based approaches
Gastric cancer (GC), with a 5-year survival rate of less than 40%, is known as the fourth principal reason of cancer-related mortality over the world. This study aims to develop predictive models using different machine learning (ML) classifiers based on both demographic and clinical variables to pr...
Autores principales: | , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Nature Publishing Group UK
2023
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011363/ https://www.ncbi.nlm.nih.gov/pubmed/36914697 http://dx.doi.org/10.1038/s41598-023-31272-w |
_version_ | 1784906375364608000 |
---|---|
author | Talebi, Atefeh Celis-Morales, Carlos A. Borumandnia, Nasrin Abbasi, Somayeh Pourhoseingholi, Mohamad Amin Akbari, Abolfazl Yousefi, Javad |
author_facet | Talebi, Atefeh Celis-Morales, Carlos A. Borumandnia, Nasrin Abbasi, Somayeh Pourhoseingholi, Mohamad Amin Akbari, Abolfazl Yousefi, Javad |
author_sort | Talebi, Atefeh |
collection | PubMed |
description | Gastric cancer (GC), with a 5-year survival rate of less than 40%, is known as the fourth principal reason of cancer-related mortality over the world. This study aims to develop predictive models using different machine learning (ML) classifiers based on both demographic and clinical variables to predict metastasis status of patients with GC. The data applied in this study including 733 of GC patients, divided into a train and test groups at a ratio of 8:2, diagnosed at Taleghani tertiary hospital. In order to predict metastasis in GC, ML-based algorithms, including Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), Decision Tree (RT) and Logistic Regression (LR), with 5-fold cross validation were performed. To assess the model performance, F1 score, precision, sensitivity, specificity, area under the curve (AUC) of receiver operating characteristic (ROC) curve and precision-recall AUC (PR-AUC) were obtained. 262 (36%) experienced metastasis among 733 patients with GC. Although all models have optimal performance, the indices of SVM model seems to be more appropiate (training set: AUC: 0.94, Sensitivity: 0.94; testing set: AUC: 0.85, Sensitivity: 0.92). Then, NN has the higher AUC among ML approaches (training set: AUC: 0.98; testing set: AUC: 0.86). The RF of ML-based models, which determine size of tumor and age as two essential variables, is considered as the third efficient model, because of higher specificity and AUC (84% and 87%). Based on the demographic and clinical characteristics, ML approaches can predict the metastasis status in GC patients. According to AUC, sensitivity and specificity in both SVM and NN can be regarded as better algorithms among 6 applied ML-based methods. |
format | Online Article Text |
id | pubmed-10011363 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2023 |
publisher | Nature Publishing Group UK |
record_format | MEDLINE/PubMed |
spelling | pubmed-100113632023-03-15 Predicting metastasis in gastric cancer patients: machine learning-based approaches Talebi, Atefeh Celis-Morales, Carlos A. Borumandnia, Nasrin Abbasi, Somayeh Pourhoseingholi, Mohamad Amin Akbari, Abolfazl Yousefi, Javad Sci Rep Article Gastric cancer (GC), with a 5-year survival rate of less than 40%, is known as the fourth principal reason of cancer-related mortality over the world. This study aims to develop predictive models using different machine learning (ML) classifiers based on both demographic and clinical variables to predict metastasis status of patients with GC. The data applied in this study including 733 of GC patients, divided into a train and test groups at a ratio of 8:2, diagnosed at Taleghani tertiary hospital. In order to predict metastasis in GC, ML-based algorithms, including Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), Neural Network (NN), Decision Tree (RT) and Logistic Regression (LR), with 5-fold cross validation were performed. To assess the model performance, F1 score, precision, sensitivity, specificity, area under the curve (AUC) of receiver operating characteristic (ROC) curve and precision-recall AUC (PR-AUC) were obtained. 262 (36%) experienced metastasis among 733 patients with GC. Although all models have optimal performance, the indices of SVM model seems to be more appropiate (training set: AUC: 0.94, Sensitivity: 0.94; testing set: AUC: 0.85, Sensitivity: 0.92). Then, NN has the higher AUC among ML approaches (training set: AUC: 0.98; testing set: AUC: 0.86). The RF of ML-based models, which determine size of tumor and age as two essential variables, is considered as the third efficient model, because of higher specificity and AUC (84% and 87%). Based on the demographic and clinical characteristics, ML approaches can predict the metastasis status in GC patients. According to AUC, sensitivity and specificity in both SVM and NN can be regarded as better algorithms among 6 applied ML-based methods. Nature Publishing Group UK 2023-03-13 /pmc/articles/PMC10011363/ /pubmed/36914697 http://dx.doi.org/10.1038/s41598-023-31272-w Text en © The Author(s) 2023, corrected publication 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . |
spellingShingle | Article Talebi, Atefeh Celis-Morales, Carlos A. Borumandnia, Nasrin Abbasi, Somayeh Pourhoseingholi, Mohamad Amin Akbari, Abolfazl Yousefi, Javad Predicting metastasis in gastric cancer patients: machine learning-based approaches |
title | Predicting metastasis in gastric cancer patients: machine learning-based approaches |
title_full | Predicting metastasis in gastric cancer patients: machine learning-based approaches |
title_fullStr | Predicting metastasis in gastric cancer patients: machine learning-based approaches |
title_full_unstemmed | Predicting metastasis in gastric cancer patients: machine learning-based approaches |
title_short | Predicting metastasis in gastric cancer patients: machine learning-based approaches |
title_sort | predicting metastasis in gastric cancer patients: machine learning-based approaches |
topic | Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10011363/ https://www.ncbi.nlm.nih.gov/pubmed/36914697 http://dx.doi.org/10.1038/s41598-023-31272-w |
work_keys_str_mv | AT talebiatefeh predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches AT celismoralescarlosa predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches AT borumandnianasrin predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches AT abbasisomayeh predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches AT pourhoseingholimohamadamin predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches AT akbariabolfazl predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches AT yousefijavad predictingmetastasisingastriccancerpatientsmachinelearningbasedapproaches |