Cargando…

Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia

INTRODUCTION: Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15–20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have...

Descripción completa

Detalles Bibliográficos
Autores principales: Shanbehzadeh, Mostafa, Afrash, Mohammad Reza, Mirani, Nader, Kazemi-Arpanahi, Hadi
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9450320/
https://www.ncbi.nlm.nih.gov/pubmed/36068539
http://dx.doi.org/10.1186/s12911-022-01980-w
_version_ 1784784499058409472
author Shanbehzadeh, Mostafa
Afrash, Mohammad Reza
Mirani, Nader
Kazemi-Arpanahi, Hadi
author_facet Shanbehzadeh, Mostafa
Afrash, Mohammad Reza
Mirani, Nader
Kazemi-Arpanahi, Hadi
author_sort Shanbehzadeh, Mostafa
collection PubMed
description INTRODUCTION: Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15–20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have significantly prolonged survival in CML patients, accurate prediction using available patient-level factors can be challenging. We intended to predict 5-year survival among CML patients via eight machine learning (ML) algorithms and compare their performance. METHODS: The data of 837 CML patients were retrospectively extracted and randomly split into training and test segments (70:30 ratio). The outcome variable was 5-year survival with potential values of alive or deceased. The dataset for the full features and important features selected by minimal redundancy maximal relevance (mRMR) feature selection were fed into eight ML techniques, including eXtreme gradient boosting (XGBoost), multilayer perceptron (MLP), pattern recognition network, k-nearest neighborhood (KNN), probabilistic neural network, support vector machine (SVM) (kernel = linear), SVM (kernel = RBF), and J-48. The scikit-learn library in Python was used to implement the models. Finally, the performance of the developed models was measured using some evaluation criteria with 95% confidence intervals (CI). RESULTS: Spleen palpable, age, and unexplained hemorrhage were identified as the top three effective features affecting CML 5-year survival. The performance of ML models using the selected-features was superior to that of the full-features dataset. Among the eight ML algorithms, SVM (kernel = RBF) had the best performance in tenfold cross-validation with an accuracy of 85.7%, specificity of 85%, sensitivity of 86%, F-measure of 87%, kappa statistic of 86.1%, and area under the curve (AUC) of 85% for the selected-features. Using the full-features dataset yielded an accuracy of 69.7%, specificity of 69.1%, sensitivity of 71.3%, F-measure of 72%, kappa statistic of 75.2%, and AUC of 70.1%. CONCLUSIONS: Accurate prediction of the survival likelihood of CML patients can inform caregivers to promote patient prognostication and choose the best possible treatment path. While external validation is required, our developed models will offer customized treatment and may guide the prescription of personalized medicine for CML patients.
format Online
Article
Text
id pubmed-9450320
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-94503202022-09-08 Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia Shanbehzadeh, Mostafa Afrash, Mohammad Reza Mirani, Nader Kazemi-Arpanahi, Hadi BMC Med Inform Decis Mak Research INTRODUCTION: Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15–20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have significantly prolonged survival in CML patients, accurate prediction using available patient-level factors can be challenging. We intended to predict 5-year survival among CML patients via eight machine learning (ML) algorithms and compare their performance. METHODS: The data of 837 CML patients were retrospectively extracted and randomly split into training and test segments (70:30 ratio). The outcome variable was 5-year survival with potential values of alive or deceased. The dataset for the full features and important features selected by minimal redundancy maximal relevance (mRMR) feature selection were fed into eight ML techniques, including eXtreme gradient boosting (XGBoost), multilayer perceptron (MLP), pattern recognition network, k-nearest neighborhood (KNN), probabilistic neural network, support vector machine (SVM) (kernel = linear), SVM (kernel = RBF), and J-48. The scikit-learn library in Python was used to implement the models. Finally, the performance of the developed models was measured using some evaluation criteria with 95% confidence intervals (CI). RESULTS: Spleen palpable, age, and unexplained hemorrhage were identified as the top three effective features affecting CML 5-year survival. The performance of ML models using the selected-features was superior to that of the full-features dataset. Among the eight ML algorithms, SVM (kernel = RBF) had the best performance in tenfold cross-validation with an accuracy of 85.7%, specificity of 85%, sensitivity of 86%, F-measure of 87%, kappa statistic of 86.1%, and area under the curve (AUC) of 85% for the selected-features. Using the full-features dataset yielded an accuracy of 69.7%, specificity of 69.1%, sensitivity of 71.3%, F-measure of 72%, kappa statistic of 75.2%, and AUC of 70.1%. CONCLUSIONS: Accurate prediction of the survival likelihood of CML patients can inform caregivers to promote patient prognostication and choose the best possible treatment path. While external validation is required, our developed models will offer customized treatment and may guide the prescription of personalized medicine for CML patients. BioMed Central 2022-09-06 /pmc/articles/PMC9450320/ /pubmed/36068539 http://dx.doi.org/10.1186/s12911-022-01980-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Shanbehzadeh, Mostafa
Afrash, Mohammad Reza
Mirani, Nader
Kazemi-Arpanahi, Hadi
Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
title Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
title_full Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
title_fullStr Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
title_full_unstemmed Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
title_short Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
title_sort comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9450320/
https://www.ncbi.nlm.nih.gov/pubmed/36068539
http://dx.doi.org/10.1186/s12911-022-01980-w
work_keys_str_mv AT shanbehzadehmostafa comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia
AT afrashmohammadreza comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia
AT miraninader comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia
AT kazemiarpanahihadi comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia