Cargando…
Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia
INTRODUCTION: Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15–20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9450320/ https://www.ncbi.nlm.nih.gov/pubmed/36068539 http://dx.doi.org/10.1186/s12911-022-01980-w |
_version_ | 1784784499058409472 |
---|---|
author | Shanbehzadeh, Mostafa Afrash, Mohammad Reza Mirani, Nader Kazemi-Arpanahi, Hadi |
author_facet | Shanbehzadeh, Mostafa Afrash, Mohammad Reza Mirani, Nader Kazemi-Arpanahi, Hadi |
author_sort | Shanbehzadeh, Mostafa |
collection | PubMed |
description | INTRODUCTION: Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15–20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have significantly prolonged survival in CML patients, accurate prediction using available patient-level factors can be challenging. We intended to predict 5-year survival among CML patients via eight machine learning (ML) algorithms and compare their performance. METHODS: The data of 837 CML patients were retrospectively extracted and randomly split into training and test segments (70:30 ratio). The outcome variable was 5-year survival with potential values of alive or deceased. The dataset for the full features and important features selected by minimal redundancy maximal relevance (mRMR) feature selection were fed into eight ML techniques, including eXtreme gradient boosting (XGBoost), multilayer perceptron (MLP), pattern recognition network, k-nearest neighborhood (KNN), probabilistic neural network, support vector machine (SVM) (kernel = linear), SVM (kernel = RBF), and J-48. The scikit-learn library in Python was used to implement the models. Finally, the performance of the developed models was measured using some evaluation criteria with 95% confidence intervals (CI). RESULTS: Spleen palpable, age, and unexplained hemorrhage were identified as the top three effective features affecting CML 5-year survival. The performance of ML models using the selected-features was superior to that of the full-features dataset. Among the eight ML algorithms, SVM (kernel = RBF) had the best performance in tenfold cross-validation with an accuracy of 85.7%, specificity of 85%, sensitivity of 86%, F-measure of 87%, kappa statistic of 86.1%, and area under the curve (AUC) of 85% for the selected-features. Using the full-features dataset yielded an accuracy of 69.7%, specificity of 69.1%, sensitivity of 71.3%, F-measure of 72%, kappa statistic of 75.2%, and AUC of 70.1%. CONCLUSIONS: Accurate prediction of the survival likelihood of CML patients can inform caregivers to promote patient prognostication and choose the best possible treatment path. While external validation is required, our developed models will offer customized treatment and may guide the prescription of personalized medicine for CML patients. |
format | Online Article Text |
id | pubmed-9450320 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-94503202022-09-08 Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia Shanbehzadeh, Mostafa Afrash, Mohammad Reza Mirani, Nader Kazemi-Arpanahi, Hadi BMC Med Inform Decis Mak Research INTRODUCTION: Chronic myeloid leukemia (CML) is a myeloproliferative disorder resulting from the translocation of chromosomes 19 and 22. CML includes 15–20% of all cases of leukemia. Although bone marrow transplant and, more recently, tyrosine kinase inhibitors (TKIs) as a first-line treatment have significantly prolonged survival in CML patients, accurate prediction using available patient-level factors can be challenging. We intended to predict 5-year survival among CML patients via eight machine learning (ML) algorithms and compare their performance. METHODS: The data of 837 CML patients were retrospectively extracted and randomly split into training and test segments (70:30 ratio). The outcome variable was 5-year survival with potential values of alive or deceased. The dataset for the full features and important features selected by minimal redundancy maximal relevance (mRMR) feature selection were fed into eight ML techniques, including eXtreme gradient boosting (XGBoost), multilayer perceptron (MLP), pattern recognition network, k-nearest neighborhood (KNN), probabilistic neural network, support vector machine (SVM) (kernel = linear), SVM (kernel = RBF), and J-48. The scikit-learn library in Python was used to implement the models. Finally, the performance of the developed models was measured using some evaluation criteria with 95% confidence intervals (CI). RESULTS: Spleen palpable, age, and unexplained hemorrhage were identified as the top three effective features affecting CML 5-year survival. The performance of ML models using the selected-features was superior to that of the full-features dataset. Among the eight ML algorithms, SVM (kernel = RBF) had the best performance in tenfold cross-validation with an accuracy of 85.7%, specificity of 85%, sensitivity of 86%, F-measure of 87%, kappa statistic of 86.1%, and area under the curve (AUC) of 85% for the selected-features. Using the full-features dataset yielded an accuracy of 69.7%, specificity of 69.1%, sensitivity of 71.3%, F-measure of 72%, kappa statistic of 75.2%, and AUC of 70.1%. CONCLUSIONS: Accurate prediction of the survival likelihood of CML patients can inform caregivers to promote patient prognostication and choose the best possible treatment path. While external validation is required, our developed models will offer customized treatment and may guide the prescription of personalized medicine for CML patients. BioMed Central 2022-09-06 /pmc/articles/PMC9450320/ /pubmed/36068539 http://dx.doi.org/10.1186/s12911-022-01980-w Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Shanbehzadeh, Mostafa Afrash, Mohammad Reza Mirani, Nader Kazemi-Arpanahi, Hadi Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
title | Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
title_full | Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
title_fullStr | Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
title_full_unstemmed | Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
title_short | Comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
title_sort | comparing machine learning algorithms to predict 5-year survival in patients with chronic myeloid leukemia |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9450320/ https://www.ncbi.nlm.nih.gov/pubmed/36068539 http://dx.doi.org/10.1186/s12911-022-01980-w |
work_keys_str_mv | AT shanbehzadehmostafa comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia AT afrashmohammadreza comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia AT miraninader comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia AT kazemiarpanahihadi comparingmachinelearningalgorithmstopredict5yearsurvivalinpatientswithchronicmyeloidleukemia |