Cargando…

Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia

The prediction of relapse in childhood acute lymphoblastic leukemia (ALL) is a critical factor for successful treatment and follow-up planning. Our goal was to construct an ALL relapse prediction model based on machine learning algorithms. Monte Carlo cross-validation nested by 10-fold cross-validat...

Descripción completa

Detalles Bibliográficos
Autores principales: Pan, Liyan, Liu, Guangjian, Lin, Fangqin, Zhong, Shuling, Xia, Huimin, Sun, Xin, Liang, Huiying
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Nature Publishing Group UK 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547099/
https://www.ncbi.nlm.nih.gov/pubmed/28784991
http://dx.doi.org/10.1038/s41598-017-07408-0
_version_ 1783255651327148032
author Pan, Liyan
Liu, Guangjian
Lin, Fangqin
Zhong, Shuling
Xia, Huimin
Sun, Xin
Liang, Huiying
author_facet Pan, Liyan
Liu, Guangjian
Lin, Fangqin
Zhong, Shuling
Xia, Huimin
Sun, Xin
Liang, Huiying
author_sort Pan, Liyan
collection PubMed
description The prediction of relapse in childhood acute lymphoblastic leukemia (ALL) is a critical factor for successful treatment and follow-up planning. Our goal was to construct an ALL relapse prediction model based on machine learning algorithms. Monte Carlo cross-validation nested by 10-fold cross-validation was used to rank clinical variables on the randomly split training sets of 336 newly diagnosed ALL children, and a forward feature selection algorithm was employed to find the shortest list of most discriminatory variables. To enable an unbiased estimation of the prediction model to new patients, besides the split test sets of 150 patients, we introduced another independent data set of 84 patients to evaluate the model. The Random Forest model with 14 features achieved a cross-validation accuracy of 0.827 ± 0.031 on one set and an accuracy of 0.798 on the other, with the area under the curve of 0.902 ± 0.027 and 0.904, respectively. The model performed well across different risk-level groups, with the best accuracy of 0.829 in the standard-risk group. To our knowledge, this is the first study to use machine learning models to predict childhood ALL relapse based on medical data from Electronic Medical Record, which will further facilitate stratification treatments.
format Online
Article
Text
id pubmed-5547099
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher Nature Publishing Group UK
record_format MEDLINE/PubMed
spelling pubmed-55470992017-08-09 Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia Pan, Liyan Liu, Guangjian Lin, Fangqin Zhong, Shuling Xia, Huimin Sun, Xin Liang, Huiying Sci Rep Article The prediction of relapse in childhood acute lymphoblastic leukemia (ALL) is a critical factor for successful treatment and follow-up planning. Our goal was to construct an ALL relapse prediction model based on machine learning algorithms. Monte Carlo cross-validation nested by 10-fold cross-validation was used to rank clinical variables on the randomly split training sets of 336 newly diagnosed ALL children, and a forward feature selection algorithm was employed to find the shortest list of most discriminatory variables. To enable an unbiased estimation of the prediction model to new patients, besides the split test sets of 150 patients, we introduced another independent data set of 84 patients to evaluate the model. The Random Forest model with 14 features achieved a cross-validation accuracy of 0.827 ± 0.031 on one set and an accuracy of 0.798 on the other, with the area under the curve of 0.902 ± 0.027 and 0.904, respectively. The model performed well across different risk-level groups, with the best accuracy of 0.829 in the standard-risk group. To our knowledge, this is the first study to use machine learning models to predict childhood ALL relapse based on medical data from Electronic Medical Record, which will further facilitate stratification treatments. Nature Publishing Group UK 2017-08-07 /pmc/articles/PMC5547099/ /pubmed/28784991 http://dx.doi.org/10.1038/s41598-017-07408-0 Text en © The Author(s) 2017 Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
spellingShingle Article
Pan, Liyan
Liu, Guangjian
Lin, Fangqin
Zhong, Shuling
Xia, Huimin
Sun, Xin
Liang, Huiying
Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
title Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
title_full Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
title_fullStr Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
title_full_unstemmed Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
title_short Machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
title_sort machine learning applications for prediction of relapse in childhood acute lymphoblastic leukemia
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5547099/
https://www.ncbi.nlm.nih.gov/pubmed/28784991
http://dx.doi.org/10.1038/s41598-017-07408-0
work_keys_str_mv AT panliyan machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia
AT liuguangjian machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia
AT linfangqin machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia
AT zhongshuling machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia
AT xiahuimin machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia
AT sunxin machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia
AT lianghuiying machinelearningapplicationsforpredictionofrelapseinchildhoodacutelymphoblasticleukemia