Cargando…

Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers

PURPOSE: Machine learning classification algorithms (classifiers) for prediction of treatment response are becoming more popular in radiotherapy literature. General Machine learning literature provides evidence in favor of some classifier families (random forest, support vector machine, gradient boo...

Descripción completa

Detalles Bibliográficos
Autores principales: Deist, Timo M., Dankers, Frank J. W. M., Valdes, Gilmer, Wijsman, Robin, Hsu, I‐Chow, Oberije, Cary, Lustberg, Tim, van Soest, Johan, Hoebers, Frank, Jochems, Arthur, El Naqa, Issam, Wee, Leonard, Morin, Olivier, Raleigh, David R., Bots, Wouter, Kaanders, Johannes H., Belderbos, José, Kwint, Margriet, Solberg, Timothy, Monshouwer, René, Bussink, Johan, Dekker, Andre, Lambin, Philippe
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6095141/
https://www.ncbi.nlm.nih.gov/pubmed/29763967
http://dx.doi.org/10.1002/mp.12967
_version_ 1783347929791070208
author Deist, Timo M.
Dankers, Frank J. W. M.
Valdes, Gilmer
Wijsman, Robin
Hsu, I‐Chow
Oberije, Cary
Lustberg, Tim
van Soest, Johan
Hoebers, Frank
Jochems, Arthur
El Naqa, Issam
Wee, Leonard
Morin, Olivier
Raleigh, David R.
Bots, Wouter
Kaanders, Johannes H.
Belderbos, José
Kwint, Margriet
Solberg, Timothy
Monshouwer, René
Bussink, Johan
Dekker, Andre
Lambin, Philippe
author_facet Deist, Timo M.
Dankers, Frank J. W. M.
Valdes, Gilmer
Wijsman, Robin
Hsu, I‐Chow
Oberije, Cary
Lustberg, Tim
van Soest, Johan
Hoebers, Frank
Jochems, Arthur
El Naqa, Issam
Wee, Leonard
Morin, Olivier
Raleigh, David R.
Bots, Wouter
Kaanders, Johannes H.
Belderbos, José
Kwint, Margriet
Solberg, Timothy
Monshouwer, René
Bussink, Johan
Dekker, Andre
Lambin, Philippe
author_sort Deist, Timo M.
collection PubMed
description PURPOSE: Machine learning classification algorithms (classifiers) for prediction of treatment response are becoming more popular in radiotherapy literature. General Machine learning literature provides evidence in favor of some classifier families (random forest, support vector machine, gradient boosting) in terms of classification performance. The purpose of this study is to compare such classifiers specifically for (chemo)radiotherapy datasets and to estimate their average discriminative performance for radiation treatment outcome prediction. METHODS: We collected 12 datasets (3496 patients) from prior studies on post‐(chemo)radiotherapy toxicity, survival, or tumor control with clinical, dosimetric, or blood biomarker features from multiple institutions and for different tumor sites, that is, (non‐)small‐cell lung cancer, head and neck cancer, and meningioma. Six common classification algorithms with built‐in feature selection (decision tree, random forest, neural network, support vector machine, elastic net logistic regression, LogitBoost) were applied on each dataset using the popular open‐source R package caret. The R code and documentation for the analysis are available online (https://github.com/timodeist/classifier_selection_code). All classifiers were run on each dataset in a 100‐repeated nested fivefold cross‐validation with hyperparameter tuning. Performance metrics (AUC, calibration slope and intercept, accuracy, Cohen's kappa, and Brier score) were computed. We ranked classifiers by AUC to determine which classifier is likely to also perform well in future studies. We simulated the benefit for potential investigators to select a certain classifier for a new dataset based on our study (pre‐selection based on other datasets) or estimating the best classifier for a dataset (set‐specific selection based on information from the new dataset) compared with uninformed classifier selection (random selection). RESULTS: Random forest (best in 6/12 datasets) and elastic net logistic regression (best in 4/12 datasets) showed the overall best discrimination, but there was no single best classifier across datasets. Both classifiers had a median AUC rank of 2. Preselection and set‐specific selection yielded a significant average AUC improvement of 0.02 and 0.02 over random selection with an average AUC rank improvement of 0.42 and 0.66, respectively. CONCLUSION: Random forest and elastic net logistic regression yield higher discriminative performance in (chemo)radiotherapy outcome and toxicity prediction than other studied classifiers. Thus, one of these two classifiers should be the first choice for investigators when building classification models or to benchmark one's own modeling results against. Our results also show that an informed preselection of classifiers based on existing datasets can improve discrimination over random selection.
format Online
Article
Text
id pubmed-6095141
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-60951412019-07-01 Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers Deist, Timo M. Dankers, Frank J. W. M. Valdes, Gilmer Wijsman, Robin Hsu, I‐Chow Oberije, Cary Lustberg, Tim van Soest, Johan Hoebers, Frank Jochems, Arthur El Naqa, Issam Wee, Leonard Morin, Olivier Raleigh, David R. Bots, Wouter Kaanders, Johannes H. Belderbos, José Kwint, Margriet Solberg, Timothy Monshouwer, René Bussink, Johan Dekker, Andre Lambin, Philippe Med Phys BIOLOGICAL PHYSICS AND RESPONSE PREDICTION PURPOSE: Machine learning classification algorithms (classifiers) for prediction of treatment response are becoming more popular in radiotherapy literature. General Machine learning literature provides evidence in favor of some classifier families (random forest, support vector machine, gradient boosting) in terms of classification performance. The purpose of this study is to compare such classifiers specifically for (chemo)radiotherapy datasets and to estimate their average discriminative performance for radiation treatment outcome prediction. METHODS: We collected 12 datasets (3496 patients) from prior studies on post‐(chemo)radiotherapy toxicity, survival, or tumor control with clinical, dosimetric, or blood biomarker features from multiple institutions and for different tumor sites, that is, (non‐)small‐cell lung cancer, head and neck cancer, and meningioma. Six common classification algorithms with built‐in feature selection (decision tree, random forest, neural network, support vector machine, elastic net logistic regression, LogitBoost) were applied on each dataset using the popular open‐source R package caret. The R code and documentation for the analysis are available online (https://github.com/timodeist/classifier_selection_code). All classifiers were run on each dataset in a 100‐repeated nested fivefold cross‐validation with hyperparameter tuning. Performance metrics (AUC, calibration slope and intercept, accuracy, Cohen's kappa, and Brier score) were computed. We ranked classifiers by AUC to determine which classifier is likely to also perform well in future studies. We simulated the benefit for potential investigators to select a certain classifier for a new dataset based on our study (pre‐selection based on other datasets) or estimating the best classifier for a dataset (set‐specific selection based on information from the new dataset) compared with uninformed classifier selection (random selection). RESULTS: Random forest (best in 6/12 datasets) and elastic net logistic regression (best in 4/12 datasets) showed the overall best discrimination, but there was no single best classifier across datasets. Both classifiers had a median AUC rank of 2. Preselection and set‐specific selection yielded a significant average AUC improvement of 0.02 and 0.02 over random selection with an average AUC rank improvement of 0.42 and 0.66, respectively. CONCLUSION: Random forest and elastic net logistic regression yield higher discriminative performance in (chemo)radiotherapy outcome and toxicity prediction than other studied classifiers. Thus, one of these two classifiers should be the first choice for investigators when building classification models or to benchmark one's own modeling results against. Our results also show that an informed preselection of classifiers based on existing datasets can improve discrimination over random selection. John Wiley and Sons Inc. 2018-06-13 2018-07 /pmc/articles/PMC6095141/ /pubmed/29763967 http://dx.doi.org/10.1002/mp.12967 Text en © 2018 The Authors. Medical Physics published by Wiley Periodicals, Inc. on behalf of American Association of Physicists in Medicine. https://creativecommons.org/licenses/by/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited.
spellingShingle BIOLOGICAL PHYSICS AND RESPONSE PREDICTION
Deist, Timo M.
Dankers, Frank J. W. M.
Valdes, Gilmer
Wijsman, Robin
Hsu, I‐Chow
Oberije, Cary
Lustberg, Tim
van Soest, Johan
Hoebers, Frank
Jochems, Arthur
El Naqa, Issam
Wee, Leonard
Morin, Olivier
Raleigh, David R.
Bots, Wouter
Kaanders, Johannes H.
Belderbos, José
Kwint, Margriet
Solberg, Timothy
Monshouwer, René
Bussink, Johan
Dekker, Andre
Lambin, Philippe
Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers
title Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers
title_full Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers
title_fullStr Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers
title_full_unstemmed Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers
title_short Machine learning algorithms for outcome prediction in (chemo)radiotherapy: An empirical comparison of classifiers
title_sort machine learning algorithms for outcome prediction in (chemo)radiotherapy: an empirical comparison of classifiers
topic BIOLOGICAL PHYSICS AND RESPONSE PREDICTION
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6095141/
https://www.ncbi.nlm.nih.gov/pubmed/29763967
http://dx.doi.org/10.1002/mp.12967
work_keys_str_mv AT deisttimom machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT dankersfrankjwm machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT valdesgilmer machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT wijsmanrobin machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT hsuichow machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT oberijecary machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT lustbergtim machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT vansoestjohan machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT hoebersfrank machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT jochemsarthur machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT elnaqaissam machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT weeleonard machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT morinolivier machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT raleighdavidr machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT botswouter machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT kaandersjohannesh machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT belderbosjose machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT kwintmargriet machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT solbergtimothy machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT monshouwerrene machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT bussinkjohan machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT dekkerandre machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers
AT lambinphilippe machinelearningalgorithmsforoutcomepredictioninchemoradiotherapyanempiricalcomparisonofclassifiers