Cargando…

Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments

BACKGROUND: Objectives were to build a machine learning algorithm to identify bloodstream infection (BSI) among pediatric patients with cancer and hematopoietic stem cell transplantation (HSCT) recipients, and to compare this approach with presence of neutropenia to identify BSI. METHODS: We include...

Descripción completa

Detalles Bibliográficos
Autores principales: Sung, Lillian, Corbin, Conor, Steinberg, Ethan, Vettese, Emily, Campigotto, Aaron, Lecce, Loreto, Tomlinson, George A., Shah, Nigam
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7666525/
https://www.ncbi.nlm.nih.gov/pubmed/33187484
http://dx.doi.org/10.1186/s12885-020-07618-2
_version_ 1783610146265497600
author Sung, Lillian
Corbin, Conor
Steinberg, Ethan
Vettese, Emily
Campigotto, Aaron
Lecce, Loreto
Tomlinson, George A.
Shah, Nigam
author_facet Sung, Lillian
Corbin, Conor
Steinberg, Ethan
Vettese, Emily
Campigotto, Aaron
Lecce, Loreto
Tomlinson, George A.
Shah, Nigam
author_sort Sung, Lillian
collection PubMed
description BACKGROUND: Objectives were to build a machine learning algorithm to identify bloodstream infection (BSI) among pediatric patients with cancer and hematopoietic stem cell transplantation (HSCT) recipients, and to compare this approach with presence of neutropenia to identify BSI. METHODS: We included patients 0–18 years of age at cancer diagnosis or HSCT between January 2009 and November 2018. Eligible blood cultures were those with no previous blood culture (regardless of result) within 7 days. The primary outcome was BSI. Four machine learning algorithms were used: elastic net, support vector machine and two implementations of gradient boosting machine (GBM and XGBoost). Model training and evaluation were performed using temporally disjoint training (60%), validation (20%) and test (20%) sets. The best model was compared to neutropenia alone in the test set. RESULTS: Of 11,183 eligible blood cultures, 624 (5.6%) were positive. The best model in the validation set was GBM, which achieved an area-under-the-receiver-operator-curve (AUROC) of 0.74 in the test set. Among the 2236 in the test set, the number of false positives and specificity of GBM vs. neutropenia were 508 vs. 592 and 0.76 vs. 0.72 respectively. Among 139 test set BSIs, six (4.3%) non-neutropenic patients were identified by GBM. All received antibiotics prior to culture result availability. CONCLUSIONS: We developed a machine learning algorithm to classify BSI. GBM achieved an AUROC of 0.74 and identified 4.3% additional true cases in the test set. The machine learning algorithm did not perform substantially better than using presence of neutropenia alone to predict BSI. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-020-07618-2.
format Online
Article
Text
id pubmed-7666525
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-76665252020-11-16 Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments Sung, Lillian Corbin, Conor Steinberg, Ethan Vettese, Emily Campigotto, Aaron Lecce, Loreto Tomlinson, George A. Shah, Nigam BMC Cancer Research Article BACKGROUND: Objectives were to build a machine learning algorithm to identify bloodstream infection (BSI) among pediatric patients with cancer and hematopoietic stem cell transplantation (HSCT) recipients, and to compare this approach with presence of neutropenia to identify BSI. METHODS: We included patients 0–18 years of age at cancer diagnosis or HSCT between January 2009 and November 2018. Eligible blood cultures were those with no previous blood culture (regardless of result) within 7 days. The primary outcome was BSI. Four machine learning algorithms were used: elastic net, support vector machine and two implementations of gradient boosting machine (GBM and XGBoost). Model training and evaluation were performed using temporally disjoint training (60%), validation (20%) and test (20%) sets. The best model was compared to neutropenia alone in the test set. RESULTS: Of 11,183 eligible blood cultures, 624 (5.6%) were positive. The best model in the validation set was GBM, which achieved an area-under-the-receiver-operator-curve (AUROC) of 0.74 in the test set. Among the 2236 in the test set, the number of false positives and specificity of GBM vs. neutropenia were 508 vs. 592 and 0.76 vs. 0.72 respectively. Among 139 test set BSIs, six (4.3%) non-neutropenic patients were identified by GBM. All received antibiotics prior to culture result availability. CONCLUSIONS: We developed a machine learning algorithm to classify BSI. GBM achieved an AUROC of 0.74 and identified 4.3% additional true cases in the test set. The machine learning algorithm did not perform substantially better than using presence of neutropenia alone to predict BSI. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12885-020-07618-2. BioMed Central 2020-11-13 /pmc/articles/PMC7666525/ /pubmed/33187484 http://dx.doi.org/10.1186/s12885-020-07618-2 Text en © The Author(s) 2020 Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Sung, Lillian
Corbin, Conor
Steinberg, Ethan
Vettese, Emily
Campigotto, Aaron
Lecce, Loreto
Tomlinson, George A.
Shah, Nigam
Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
title Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
title_full Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
title_fullStr Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
title_full_unstemmed Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
title_short Development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
title_sort development and utility assessment of a machine learning bloodstream infection classifier in pediatric patients receiving cancer treatments
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7666525/
https://www.ncbi.nlm.nih.gov/pubmed/33187484
http://dx.doi.org/10.1186/s12885-020-07618-2
work_keys_str_mv AT sunglillian developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT corbinconor developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT steinbergethan developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT vetteseemily developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT campigottoaaron developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT lecceloreto developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT tomlinsongeorgea developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments
AT shahnigam developmentandutilityassessmentofamachinelearningbloodstreaminfectionclassifierinpediatricpatientsreceivingcancertreatments