Cargando…

Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers

BACKGROUND: The bacteraemia prediction is relevant because sepsis is one of the most important causes of morbidity and mortality. Bacteraemia prognosis primarily depends on a rapid diagnosis. The bacteraemia prediction would shorten up to 6 days the diagnosis, and, in conjunction with individual pat...

Descripción completa

Detalles Bibliográficos
Autores principales: Garnica, Oscar, Gómez, Diego, Ramos, Víctor, Hidalgo, J. Ignacio, Ruiz-Giardín, José M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Springer International Publishing 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8405861/
https://www.ncbi.nlm.nih.gov/pubmed/34484472
http://dx.doi.org/10.1007/s13167-021-00252-3
_version_ 1783746403740155904
author Garnica, Oscar
Gómez, Diego
Ramos, Víctor
Hidalgo, J. Ignacio
Ruiz-Giardín, José M.
author_facet Garnica, Oscar
Gómez, Diego
Ramos, Víctor
Hidalgo, J. Ignacio
Ruiz-Giardín, José M.
author_sort Garnica, Oscar
collection PubMed
description BACKGROUND: The bacteraemia prediction is relevant because sepsis is one of the most important causes of morbidity and mortality. Bacteraemia prognosis primarily depends on a rapid diagnosis. The bacteraemia prediction would shorten up to 6 days the diagnosis, and, in conjunction with individual patient variables, should be considered to start the early administration of personalised antibiotic treatment and medical services, the election of specific diagnostic techniques and the determination of additional treatments, such as surgery, that would prevent subsequent complications. Machine learning techniques could help physicians make these informed decisions by predicting bacteraemia using the data already available in electronic hospital records. OBJECTIVE: This study presents the application of machine learning techniques to these records to predict the blood culture’s outcome, which would reduce the lag in starting a personalised antibiotic treatment and the medical costs associated with erroneous treatments due to conservative assumptions about blood culture outcomes. METHODS: Six supervised classifiers were created using three machine learning techniques, Support Vector Machine, Random Forest and K-Nearest Neighbours, on the electronic health records of hospital patients. The best approach to handle missing data was chosen and, for each machine learning technique, two classification models were created: the first uses the features known at the time of blood extraction, whereas the second uses four extra features revealed during the blood culture. RESULTS: The six classifiers were trained and tested using a dataset of 4357 patients with 117 features per patient. The models obtain predictions that, for the best case, are up to a state-of-the-art accuracy of 85.9%, a sensitivity of 87.4% and an AUC of 0.93. CONCLUSIONS: Our results provide cutting-edge metrics of interest in predictive medical models with values that exceed the medical practice threshold and previous results in the literature using classical modelling techniques in specific types of bacteraemia. Additionally, the consistency of results is reasserted because the three classifiers’ importance ranking shows similar features that coincide with those that physicians use in their manual heuristics. Therefore, the efficacy of these machine learning techniques confirms their viability to assist in the aims of predictive and personalised medicine once the disease presents bacteraemia-compatible symptoms and to assist in improving the healthcare economy.
format Online
Article
Text
id pubmed-8405861
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Springer International Publishing
record_format MEDLINE/PubMed
spelling pubmed-84058612021-08-31 Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers Garnica, Oscar Gómez, Diego Ramos, Víctor Hidalgo, J. Ignacio Ruiz-Giardín, José M. EPMA J Research BACKGROUND: The bacteraemia prediction is relevant because sepsis is one of the most important causes of morbidity and mortality. Bacteraemia prognosis primarily depends on a rapid diagnosis. The bacteraemia prediction would shorten up to 6 days the diagnosis, and, in conjunction with individual patient variables, should be considered to start the early administration of personalised antibiotic treatment and medical services, the election of specific diagnostic techniques and the determination of additional treatments, such as surgery, that would prevent subsequent complications. Machine learning techniques could help physicians make these informed decisions by predicting bacteraemia using the data already available in electronic hospital records. OBJECTIVE: This study presents the application of machine learning techniques to these records to predict the blood culture’s outcome, which would reduce the lag in starting a personalised antibiotic treatment and the medical costs associated with erroneous treatments due to conservative assumptions about blood culture outcomes. METHODS: Six supervised classifiers were created using three machine learning techniques, Support Vector Machine, Random Forest and K-Nearest Neighbours, on the electronic health records of hospital patients. The best approach to handle missing data was chosen and, for each machine learning technique, two classification models were created: the first uses the features known at the time of blood extraction, whereas the second uses four extra features revealed during the blood culture. RESULTS: The six classifiers were trained and tested using a dataset of 4357 patients with 117 features per patient. The models obtain predictions that, for the best case, are up to a state-of-the-art accuracy of 85.9%, a sensitivity of 87.4% and an AUC of 0.93. CONCLUSIONS: Our results provide cutting-edge metrics of interest in predictive medical models with values that exceed the medical practice threshold and previous results in the literature using classical modelling techniques in specific types of bacteraemia. Additionally, the consistency of results is reasserted because the three classifiers’ importance ranking shows similar features that coincide with those that physicians use in their manual heuristics. Therefore, the efficacy of these machine learning techniques confirms their viability to assist in the aims of predictive and personalised medicine once the disease presents bacteraemia-compatible symptoms and to assist in improving the healthcare economy. Springer International Publishing 2021-08-31 /pmc/articles/PMC8405861/ /pubmed/34484472 http://dx.doi.org/10.1007/s13167-021-00252-3 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) .
spellingShingle Research
Garnica, Oscar
Gómez, Diego
Ramos, Víctor
Hidalgo, J. Ignacio
Ruiz-Giardín, José M.
Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
title Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
title_full Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
title_fullStr Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
title_full_unstemmed Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
title_short Diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
title_sort diagnosing hospital bacteraemia in the framework of predictive, preventive and personalised medicine using electronic health records and machine learning classifiers
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8405861/
https://www.ncbi.nlm.nih.gov/pubmed/34484472
http://dx.doi.org/10.1007/s13167-021-00252-3
work_keys_str_mv AT garnicaoscar diagnosinghospitalbacteraemiaintheframeworkofpredictivepreventiveandpersonalisedmedicineusingelectronichealthrecordsandmachinelearningclassifiers
AT gomezdiego diagnosinghospitalbacteraemiaintheframeworkofpredictivepreventiveandpersonalisedmedicineusingelectronichealthrecordsandmachinelearningclassifiers
AT ramosvictor diagnosinghospitalbacteraemiaintheframeworkofpredictivepreventiveandpersonalisedmedicineusingelectronichealthrecordsandmachinelearningclassifiers
AT hidalgojignacio diagnosinghospitalbacteraemiaintheframeworkofpredictivepreventiveandpersonalisedmedicineusingelectronichealthrecordsandmachinelearningclassifiers
AT ruizgiardinjosem diagnosinghospitalbacteraemiaintheframeworkofpredictivepreventiveandpersonalisedmedicineusingelectronichealthrecordsandmachinelearningclassifiers