Cargando…

Use of machine learning to analyse routinely collected intensive care unit data: a systematic review

BACKGROUND: Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients’ journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in...

Descripción completa

Detalles Bibliográficos
Autores principales: Shillan, Duncan, Sterne, Jonathan A. C., Champneys, Alan, Gibbison, Ben
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704673/
https://www.ncbi.nlm.nih.gov/pubmed/31439010
http://dx.doi.org/10.1186/s13054-019-2564-9
_version_ 1783445549130711040
author Shillan, Duncan
Sterne, Jonathan A. C.
Champneys, Alan
Gibbison, Ben
author_facet Shillan, Duncan
Sterne, Jonathan A. C.
Champneys, Alan
Gibbison, Ben
author_sort Shillan, Duncan
collection PubMed
description BACKGROUND: Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients’ journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians. METHODS: Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted. RESULTS: Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108–4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000–10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]). CONCLUSIONS: The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13054-019-2564-9) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-6704673
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67046732019-08-22 Use of machine learning to analyse routinely collected intensive care unit data: a systematic review Shillan, Duncan Sterne, Jonathan A. C. Champneys, Alan Gibbison, Ben Crit Care Research BACKGROUND: Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients’ journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians. METHODS: Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted. RESULTS: Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108–4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000–10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]). CONCLUSIONS: The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13054-019-2564-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-08-22 /pmc/articles/PMC6704673/ /pubmed/31439010 http://dx.doi.org/10.1186/s13054-019-2564-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Shillan, Duncan
Sterne, Jonathan A. C.
Champneys, Alan
Gibbison, Ben
Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
title Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
title_full Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
title_fullStr Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
title_full_unstemmed Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
title_short Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
title_sort use of machine learning to analyse routinely collected intensive care unit data: a systematic review
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704673/
https://www.ncbi.nlm.nih.gov/pubmed/31439010
http://dx.doi.org/10.1186/s13054-019-2564-9
work_keys_str_mv AT shillanduncan useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview
AT sternejonathanac useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview
AT champneysalan useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview
AT gibbisonben useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview