Cargando…
Use of machine learning to analyse routinely collected intensive care unit data: a systematic review
BACKGROUND: Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients’ journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2019
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704673/ https://www.ncbi.nlm.nih.gov/pubmed/31439010 http://dx.doi.org/10.1186/s13054-019-2564-9 |
_version_ | 1783445549130711040 |
---|---|
author | Shillan, Duncan Sterne, Jonathan A. C. Champneys, Alan Gibbison, Ben |
author_facet | Shillan, Duncan Sterne, Jonathan A. C. Champneys, Alan Gibbison, Ben |
author_sort | Shillan, Duncan |
collection | PubMed |
description | BACKGROUND: Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients’ journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians. METHODS: Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted. RESULTS: Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108–4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000–10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]). CONCLUSIONS: The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13054-019-2564-9) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-6704673 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2019 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-67046732019-08-22 Use of machine learning to analyse routinely collected intensive care unit data: a systematic review Shillan, Duncan Sterne, Jonathan A. C. Champneys, Alan Gibbison, Ben Crit Care Research BACKGROUND: Intensive care units (ICUs) face financial, bed management, and staffing constraints. Detailed data covering all aspects of patients’ journeys into and through intensive care are now collected and stored in electronic health records: machine learning has been used to analyse such data in order to provide decision support to clinicians. METHODS: Systematic review of the applications of machine learning to routinely collected ICU data. Web of Science and MEDLINE databases were searched to identify candidate articles: those on image processing were excluded. The study aim, the type of machine learning used, the size of dataset analysed, whether and how the model was validated, and measures of predictive accuracy were extracted. RESULTS: Of 2450 papers identified, 258 fulfilled eligibility criteria. The most common study aims were predicting complications (77 papers [29.8% of studies]), predicting mortality (70 [27.1%]), improving prognostic models (43 [16.7%]), and classifying sub-populations (29 [11.2%]). Median sample size was 488 (IQR 108–4099): 41 studies analysed data on > 10,000 patients. Analyses focused on 169 (65.5%) papers that used machine learning to predict complications, mortality, length of stay, or improvement of health. Predictions were validated in 161 (95.2%) of these studies: the area under the ROC curve (AUC) was reported by 97 (60.2%) but only 10 (6.2%) validated predictions using independent data. The median AUC was 0.83 in studies of 1000–10,000 patients, rising to 0.94 in studies of > 100,000 patients. The most common machine learning methods were neural networks (72 studies [42.6%]), support vector machines (40 [23.7%]), and classification/decision trees (34 [20.1%]). Since 2015 (125 studies [48.4%]), the most common methods were support vector machines (37 studies [29.6%]) and random forests (29 [23.2%]). CONCLUSIONS: The rate of publication of studies using machine learning to analyse routinely collected ICU data is increasing rapidly. The sample sizes used in many published studies are too small to exploit the potential of these methods. Methodological and reporting guidelines are needed, particularly with regard to the choice of method and validation of predictions, to increase confidence in reported findings and aid in translating findings towards routine use in clinical practice. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (10.1186/s13054-019-2564-9) contains supplementary material, which is available to authorized users. BioMed Central 2019-08-22 /pmc/articles/PMC6704673/ /pubmed/31439010 http://dx.doi.org/10.1186/s13054-019-2564-9 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Shillan, Duncan Sterne, Jonathan A. C. Champneys, Alan Gibbison, Ben Use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
title | Use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
title_full | Use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
title_fullStr | Use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
title_full_unstemmed | Use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
title_short | Use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
title_sort | use of machine learning to analyse routinely collected intensive care unit data: a systematic review |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6704673/ https://www.ncbi.nlm.nih.gov/pubmed/31439010 http://dx.doi.org/10.1186/s13054-019-2564-9 |
work_keys_str_mv | AT shillanduncan useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview AT sternejonathanac useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview AT champneysalan useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview AT gibbisonben useofmachinelearningtoanalyseroutinelycollectedintensivecareunitdataasystematicreview |