Cargando…

Using multiclass classification to automate the identification of patient safety incident reports by type and severity

BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident repo...

Descripción completa

Detalles Bibliográficos
Autores principales: Wang, Ying, Coiera, Enrico, Runciman, William, Magrabi, Farah
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2017
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468980/
https://www.ncbi.nlm.nih.gov/pubmed/28606174
http://dx.doi.org/10.1186/s12911-017-0483-8
_version_ 1783243495966769152
author Wang, Ying
Coiera, Enrico
Runciman, William
Magrabi, Farah
author_facet Wang, Ying
Coiera, Enrico
Runciman, William
Magrabi, Farah
author_sort Wang, Ying
collection PubMed
description BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS: Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_(Type) = 2860, n_(SeverityLevel) = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_(Type) = 6000, n_(SeverityLevel) =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall. RESULTS: The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3). CONCLUSIONS: Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-017-0483-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-5468980
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-54689802017-06-14 Using multiclass classification to automate the identification of patient safety incident reports by type and severity Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah BMC Med Inform Decis Mak Research Article BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS: Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_(Type) = 2860, n_(SeverityLevel) = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_(Type) = 6000, n_(SeverityLevel) =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall. RESULTS: The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3). CONCLUSIONS: Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-017-0483-8) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-12 /pmc/articles/PMC5468980/ /pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Wang, Ying
Coiera, Enrico
Runciman, William
Magrabi, Farah
Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_full Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_fullStr Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_full_unstemmed Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_short Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_sort using multiclass classification to automate the identification of patient safety incident reports by type and severity
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468980/
https://www.ncbi.nlm.nih.gov/pubmed/28606174
http://dx.doi.org/10.1186/s12911-017-0483-8
work_keys_str_mv AT wangying usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity
AT coieraenrico usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity
AT runcimanwilliam usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity
AT magrabifarah usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity