Cargando…
Using multiclass classification to automate the identification of patient safety incident reports by type and severity
BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident repo...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468980/ https://www.ncbi.nlm.nih.gov/pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8 |
_version_ | 1783243495966769152 |
---|---|
author | Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah |
author_facet | Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah |
author_sort | Wang, Ying |
collection | PubMed |
description | BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS: Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_(Type) = 2860, n_(SeverityLevel) = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_(Type) = 6000, n_(SeverityLevel) =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall. RESULTS: The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3). CONCLUSIONS: Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-017-0483-8) contains supplementary material, which is available to authorized users. |
format | Online Article Text |
id | pubmed-5468980 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-54689802017-06-14 Using multiclass classification to automate the identification of patient safety incident reports by type and severity Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah BMC Med Inform Decis Mak Research Article BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS: Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_(Type) = 2860, n_(SeverityLevel) = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_(Type) = 6000, n_(SeverityLevel) =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall. RESULTS: The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3). CONCLUSIONS: Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-017-0483-8) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-12 /pmc/articles/PMC5468980/ /pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah Using multiclass classification to automate the identification of patient safety incident reports by type and severity |
title | Using multiclass classification to automate the identification of patient safety incident reports by type and severity |
title_full | Using multiclass classification to automate the identification of patient safety incident reports by type and severity |
title_fullStr | Using multiclass classification to automate the identification of patient safety incident reports by type and severity |
title_full_unstemmed | Using multiclass classification to automate the identification of patient safety incident reports by type and severity |
title_short | Using multiclass classification to automate the identification of patient safety incident reports by type and severity |
title_sort | using multiclass classification to automate the identification of patient safety incident reports by type and severity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468980/ https://www.ncbi.nlm.nih.gov/pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8 |
work_keys_str_mv | AT wangying usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity AT coieraenrico usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity AT runcimanwilliam usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity AT magrabifarah usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity |