Cargando…

Using multiclass classification to automate the identification of patient safety incident reports by type and severity

BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident repo...

Descripción completa

Detalles Bibliográficos
Autores principales:	Wang, Ying, Coiera, Enrico, Runciman, William, Magrabi, Farah
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	BioMed Central 2017
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468980/ https://www.ncbi.nlm.nih.gov/pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8

_version_	1783243495966769152
author	Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah
author_facet	Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah
author_sort	Wang, Ying
collection	PubMed
description	BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS: Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_(Type) = 2860, n_(SeverityLevel) = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_(Type) = 6000, n_(SeverityLevel) =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall. RESULTS: The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3). CONCLUSIONS: Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-017-0483-8) contains supplementary material, which is available to authorized users.
format	Online Article Text
id	pubmed-5468980
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	BioMed Central
record_format	MEDLINE/PubMed
spelling	pubmed-54689802017-06-14 Using multiclass classification to automate the identification of patient safety incident reports by type and severity Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah BMC Med Inform Decis Mak Research Article BACKGROUND: Approximately 10% of admissions to acute-care hospitals are associated with an adverse event. Analysis of incident reports helps to understand how and why incidents occur and can inform policy and practice for safer care. Unfortunately our capacity to monitor and respond to incident reports in a timely manner is limited by the sheer volumes of data collected. In this study, we aim to evaluate the feasibility of using multiclass classification to automate the identification of patient safety incidents in hospitals. METHODS: Text based classifiers were applied to identify 10 incident types and 4 severity levels. Using the one-versus-one (OvsO) and one-versus-all (OvsA) ensemble strategies, we evaluated regularized logistic regression, linear support vector machine (SVM) and SVM with a radial-basis function (RBF) kernel. Classifiers were trained and tested with “balanced” datasets (n_(Type) = 2860, n_(SeverityLevel) = 1160) from a state-wide incident reporting system. Testing was also undertaken with imbalanced “stratified” datasets (n_(Type) = 6000, n_(SeverityLevel) =5950) from the state-wide system and an independent hospital reporting system. Classifier performance was evaluated using a confusion matrix, as well as F-score, precision and recall. RESULTS: The most effective combination was a OvsO ensemble of binary SVM RBF classifiers with binary count feature extraction. For incident type, classifiers performed well on balanced and stratified datasets (F-score: 78.3, 73.9%), but were worse on independent datasets (68.5%). Reports about falls, medications, pressure injury, aggression and blood products were identified with high recall and precision. “Documentation” was the hardest type to identify. For severity level, F-score for severity assessment code (SAC) 1 (extreme risk) was 87.3 and 64% for SAC4 (low risk) on balanced data. With stratified data, high recall was achieved for SAC1 (82.8–84%) but precision was poor (6.8–11.2%). High risk incidents (SAC2) were confused with medium risk incidents (SAC3). CONCLUSIONS: Binary classifier ensembles appear to be a feasible method for identifying incidents by type and severity level. Automated identification should enable safety problems to be detected and addressed in a more timely manner. Multi-label classifiers may be necessary for reports that relate to more than one incident type. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/s12911-017-0483-8) contains supplementary material, which is available to authorized users. BioMed Central 2017-06-12 /pmc/articles/PMC5468980/ /pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8 Text en © The Author(s). 2017 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle	Research Article Wang, Ying Coiera, Enrico Runciman, William Magrabi, Farah Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title	Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_full	Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_fullStr	Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_full_unstemmed	Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_short	Using multiclass classification to automate the identification of patient safety incident reports by type and severity
title_sort	using multiclass classification to automate the identification of patient safety incident reports by type and severity
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5468980/ https://www.ncbi.nlm.nih.gov/pubmed/28606174 http://dx.doi.org/10.1186/s12911-017-0483-8
work_keys_str_mv	AT wangying usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity AT coieraenrico usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity AT runcimanwilliam usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity AT magrabifarah usingmulticlassclassificationtoautomatetheidentificationofpatientsafetyincidentreportsbytypeandseverity

Using multiclass classification to automate the identification of patient safety incident reports by type and severity

Ejemplares similares