Cargando…

A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection

Identifying rare but significant healthcare events in massive unstructured datasets has become a common task in healthcare data analytics. However, imbalanced class distribution in many practical datasets greatly hampers the detection of rare events, as most classification methods implicitly assume...

Descripción completa

Detalles Bibliográficos
Autores principales: Zhao, Yang, Wong, Zoie Shui-Yee, Tsui, Kwok Leung
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2018
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5987310/
https://www.ncbi.nlm.nih.gov/pubmed/29951182
http://dx.doi.org/10.1155/2018/6275435
_version_ 1783329086033100800
author Zhao, Yang
Wong, Zoie Shui-Yee
Tsui, Kwok Leung
author_facet Zhao, Yang
Wong, Zoie Shui-Yee
Tsui, Kwok Leung
author_sort Zhao, Yang
collection PubMed
description Identifying rare but significant healthcare events in massive unstructured datasets has become a common task in healthcare data analytics. However, imbalanced class distribution in many practical datasets greatly hampers the detection of rare events, as most classification methods implicitly assume an equal occurrence of classes and are designed to maximize the overall classification accuracy. In this study, we develop a framework for learning healthcare data with imbalanced distribution via incorporating different rebalancing strategies. The evaluation results showed that the developed framework can significantly improve the detection accuracy of medical incidents due to look-alike sound-alike (LASA) mix-ups. Specifically, logistic regression combined with the synthetic minority oversampling technique (SMOTE) produces the best detection results, with a significant 45.3% increase in recall (recall = 75.7%) compared with pure logistic regression (recall = 52.1%).
format Online
Article
Text
id pubmed-5987310
institution National Center for Biotechnology Information
language English
publishDate 2018
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-59873102018-06-27 A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection Zhao, Yang Wong, Zoie Shui-Yee Tsui, Kwok Leung J Healthc Eng Research Article Identifying rare but significant healthcare events in massive unstructured datasets has become a common task in healthcare data analytics. However, imbalanced class distribution in many practical datasets greatly hampers the detection of rare events, as most classification methods implicitly assume an equal occurrence of classes and are designed to maximize the overall classification accuracy. In this study, we develop a framework for learning healthcare data with imbalanced distribution via incorporating different rebalancing strategies. The evaluation results showed that the developed framework can significantly improve the detection accuracy of medical incidents due to look-alike sound-alike (LASA) mix-ups. Specifically, logistic regression combined with the synthetic minority oversampling technique (SMOTE) produces the best detection results, with a significant 45.3% increase in recall (recall = 75.7%) compared with pure logistic regression (recall = 52.1%). Hindawi 2018-05-22 /pmc/articles/PMC5987310/ /pubmed/29951182 http://dx.doi.org/10.1155/2018/6275435 Text en Copyright © 2018 Yang Zhao et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Zhao, Yang
Wong, Zoie Shui-Yee
Tsui, Kwok Leung
A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
title A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
title_full A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
title_fullStr A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
title_full_unstemmed A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
title_short A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
title_sort framework of rebalancing imbalanced healthcare data for rare events' classification: a case of look-alike sound-alike mix-up incident detection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5987310/
https://www.ncbi.nlm.nih.gov/pubmed/29951182
http://dx.doi.org/10.1155/2018/6275435
work_keys_str_mv AT zhaoyang aframeworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection
AT wongzoieshuiyee aframeworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection
AT tsuikwokleung aframeworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection
AT zhaoyang frameworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection
AT wongzoieshuiyee frameworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection
AT tsuikwokleung frameworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection