Cargando…
A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection
Identifying rare but significant healthcare events in massive unstructured datasets has become a common task in healthcare data analytics. However, imbalanced class distribution in many practical datasets greatly hampers the detection of rare events, as most classification methods implicitly assume...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5987310/ https://www.ncbi.nlm.nih.gov/pubmed/29951182 http://dx.doi.org/10.1155/2018/6275435 |
_version_ | 1783329086033100800 |
---|---|
author | Zhao, Yang Wong, Zoie Shui-Yee Tsui, Kwok Leung |
author_facet | Zhao, Yang Wong, Zoie Shui-Yee Tsui, Kwok Leung |
author_sort | Zhao, Yang |
collection | PubMed |
description | Identifying rare but significant healthcare events in massive unstructured datasets has become a common task in healthcare data analytics. However, imbalanced class distribution in many practical datasets greatly hampers the detection of rare events, as most classification methods implicitly assume an equal occurrence of classes and are designed to maximize the overall classification accuracy. In this study, we develop a framework for learning healthcare data with imbalanced distribution via incorporating different rebalancing strategies. The evaluation results showed that the developed framework can significantly improve the detection accuracy of medical incidents due to look-alike sound-alike (LASA) mix-ups. Specifically, logistic regression combined with the synthetic minority oversampling technique (SMOTE) produces the best detection results, with a significant 45.3% increase in recall (recall = 75.7%) compared with pure logistic regression (recall = 52.1%). |
format | Online Article Text |
id | pubmed-5987310 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-59873102018-06-27 A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection Zhao, Yang Wong, Zoie Shui-Yee Tsui, Kwok Leung J Healthc Eng Research Article Identifying rare but significant healthcare events in massive unstructured datasets has become a common task in healthcare data analytics. However, imbalanced class distribution in many practical datasets greatly hampers the detection of rare events, as most classification methods implicitly assume an equal occurrence of classes and are designed to maximize the overall classification accuracy. In this study, we develop a framework for learning healthcare data with imbalanced distribution via incorporating different rebalancing strategies. The evaluation results showed that the developed framework can significantly improve the detection accuracy of medical incidents due to look-alike sound-alike (LASA) mix-ups. Specifically, logistic regression combined with the synthetic minority oversampling technique (SMOTE) produces the best detection results, with a significant 45.3% increase in recall (recall = 75.7%) compared with pure logistic regression (recall = 52.1%). Hindawi 2018-05-22 /pmc/articles/PMC5987310/ /pubmed/29951182 http://dx.doi.org/10.1155/2018/6275435 Text en Copyright © 2018 Yang Zhao et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Zhao, Yang Wong, Zoie Shui-Yee Tsui, Kwok Leung A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection |
title | A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection |
title_full | A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection |
title_fullStr | A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection |
title_full_unstemmed | A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection |
title_short | A Framework of Rebalancing Imbalanced Healthcare Data for Rare Events' Classification: A Case of Look-Alike Sound-Alike Mix-Up Incident Detection |
title_sort | framework of rebalancing imbalanced healthcare data for rare events' classification: a case of look-alike sound-alike mix-up incident detection |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5987310/ https://www.ncbi.nlm.nih.gov/pubmed/29951182 http://dx.doi.org/10.1155/2018/6275435 |
work_keys_str_mv | AT zhaoyang aframeworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection AT wongzoieshuiyee aframeworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection AT tsuikwokleung aframeworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection AT zhaoyang frameworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection AT wongzoieshuiyee frameworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection AT tsuikwokleung frameworkofrebalancingimbalancedhealthcaredataforrareeventsclassificationacaseoflookalikesoundalikemixupincidentdetection |