Cargando…

Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming

One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient's electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient's heart. The problem is that the symptoms do not alway...

Descripción completa

Detalles Bibliográficos
Autores principales: Ilbeigipour, Sadegh, Albadvi, Amir, Akhondzadeh Noughabi, Elham
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Hindawi 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8084659/
https://www.ncbi.nlm.nih.gov/pubmed/33968352
http://dx.doi.org/10.1155/2021/6624829
_version_ 1783686199240556544
author Ilbeigipour, Sadegh
Albadvi, Amir
Akhondzadeh Noughabi, Elham
author_facet Ilbeigipour, Sadegh
Albadvi, Amir
Akhondzadeh Noughabi, Elham
author_sort Ilbeigipour, Sadegh
collection PubMed
description One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient's electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient's heart. The problem is that the symptoms do not always appear and the physician may be mistaken in the diagnosis. Therefore, patients need continuous monitoring through real-time ECG analysis to detect arrhythmias in a timely manner and prevent an eventual incident that threatens the patient's life. In this research, we used the Structured Streaming module built top on the open-source Apache Spark platform for the first time to implement a machine learning pipeline for real-time cardiac arrhythmias detection and evaluate the impact of using this new module on classification performance metrics and the rate of delay in arrhythmia detection. The ECG data collected from the MIT/BIH database for the detection of three class labels: normal beats, RBBB, and atrial fibrillation arrhythmias. We also developed three decision trees, random forest, and logistic regression multiclass classifiers for data classification where the random forest classifier showed better performance in classification than the other two classifiers. The results show previous results in performance metrics of the classification model and a significant decrease in pipeline runtime by using more class labels compared to previous studies.
format Online
Article
Text
id pubmed-8084659
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher Hindawi
record_format MEDLINE/PubMed
spelling pubmed-80846592021-05-06 Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming Ilbeigipour, Sadegh Albadvi, Amir Akhondzadeh Noughabi, Elham J Healthc Eng Research Article One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient's electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient's heart. The problem is that the symptoms do not always appear and the physician may be mistaken in the diagnosis. Therefore, patients need continuous monitoring through real-time ECG analysis to detect arrhythmias in a timely manner and prevent an eventual incident that threatens the patient's life. In this research, we used the Structured Streaming module built top on the open-source Apache Spark platform for the first time to implement a machine learning pipeline for real-time cardiac arrhythmias detection and evaluate the impact of using this new module on classification performance metrics and the rate of delay in arrhythmia detection. The ECG data collected from the MIT/BIH database for the detection of three class labels: normal beats, RBBB, and atrial fibrillation arrhythmias. We also developed three decision trees, random forest, and logistic regression multiclass classifiers for data classification where the random forest classifier showed better performance in classification than the other two classifiers. The results show previous results in performance metrics of the classification model and a significant decrease in pipeline runtime by using more class labels compared to previous studies. Hindawi 2021-04-22 /pmc/articles/PMC8084659/ /pubmed/33968352 http://dx.doi.org/10.1155/2021/6624829 Text en Copyright © 2021 Sadegh Ilbeigipour et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Ilbeigipour, Sadegh
Albadvi, Amir
Akhondzadeh Noughabi, Elham
Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
title Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
title_full Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
title_fullStr Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
title_full_unstemmed Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
title_short Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
title_sort real-time heart arrhythmia detection using apache spark structured streaming
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8084659/
https://www.ncbi.nlm.nih.gov/pubmed/33968352
http://dx.doi.org/10.1155/2021/6624829
work_keys_str_mv AT ilbeigipoursadegh realtimeheartarrhythmiadetectionusingapachesparkstructuredstreaming
AT albadviamir realtimeheartarrhythmiadetectionusingapachesparkstructuredstreaming
AT akhondzadehnoughabielham realtimeheartarrhythmiadetectionusingapachesparkstructuredstreaming