Cargando…
Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming
One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient's electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient's heart. The problem is that the symptoms do not alway...
Autores principales: | , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Hindawi
2021
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8084659/ https://www.ncbi.nlm.nih.gov/pubmed/33968352 http://dx.doi.org/10.1155/2021/6624829 |
_version_ | 1783686199240556544 |
---|---|
author | Ilbeigipour, Sadegh Albadvi, Amir Akhondzadeh Noughabi, Elham |
author_facet | Ilbeigipour, Sadegh Albadvi, Amir Akhondzadeh Noughabi, Elham |
author_sort | Ilbeigipour, Sadegh |
collection | PubMed |
description | One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient's electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient's heart. The problem is that the symptoms do not always appear and the physician may be mistaken in the diagnosis. Therefore, patients need continuous monitoring through real-time ECG analysis to detect arrhythmias in a timely manner and prevent an eventual incident that threatens the patient's life. In this research, we used the Structured Streaming module built top on the open-source Apache Spark platform for the first time to implement a machine learning pipeline for real-time cardiac arrhythmias detection and evaluate the impact of using this new module on classification performance metrics and the rate of delay in arrhythmia detection. The ECG data collected from the MIT/BIH database for the detection of three class labels: normal beats, RBBB, and atrial fibrillation arrhythmias. We also developed three decision trees, random forest, and logistic regression multiclass classifiers for data classification where the random forest classifier showed better performance in classification than the other two classifiers. The results show previous results in performance metrics of the classification model and a significant decrease in pipeline runtime by using more class labels compared to previous studies. |
format | Online Article Text |
id | pubmed-8084659 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2021 |
publisher | Hindawi |
record_format | MEDLINE/PubMed |
spelling | pubmed-80846592021-05-06 Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming Ilbeigipour, Sadegh Albadvi, Amir Akhondzadeh Noughabi, Elham J Healthc Eng Research Article One of the major causes of death in the world is cardiac arrhythmias. In the field of healthcare, physicians use the patient's electrocardiogram (ECG) records to detect arrhythmias, which indicate the electrical activity of the patient's heart. The problem is that the symptoms do not always appear and the physician may be mistaken in the diagnosis. Therefore, patients need continuous monitoring through real-time ECG analysis to detect arrhythmias in a timely manner and prevent an eventual incident that threatens the patient's life. In this research, we used the Structured Streaming module built top on the open-source Apache Spark platform for the first time to implement a machine learning pipeline for real-time cardiac arrhythmias detection and evaluate the impact of using this new module on classification performance metrics and the rate of delay in arrhythmia detection. The ECG data collected from the MIT/BIH database for the detection of three class labels: normal beats, RBBB, and atrial fibrillation arrhythmias. We also developed three decision trees, random forest, and logistic regression multiclass classifiers for data classification where the random forest classifier showed better performance in classification than the other two classifiers. The results show previous results in performance metrics of the classification model and a significant decrease in pipeline runtime by using more class labels compared to previous studies. Hindawi 2021-04-22 /pmc/articles/PMC8084659/ /pubmed/33968352 http://dx.doi.org/10.1155/2021/6624829 Text en Copyright © 2021 Sadegh Ilbeigipour et al. https://creativecommons.org/licenses/by/4.0/This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Ilbeigipour, Sadegh Albadvi, Amir Akhondzadeh Noughabi, Elham Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming |
title | Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming |
title_full | Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming |
title_fullStr | Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming |
title_full_unstemmed | Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming |
title_short | Real-Time Heart Arrhythmia Detection Using Apache Spark Structured Streaming |
title_sort | real-time heart arrhythmia detection using apache spark structured streaming |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8084659/ https://www.ncbi.nlm.nih.gov/pubmed/33968352 http://dx.doi.org/10.1155/2021/6624829 |
work_keys_str_mv | AT ilbeigipoursadegh realtimeheartarrhythmiadetectionusingapachesparkstructuredstreaming AT albadviamir realtimeheartarrhythmiadetectionusingapachesparkstructuredstreaming AT akhondzadehnoughabielham realtimeheartarrhythmiadetectionusingapachesparkstructuredstreaming |