Cargando…

Combining Optical Character Recognition With Paper ECG Digitization

Objective: We propose a MATLAB-based tool to convert electrocardiography (ECG) waveforms from paper-based ECG records into digitized ECG signals that is vendor-agnostic. The tool is packaged as an open source standalone graphical user interface (GUI) based application. Methods and procedures: To rea...

Descripción completa

Detalles Bibliográficos
Formato: Online Artículo Texto
Lenguaje:English
Publicado: IEEE 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248903/
https://www.ncbi.nlm.nih.gov/pubmed/34235006
http://dx.doi.org/10.1109/JTEHM.2021.3083482
_version_ 1783716813243154432
collection PubMed
description Objective: We propose a MATLAB-based tool to convert electrocardiography (ECG) waveforms from paper-based ECG records into digitized ECG signals that is vendor-agnostic. The tool is packaged as an open source standalone graphical user interface (GUI) based application. Methods and procedures: To reach this objective we: (1) preprocess the ECG records, which includes skew correction, background grid removal and linear filtering; (2) segment ECG signals using Connected Components Analysis (CCA); (3) implement Optical Character Recognition (OCR) for removal of overlapping ECG lead characters and for interfacing of patients’ demographic information with their research records or their electronic medical record (EMR). The ECG digitization results are validated through a reader study where clinically salient features, such as intervals of QRST complex, between the paper ECG records and the digitized ECG records are compared. Results: Comparison of clinically important features between the paper-based ECG records and the digitized ECG signals, reveals intra- and inter-observer correlations of 0.86–0.99 and 0.79–0.94, respectively. The kappa statistic was found to average at 0.86 and 0.72 for intra- and inter-observer correlations, respectively. Conclusion: The clinically salient features of the ECG waveforms such as the intervals of QRST complex, are preserved during the digitization procedure. Clinical and Healthcare Impact: This open-source digitization tool can be used as a research resource to digitize paper ECG records thereby enabling development of new prediction algorithms to risk stratify individuals with cardiovascular disease, and/or allow for development of ECG-based cardiovascular diagnoses relying upon automated digital algorithms.
format Online
Article
Text
id pubmed-8248903
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher IEEE
record_format MEDLINE/PubMed
spelling pubmed-82489032021-07-06 Combining Optical Character Recognition With Paper ECG Digitization IEEE J Transl Eng Health Med Article Objective: We propose a MATLAB-based tool to convert electrocardiography (ECG) waveforms from paper-based ECG records into digitized ECG signals that is vendor-agnostic. The tool is packaged as an open source standalone graphical user interface (GUI) based application. Methods and procedures: To reach this objective we: (1) preprocess the ECG records, which includes skew correction, background grid removal and linear filtering; (2) segment ECG signals using Connected Components Analysis (CCA); (3) implement Optical Character Recognition (OCR) for removal of overlapping ECG lead characters and for interfacing of patients’ demographic information with their research records or their electronic medical record (EMR). The ECG digitization results are validated through a reader study where clinically salient features, such as intervals of QRST complex, between the paper ECG records and the digitized ECG records are compared. Results: Comparison of clinically important features between the paper-based ECG records and the digitized ECG signals, reveals intra- and inter-observer correlations of 0.86–0.99 and 0.79–0.94, respectively. The kappa statistic was found to average at 0.86 and 0.72 for intra- and inter-observer correlations, respectively. Conclusion: The clinically salient features of the ECG waveforms such as the intervals of QRST complex, are preserved during the digitization procedure. Clinical and Healthcare Impact: This open-source digitization tool can be used as a research resource to digitize paper ECG records thereby enabling development of new prediction algorithms to risk stratify individuals with cardiovascular disease, and/or allow for development of ECG-based cardiovascular diagnoses relying upon automated digital algorithms. IEEE 2021-05-25 /pmc/articles/PMC8248903/ /pubmed/34235006 http://dx.doi.org/10.1109/JTEHM.2021.3083482 Text en https://creativecommons.org/licenses/by/4.0/This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Combining Optical Character Recognition With Paper ECG Digitization
title Combining Optical Character Recognition With Paper ECG Digitization
title_full Combining Optical Character Recognition With Paper ECG Digitization
title_fullStr Combining Optical Character Recognition With Paper ECG Digitization
title_full_unstemmed Combining Optical Character Recognition With Paper ECG Digitization
title_short Combining Optical Character Recognition With Paper ECG Digitization
title_sort combining optical character recognition with paper ecg digitization
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8248903/
https://www.ncbi.nlm.nih.gov/pubmed/34235006
http://dx.doi.org/10.1109/JTEHM.2021.3083482
work_keys_str_mv AT combiningopticalcharacterrecognitionwithpaperecgdigitization
AT combiningopticalcharacterrecognitionwithpaperecgdigitization
AT combiningopticalcharacterrecognitionwithpaperecgdigitization
AT combiningopticalcharacterrecognitionwithpaperecgdigitization
AT combiningopticalcharacterrecognitionwithpaperecgdigitization
AT combiningopticalcharacterrecognitionwithpaperecgdigitization