Cargando…

AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results

FUNDING ACKNOWLEDGEMENTS: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): Ministry of Science and Higher Education BACKGROUND: Retrospective and observational analyses are an important part of cardiovascular research. The adoption of Electronic Health Records...

Descripción completa

Detalles Bibliográficos
Autores principales: Maciejewski, C M, Ozieranski, K O, Adam Barwiolek, A B, Basza, M B, Bozym, A B, Ciurla, M C, Krajsman, M J K, Maciejewska, M M, Lodzinski, P L, Opolski, G O, Grabowski, M G, Cacko, A C, Balsam, P B
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Oxford University Press 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10206650/
http://dx.doi.org/10.1093/europace/euad122.546
_version_ 1785046277282594816
author Maciejewski, C M
Ozieranski, K O
Adam Barwiolek, A B
Basza, M B
Bozym, A B
Ciurla, M C
Krajsman, M J K
Maciejewska, M M
Lodzinski, P L
Opolski, G O
Grabowski, M G
Cacko, A C
Balsam, P B
author_facet Maciejewski, C M
Ozieranski, K O
Adam Barwiolek, A B
Basza, M B
Bozym, A B
Ciurla, M C
Krajsman, M J K
Maciejewska, M M
Lodzinski, P L
Opolski, G O
Grabowski, M G
Cacko, A C
Balsam, P B
author_sort Maciejewski, C M
collection PubMed
description FUNDING ACKNOWLEDGEMENTS: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): Ministry of Science and Higher Education BACKGROUND: Retrospective and observational analyses are an important part of cardiovascular research. The adoption of Electronic Health Records (EHR) improved the availability of medical documentation for research purposes, but rapid data collection is hampered by the predominantly unstructured nature of EHR. A significant proportion of data is textual information that cannot be utilized for research purposes until is manually coded into a database by a healthcare professional through manual chart review- this conventional method is both cost and time- consuming. Text mining could be utilized to accelerate the process, but little is known about attainable data accuracy and estimation of time gain. PURPOSE: We developed a comprehensive text-processing tool for the cardiology field. The algorithm employs advanced text processing based on a specifically designed, vast database of medical terminology, drug lists and echocardiography parameters with a data structure tailored to the needs of cardiovascular research conduction. The algorithm can automatically analyze three types of textual data which are universal parts of discharge summary in Poland: (1) descriptive list of medical diagnoses; (2) discharge recommendations; (3) echocardiography report (if performed). A set of discharge summaries was analyzed with both the conventional method and the algorithm to demonstrate the process of acquisition of basic characteristics of the cohort of patients with atrial fibrillation/flutter (AF/AFl). METHODS: Discharge summaries of 394 patients hospitalized at one cardiology department were analyzed (1) automatically and (2) manually coded into the database by a healthcare professional utilizing a proprietary developed annotation tool to accelerate the annotation process, minimize errors and calculate total effective data acquisition time. RESULTS: The time of manual and automatic data analysis was 19:11 and 0:15 hours respectively. Of the total 394 patients, 319 (81%) had AF/AFl according to both manual identification and the algorithm (full agreement). The characteristics of the study group obtained with manual and automatic method is presented in Table. There was high agreement in the detection of diseases, presence of drug groups and echocardiographic parameters; some differences between the two classifications were noted but did not reach statistical significance. The median CHA2DS2VASc score calculated by the algorithm was 4 (IQR 3-5) versus 4 (IQR 3-5) for the manual method (p=0.92) and HAS-BLED score by the algorithm was 1 (IQR 1-2) versus 1 (IQR 1-2) for the manual method (p=0.98). CONCLUSIONS: The utilization of the algorithm greatly reduced the time required for basic characteristics of the study group acquisition without significantly compromising the quality of the data. Automatic detection of retrospective study cohorts through the application of text processing techniques from electronic health records in Polish is promising and feasible. [Figure: see text] [Figure: see text]
format Online
Article
Text
id pubmed-10206650
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Oxford University Press
record_format MEDLINE/PubMed
spelling pubmed-102066502023-05-25 AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results Maciejewski, C M Ozieranski, K O Adam Barwiolek, A B Basza, M B Bozym, A B Ciurla, M C Krajsman, M J K Maciejewska, M M Lodzinski, P L Opolski, G O Grabowski, M G Cacko, A C Balsam, P B Europace 38.6 - Hospital Information Systems, Electronic Medical Records, Clinical Decision Support FUNDING ACKNOWLEDGEMENTS: Type of funding sources: Public grant(s) – National budget only. Main funding source(s): Ministry of Science and Higher Education BACKGROUND: Retrospective and observational analyses are an important part of cardiovascular research. The adoption of Electronic Health Records (EHR) improved the availability of medical documentation for research purposes, but rapid data collection is hampered by the predominantly unstructured nature of EHR. A significant proportion of data is textual information that cannot be utilized for research purposes until is manually coded into a database by a healthcare professional through manual chart review- this conventional method is both cost and time- consuming. Text mining could be utilized to accelerate the process, but little is known about attainable data accuracy and estimation of time gain. PURPOSE: We developed a comprehensive text-processing tool for the cardiology field. The algorithm employs advanced text processing based on a specifically designed, vast database of medical terminology, drug lists and echocardiography parameters with a data structure tailored to the needs of cardiovascular research conduction. The algorithm can automatically analyze three types of textual data which are universal parts of discharge summary in Poland: (1) descriptive list of medical diagnoses; (2) discharge recommendations; (3) echocardiography report (if performed). A set of discharge summaries was analyzed with both the conventional method and the algorithm to demonstrate the process of acquisition of basic characteristics of the cohort of patients with atrial fibrillation/flutter (AF/AFl). METHODS: Discharge summaries of 394 patients hospitalized at one cardiology department were analyzed (1) automatically and (2) manually coded into the database by a healthcare professional utilizing a proprietary developed annotation tool to accelerate the annotation process, minimize errors and calculate total effective data acquisition time. RESULTS: The time of manual and automatic data analysis was 19:11 and 0:15 hours respectively. Of the total 394 patients, 319 (81%) had AF/AFl according to both manual identification and the algorithm (full agreement). The characteristics of the study group obtained with manual and automatic method is presented in Table. There was high agreement in the detection of diseases, presence of drug groups and echocardiographic parameters; some differences between the two classifications were noted but did not reach statistical significance. The median CHA2DS2VASc score calculated by the algorithm was 4 (IQR 3-5) versus 4 (IQR 3-5) for the manual method (p=0.92) and HAS-BLED score by the algorithm was 1 (IQR 1-2) versus 1 (IQR 1-2) for the manual method (p=0.98). CONCLUSIONS: The utilization of the algorithm greatly reduced the time required for basic characteristics of the study group acquisition without significantly compromising the quality of the data. Automatic detection of retrospective study cohorts through the application of text processing techniques from electronic health records in Polish is promising and feasible. [Figure: see text] [Figure: see text] Oxford University Press 2023-05-24 /pmc/articles/PMC10206650/ http://dx.doi.org/10.1093/europace/euad122.546 Text en © The Author(s) 2023. Published by Oxford University Press on behalf of the European Society of Cardiology. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial-NoDerivs licence (https://creativecommons.org/licenses/by-nc-nd/4.0/), which permits non-commercial reproduction and distribution of the work, in any medium, provided the original work is not altered or transformed in any way, and that the work is properly cited. For commercial re-use, please contact journals.permissions@oup.com
spellingShingle 38.6 - Hospital Information Systems, Electronic Medical Records, Clinical Decision Support
Maciejewski, C M
Ozieranski, K O
Adam Barwiolek, A B
Basza, M B
Bozym, A B
Ciurla, M C
Krajsman, M J K
Maciejewska, M M
Lodzinski, P L
Opolski, G O
Grabowski, M G
Cacko, A C
Balsam, P B
AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
title AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
title_full AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
title_fullStr AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
title_full_unstemmed AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
title_short AssistMED project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
title_sort assistmed project - a tool for rapid data extraction from textual electronic health records for cardiovascular research- system design and preliminary results
topic 38.6 - Hospital Information Systems, Electronic Medical Records, Clinical Decision Support
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10206650/
http://dx.doi.org/10.1093/europace/euad122.546
work_keys_str_mv AT maciejewskicm assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT ozieranskiko assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT adambarwiolekab assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT baszamb assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT bozymab assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT ciurlamc assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT krajsmanmjk assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT maciejewskamm assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT lodzinskipl assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT opolskigo assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT grabowskimg assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT cackoac assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults
AT balsampb assistmedprojectatoolforrapiddataextractionfromtextualelectronichealthrecordsforcardiovascularresearchsystemdesignandpreliminaryresults