Cargando…

Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials

BACKGROUND: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized...

Descripción completa

Detalles Bibliográficos
Autores principales: Joseph, Nora, Lindblad, Ida, Zaker, Sara, Elfversson, Sharareh, Albinzon, Maria, Ødegård, Øyvind, Hantler, Li, Hellström, Per M.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Open Academia 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8809051/
https://www.ncbi.nlm.nih.gov/pubmed/35173908
http://dx.doi.org/10.48101/ujms.v127.8260
_version_ 1784643942139035648
author Joseph, Nora
Lindblad, Ida
Zaker, Sara
Elfversson, Sharareh
Albinzon, Maria
Ødegård, Øyvind
Hantler, Li
Hellström, Per M.
author_facet Joseph, Nora
Lindblad, Ida
Zaker, Sara
Elfversson, Sharareh
Albinzon, Maria
Ødegård, Øyvind
Hantler, Li
Hellström, Per M.
author_sort Joseph, Nora
collection PubMed
description BACKGROUND: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP). METHODS: The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data. RESULTS: From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6–101.1% over the years with 96.1% weighted average sensitivity. CONCLUSION: Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials.
format Online
Article
Text
id pubmed-8809051
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Open Academia
record_format MEDLINE/PubMed
spelling pubmed-88090512022-02-15 Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials Joseph, Nora Lindblad, Ida Zaker, Sara Elfversson, Sharareh Albinzon, Maria Ødegård, Øyvind Hantler, Li Hellström, Per M. Ups J Med Sci Original Article BACKGROUND: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP). METHODS: The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data. RESULTS: From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6–101.1% over the years with 96.1% weighted average sensitivity. CONCLUSION: Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials. Open Academia 2022-01-27 /pmc/articles/PMC8809051/ /pubmed/35173908 http://dx.doi.org/10.48101/ujms.v127.8260 Text en © 2022 The Author(s). Published by Upsala Medical Society. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Original Article
Joseph, Nora
Lindblad, Ida
Zaker, Sara
Elfversson, Sharareh
Albinzon, Maria
Ødegård, Øyvind
Hantler, Li
Hellström, Per M.
Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
title Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
title_full Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
title_fullStr Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
title_full_unstemmed Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
title_short Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
title_sort automated data extraction of electronic medical records: validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
topic Original Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8809051/
https://www.ncbi.nlm.nih.gov/pubmed/35173908
http://dx.doi.org/10.48101/ujms.v127.8260
work_keys_str_mv AT josephnora automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT lindbladida automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT zakersara automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT elfverssonsharareh automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT albinzonmaria automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT ødegardøyvind automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT hantlerli automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials
AT hellstromperm automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials