Cargando…
Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials
BACKGROUND: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized...
Autores principales: | , , , , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Open Academia
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8809051/ https://www.ncbi.nlm.nih.gov/pubmed/35173908 http://dx.doi.org/10.48101/ujms.v127.8260 |
_version_ | 1784643942139035648 |
---|---|
author | Joseph, Nora Lindblad, Ida Zaker, Sara Elfversson, Sharareh Albinzon, Maria Ødegård, Øyvind Hantler, Li Hellström, Per M. |
author_facet | Joseph, Nora Lindblad, Ida Zaker, Sara Elfversson, Sharareh Albinzon, Maria Ødegård, Øyvind Hantler, Li Hellström, Per M. |
author_sort | Joseph, Nora |
collection | PubMed |
description | BACKGROUND: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP). METHODS: The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data. RESULTS: From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6–101.1% over the years with 96.1% weighted average sensitivity. CONCLUSION: Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials. |
format | Online Article Text |
id | pubmed-8809051 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | Open Academia |
record_format | MEDLINE/PubMed |
spelling | pubmed-88090512022-02-15 Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials Joseph, Nora Lindblad, Ida Zaker, Sara Elfversson, Sharareh Albinzon, Maria Ødegård, Øyvind Hantler, Li Hellström, Per M. Ups J Med Sci Original Article BACKGROUND: Electronic medical records (EMRs) are adopted for storing patient-related healthcare information. Using data mining techniques, it is possible to make use of and derive benefit from this massive amount of data effectively. We aimed to evaluate validity of data extracted by the Customized eXtraction Program (CXP). METHODS: The CXP extracts and structures data in rapid standardised processes. The CXP was programmed to extract TNFα-native active ulcerative colitis (UC) patients from EMRs using defined International Classification of Disease-10 (ICD-10) codes. Extracted data were read in parallel with manual assessment of the EMR to compare with CXP-extracted data. RESULTS: From the complete EMR set, 2,802 patients with code K51 (UC) were extracted. Then, CXP extracted 332 patients according to inclusion and exclusion criteria. Of these, 97.5% were correctly identified, resulting in a final set of 320 cases eligible for the study. When comparing CXP-extracted data against manually assessed EMRs, the recovery rate was 95.6–101.1% over the years with 96.1% weighted average sensitivity. CONCLUSION: Utilisation of the CXP software can be considered as an effective way to extract relevant EMR data without significant errors. Hence, by extracting from EMRs, CXP accurately identifies patients and has the capacity to facilitate research studies and clinical trials by finding patients with the requested code as well as funnel down itemised individuals according to specified inclusion and exclusion criteria. Beyond this, medical procedures and laboratory data can rapidly be retrieved from the EMRs to create tailored databases of extracted material for immediate use in clinical trials. Open Academia 2022-01-27 /pmc/articles/PMC8809051/ /pubmed/35173908 http://dx.doi.org/10.48101/ujms.v127.8260 Text en © 2022 The Author(s). Published by Upsala Medical Society. https://creativecommons.org/licenses/by/4.0/This is an Open Access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Original Article Joseph, Nora Lindblad, Ida Zaker, Sara Elfversson, Sharareh Albinzon, Maria Ødegård, Øyvind Hantler, Li Hellström, Per M. Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
title | Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
title_full | Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
title_fullStr | Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
title_full_unstemmed | Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
title_short | Automated data extraction of electronic medical records: Validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
title_sort | automated data extraction of electronic medical records: validity of data mining to construct research databases for eligibility in gastroenterological clinical trials |
topic | Original Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8809051/ https://www.ncbi.nlm.nih.gov/pubmed/35173908 http://dx.doi.org/10.48101/ujms.v127.8260 |
work_keys_str_mv | AT josephnora automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT lindbladida automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT zakersara automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT elfverssonsharareh automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT albinzonmaria automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT ødegardøyvind automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT hantlerli automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials AT hellstromperm automateddataextractionofelectronicmedicalrecordsvalidityofdataminingtoconstructresearchdatabasesforeligibilityingastroenterologicalclinicaltrials |