Cargando…

ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials

BACKGROUND: Screening for eligible patients continues to pose a great challenge for many clinical trials. This has led to a rapidly growing interest in standardizing computable representations of eligibility criteria (EC) in order to develop tools that leverage data from electronic health record (EH...

Descripción completa

Detalles Bibliográficos
Autores principales: Rafee, Ahmed, Riepenhausen, Sarah, Neuhaus, Philipp, Meidt, Alexandra, Dugas, Martin, Varghese, Julian
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9107639/
https://www.ncbi.nlm.nih.gov/pubmed/35568796
http://dx.doi.org/10.1186/s12874-022-01611-y
_version_ 1784708526224965632
author Rafee, Ahmed
Riepenhausen, Sarah
Neuhaus, Philipp
Meidt, Alexandra
Dugas, Martin
Varghese, Julian
author_facet Rafee, Ahmed
Riepenhausen, Sarah
Neuhaus, Philipp
Meidt, Alexandra
Dugas, Martin
Varghese, Julian
author_sort Rafee, Ahmed
collection PubMed
description BACKGROUND: Screening for eligible patients continues to pose a great challenge for many clinical trials. This has led to a rapidly growing interest in standardizing computable representations of eligibility criteria (EC) in order to develop tools that leverage data from electronic health record (EHR) systems. Although laboratory procedures (LP) represent a common entity of EC that is readily available and retrievable from EHR systems, there is a lack of interoperable data models for this entity of EC. A public, specialized data model that utilizes international, widely-adopted terminology for LP, e.g. Logical Observation Identifiers Names and Codes (LOINC®), is much needed to support automated screening tools. OBJECTIVE: The aim of this study is to establish a core dataset for LP most frequently requested to recruit patients for clinical trials using LOINC terminology. Employing such a core dataset could enhance the interface between study feasibility platforms and EHR systems and significantly improve automatic patient recruitment. METHODS: We used a semi-automated approach to analyze 10,516 screening forms from the Medical Data Models (MDM) portal’s data repository that are pre-annotated with Unified Medical Language System (UMLS). An automated semantic analysis based on concept frequency is followed by an extensive manual expert review performed by physicians to analyze complex recruitment-relevant concepts not amenable to automatic approach. RESULTS: Based on analysis of 138,225 EC from 10,516 screening forms, 55 laboratory procedures represented 77.87% of all UMLS laboratory concept occurrences identified in the selected EC forms. We identified 26,413 unique UMLS concepts from 118 UMLS semantic types and covered the vast majority of Medical Subject Headings (MeSH) disease domains. CONCLUSIONS: Only a small set of common LP covers the majority of laboratory concepts in screening EC forms which supports the feasibility of establishing a focused core dataset for LP. We present ELaPro, a novel, LOINC-mapped, core dataset for the most frequent 55 LP requested in screening for clinical trials. ELaPro is available in multiple machine-readable data formats like CSV, ODM and HL7 FHIR. The extensive manual curation of this large number of free-text EC as well as the combining of UMLS and LOINC terminologies distinguishes this specialized dataset from previous relevant datasets in the literature. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01611-y.
format Online
Article
Text
id pubmed-9107639
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-91076392022-05-16 ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials Rafee, Ahmed Riepenhausen, Sarah Neuhaus, Philipp Meidt, Alexandra Dugas, Martin Varghese, Julian BMC Med Res Methodol Research BACKGROUND: Screening for eligible patients continues to pose a great challenge for many clinical trials. This has led to a rapidly growing interest in standardizing computable representations of eligibility criteria (EC) in order to develop tools that leverage data from electronic health record (EHR) systems. Although laboratory procedures (LP) represent a common entity of EC that is readily available and retrievable from EHR systems, there is a lack of interoperable data models for this entity of EC. A public, specialized data model that utilizes international, widely-adopted terminology for LP, e.g. Logical Observation Identifiers Names and Codes (LOINC®), is much needed to support automated screening tools. OBJECTIVE: The aim of this study is to establish a core dataset for LP most frequently requested to recruit patients for clinical trials using LOINC terminology. Employing such a core dataset could enhance the interface between study feasibility platforms and EHR systems and significantly improve automatic patient recruitment. METHODS: We used a semi-automated approach to analyze 10,516 screening forms from the Medical Data Models (MDM) portal’s data repository that are pre-annotated with Unified Medical Language System (UMLS). An automated semantic analysis based on concept frequency is followed by an extensive manual expert review performed by physicians to analyze complex recruitment-relevant concepts not amenable to automatic approach. RESULTS: Based on analysis of 138,225 EC from 10,516 screening forms, 55 laboratory procedures represented 77.87% of all UMLS laboratory concept occurrences identified in the selected EC forms. We identified 26,413 unique UMLS concepts from 118 UMLS semantic types and covered the vast majority of Medical Subject Headings (MeSH) disease domains. CONCLUSIONS: Only a small set of common LP covers the majority of laboratory concepts in screening EC forms which supports the feasibility of establishing a focused core dataset for LP. We present ELaPro, a novel, LOINC-mapped, core dataset for the most frequent 55 LP requested in screening for clinical trials. ELaPro is available in multiple machine-readable data formats like CSV, ODM and HL7 FHIR. The extensive manual curation of this large number of free-text EC as well as the combining of UMLS and LOINC terminologies distinguishes this specialized dataset from previous relevant datasets in the literature. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01611-y. BioMed Central 2022-05-14 /pmc/articles/PMC9107639/ /pubmed/35568796 http://dx.doi.org/10.1186/s12874-022-01611-y Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Rafee, Ahmed
Riepenhausen, Sarah
Neuhaus, Philipp
Meidt, Alexandra
Dugas, Martin
Varghese, Julian
ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
title ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
title_full ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
title_fullStr ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
title_full_unstemmed ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
title_short ELaPro, a LOINC-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
title_sort elapro, a loinc-mapped core dataset for top laboratory procedures of eligibility screening for clinical trials
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9107639/
https://www.ncbi.nlm.nih.gov/pubmed/35568796
http://dx.doi.org/10.1186/s12874-022-01611-y
work_keys_str_mv AT rafeeahmed elaproaloincmappedcoredatasetfortoplaboratoryproceduresofeligibilityscreeningforclinicaltrials
AT riepenhausensarah elaproaloincmappedcoredatasetfortoplaboratoryproceduresofeligibilityscreeningforclinicaltrials
AT neuhausphilipp elaproaloincmappedcoredatasetfortoplaboratoryproceduresofeligibilityscreeningforclinicaltrials
AT meidtalexandra elaproaloincmappedcoredatasetfortoplaboratoryproceduresofeligibilityscreeningforclinicaltrials
AT dugasmartin elaproaloincmappedcoredatasetfortoplaboratoryproceduresofeligibilityscreeningforclinicaltrials
AT varghesejulian elaproaloincmappedcoredatasetfortoplaboratoryproceduresofeligibilityscreeningforclinicaltrials