Cargando…

Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use

INTRODUCTION: Secondary use of electronic health record (EHR) data for research requires that the data are fit for use. Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a frame...

Descripción completa

Detalles Bibliográficos
Autores principales: Razzaghi, Hanieh, Greenberg, Jane, Bailey, L. Charles
Formato: Online Artículo Texto
Lenguaje:English
Publicado: John Wiley and Sons Inc. 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8753309/
https://www.ncbi.nlm.nih.gov/pubmed/35036548
http://dx.doi.org/10.1002/lrh2.10264
_version_ 1784632067782344704
author Razzaghi, Hanieh
Greenberg, Jane
Bailey, L. Charles
author_facet Razzaghi, Hanieh
Greenberg, Jane
Bailey, L. Charles
author_sort Razzaghi, Hanieh
collection PubMed
description INTRODUCTION: Secondary use of electronic health record (EHR) data for research requires that the data are fit for use. Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a framework for evaluating semantic DQ that will allow researchers to evaluate fitness for use prior to analyses. METHODS: We reviewed current DQ literature, as well as experience from recent multisite network studies, and identified gaps in the literature and current practice. Derived principles were used to construct the conceptual framework with attention to both analytic fitness and informatics practice. RESULTS: We developed a systematic framework that guides researchers in assessing whether a data source is fit for use for their intended study or project. It combines tools for evaluating clinical context with DQ principles, as well as factoring in the characteristics of the data source, in order to develop semantic DQ checks. CONCLUSIONS: Our framework provides a systematic process for DQ development. Further work is needed to codify practices and metadata around both structural and semantic data quality.
format Online
Article
Text
id pubmed-8753309
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher John Wiley and Sons Inc.
record_format MEDLINE/PubMed
spelling pubmed-87533092022-01-14 Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use Razzaghi, Hanieh Greenberg, Jane Bailey, L. Charles Learn Health Syst Technical Reports INTRODUCTION: Secondary use of electronic health record (EHR) data for research requires that the data are fit for use. Data quality (DQ) frameworks have traditionally focused on structural conformance and completeness of clinical data extracted from source systems. In this paper, we propose a framework for evaluating semantic DQ that will allow researchers to evaluate fitness for use prior to analyses. METHODS: We reviewed current DQ literature, as well as experience from recent multisite network studies, and identified gaps in the literature and current practice. Derived principles were used to construct the conceptual framework with attention to both analytic fitness and informatics practice. RESULTS: We developed a systematic framework that guides researchers in assessing whether a data source is fit for use for their intended study or project. It combines tools for evaluating clinical context with DQ principles, as well as factoring in the characteristics of the data source, in order to develop semantic DQ checks. CONCLUSIONS: Our framework provides a systematic process for DQ development. Further work is needed to codify practices and metadata around both structural and semantic data quality. John Wiley and Sons Inc. 2021-05-03 /pmc/articles/PMC8753309/ /pubmed/35036548 http://dx.doi.org/10.1002/lrh2.10264 Text en © 2021 The Authors. Learning Health Systems published by Wiley Periodicals LLC on behalf of University of Michigan. https://creativecommons.org/licenses/by-nc-nd/4.0/This is an open access article under the terms of the http://creativecommons.org/licenses/by-nc-nd/4.0/ (https://creativecommons.org/licenses/by-nc-nd/4.0/) License, which permits use and distribution in any medium, provided the original work is properly cited, the use is non‐commercial and no modifications or adaptations are made.
spellingShingle Technical Reports
Razzaghi, Hanieh
Greenberg, Jane
Bailey, L. Charles
Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
title Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
title_full Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
title_fullStr Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
title_full_unstemmed Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
title_short Developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
title_sort developing a systematic approach to assessing data quality in secondary use of clinical data based on intended use
topic Technical Reports
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8753309/
https://www.ncbi.nlm.nih.gov/pubmed/35036548
http://dx.doi.org/10.1002/lrh2.10264
work_keys_str_mv AT razzaghihanieh developingasystematicapproachtoassessingdataqualityinsecondaryuseofclinicaldatabasedonintendeduse
AT greenbergjane developingasystematicapproachtoassessingdataqualityinsecondaryuseofclinicaldatabasedonintendeduse
AT baileylcharles developingasystematicapproachtoassessingdataqualityinsecondaryuseofclinicaldatabasedonintendeduse