Cargando…

Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments

Background  Multisite research networks such as the project “Collaboration on Rare Diseases” connect various hospitals to obtain sufficient data for clinical research. However, data quality (DQ) remains a challenge for the secondary use of data recorded in different health information systems. High...

Descripción completa

Detalles Bibliográficos
Autores principales: Tahar, Kais, Martin, Tamara, Mou, Yongli, Verbuecheln, Raphael, Graessner, Holm, Krefting, Dagmar
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Georg Thieme Verlag KG 2023
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462432/
https://www.ncbi.nlm.nih.gov/pubmed/36596461
http://dx.doi.org/10.1055/a-2006-1018
_version_ 1785098030278508544
author Tahar, Kais
Martin, Tamara
Mou, Yongli
Verbuecheln, Raphael
Graessner, Holm
Krefting, Dagmar
author_facet Tahar, Kais
Martin, Tamara
Mou, Yongli
Verbuecheln, Raphael
Graessner, Holm
Krefting, Dagmar
author_sort Tahar, Kais
collection PubMed
description Background  Multisite research networks such as the project “Collaboration on Rare Diseases” connect various hospitals to obtain sufficient data for clinical research. However, data quality (DQ) remains a challenge for the secondary use of data recorded in different health information systems. High levels of DQ as well as appropriate quality assessment methods are needed to support the reuse of such distributed data. Objectives  The aim of this work is the development of an interoperable methodology for assessing the quality of data recorded in heterogeneous sources to improve the quality of rare disease (RD) documentation and support clinical research. Methods  We first developed a conceptual framework for DQ assessment. Using this theoretical guidance, we implemented a software framework that provides appropriate tools for calculating DQ metrics and for generating local as well as cross-institutional reports. We further applied our methodology on synthetic data distributed across multiple hospitals using Personal Health Train. Finally, we used precision and recall as metrics to validate our implementation. Results  Four DQ dimensions were defined and represented as disjunct ontological categories. Based on these top dimensions, 9 DQ concepts, 10 DQ indicators, and 25 DQ parameters were developed and applied to different data sets. Randomly introduced DQ issues were all identified and reported automatically. The generated reports show the resulting DQ indicators and detected DQ issues. Conclusion  We have shown that our approach yields promising results, which can be used for local and cross-institutional DQ assessments. The developed frameworks provide useful methods for interoperable and privacy-preserving assessments of DQ that meet the specified requirements. This study has demonstrated that our methodology is capable of detecting DQ issues such as ambiguity or implausibility of coded diagnoses. It can be used for DQ benchmarking to improve the quality of RD documentation and to support clinical research on distributed data.
format Online
Article
Text
id pubmed-10462432
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher Georg Thieme Verlag KG
record_format MEDLINE/PubMed
spelling pubmed-104624322023-08-29 Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments Tahar, Kais Martin, Tamara Mou, Yongli Verbuecheln, Raphael Graessner, Holm Krefting, Dagmar Methods Inf Med Background  Multisite research networks such as the project “Collaboration on Rare Diseases” connect various hospitals to obtain sufficient data for clinical research. However, data quality (DQ) remains a challenge for the secondary use of data recorded in different health information systems. High levels of DQ as well as appropriate quality assessment methods are needed to support the reuse of such distributed data. Objectives  The aim of this work is the development of an interoperable methodology for assessing the quality of data recorded in heterogeneous sources to improve the quality of rare disease (RD) documentation and support clinical research. Methods  We first developed a conceptual framework for DQ assessment. Using this theoretical guidance, we implemented a software framework that provides appropriate tools for calculating DQ metrics and for generating local as well as cross-institutional reports. We further applied our methodology on synthetic data distributed across multiple hospitals using Personal Health Train. Finally, we used precision and recall as metrics to validate our implementation. Results  Four DQ dimensions were defined and represented as disjunct ontological categories. Based on these top dimensions, 9 DQ concepts, 10 DQ indicators, and 25 DQ parameters were developed and applied to different data sets. Randomly introduced DQ issues were all identified and reported automatically. The generated reports show the resulting DQ indicators and detected DQ issues. Conclusion  We have shown that our approach yields promising results, which can be used for local and cross-institutional DQ assessments. The developed frameworks provide useful methods for interoperable and privacy-preserving assessments of DQ that meet the specified requirements. This study has demonstrated that our methodology is capable of detecting DQ issues such as ambiguity or implausibility of coded diagnoses. It can be used for DQ benchmarking to improve the quality of RD documentation and to support clinical research on distributed data. Georg Thieme Verlag KG 2023-05-16 /pmc/articles/PMC10462432/ /pubmed/36596461 http://dx.doi.org/10.1055/a-2006-1018 Text en The Author(s). This is an open access article published by Thieme under the terms of the Creative Commons Attribution License, permitting unrestricted use, distribution, and reproduction so long as the original work is properly cited. ( https://creativecommons.org/licenses/by/4.0/ ) https://creativecommons.org/licenses/by/4.0/This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Tahar, Kais
Martin, Tamara
Mou, Yongli
Verbuecheln, Raphael
Graessner, Holm
Krefting, Dagmar
Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments
title Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments
title_full Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments
title_fullStr Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments
title_full_unstemmed Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments
title_short Rare Diseases in Hospital Information Systems—An Interoperable Methodology for Distributed Data Quality Assessments
title_sort rare diseases in hospital information systems—an interoperable methodology for distributed data quality assessments
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10462432/
https://www.ncbi.nlm.nih.gov/pubmed/36596461
http://dx.doi.org/10.1055/a-2006-1018
work_keys_str_mv AT taharkais rarediseasesinhospitalinformationsystemsaninteroperablemethodologyfordistributeddataqualityassessments
AT martintamara rarediseasesinhospitalinformationsystemsaninteroperablemethodologyfordistributeddataqualityassessments
AT mouyongli rarediseasesinhospitalinformationsystemsaninteroperablemethodologyfordistributeddataqualityassessments
AT verbuechelnraphael rarediseasesinhospitalinformationsystemsaninteroperablemethodologyfordistributeddataqualityassessments
AT graessnerholm rarediseasesinhospitalinformationsystemsaninteroperablemethodologyfordistributeddataqualityassessments
AT kreftingdagmar rarediseasesinhospitalinformationsystemsaninteroperablemethodologyfordistributeddataqualityassessments