Cargando…

Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study

BACKGROUND: In this evaluation, we aim to strengthen Routine Health Information Systems (RHIS) through the digitization of data quality assessment (DQA) processes. We leverage electronic data from the Kenya Health Information System (KHIS) which is based on the District Health Information System ver...

Descripción completa

Detalles Bibliográficos
Autores principales: Odeny, Beryne M., Njoroge, Anne, Gloyd, Steve, Hughes, James P., Wagenaar, Bradley H., Odhiambo, Jacob, Nyagah, Lilly M., Manya, Ayub, Oghera, Ooga Wesley, Puttkammer, Nancy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2023
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10594801/
https://www.ncbi.nlm.nih.gov/pubmed/37872540
http://dx.doi.org/10.1186/s12913-023-10133-2
_version_ 1785124727369498624
author Odeny, Beryne M.
Njoroge, Anne
Gloyd, Steve
Hughes, James P.
Wagenaar, Bradley H.
Odhiambo, Jacob
Nyagah, Lilly M.
Manya, Ayub
Oghera, Ooga Wesley
Puttkammer, Nancy
author_facet Odeny, Beryne M.
Njoroge, Anne
Gloyd, Steve
Hughes, James P.
Wagenaar, Bradley H.
Odhiambo, Jacob
Nyagah, Lilly M.
Manya, Ayub
Oghera, Ooga Wesley
Puttkammer, Nancy
author_sort Odeny, Beryne M.
collection PubMed
description BACKGROUND: In this evaluation, we aim to strengthen Routine Health Information Systems (RHIS) through the digitization of data quality assessment (DQA) processes. We leverage electronic data from the Kenya Health Information System (KHIS) which is based on the District Health Information System version 2 (DHIS2) to perform DQAs at scale. We provide a systematic guide to developing composite data quality scores and use these scores to assess data quality in Kenya. METHODS: We evaluated 187 HIV care facilities with electronic medical records across Kenya. Using quarterly, longitudinal KHIS data from January 2011 to June 2018 (total N = 30 quarters), we extracted indicators encompassing general HIV services including services to prevent mother-to-child transmission (PMTCT). We assessed the accuracy (the extent to which data were correct and free of error) of these data using three data-driven composite scores: 1) completeness score; 2) consistency score; and 3) discrepancy score. Completeness refers to the presence of the appropriate amount of data. Consistency refers to uniformity of data across multiple indicators. Discrepancy (measured on a Z-scale) refers to the degree of alignment (or lack thereof) of data with rules that defined the possible valid values for the data. RESULTS: A total of 5,610 unique facility-quarters were extracted from KHIS. The mean completeness score was 61.1% [standard deviation (SD) = 27%]. The mean consistency score was 80% (SD = 16.4%). The mean discrepancy score was 0.07 (SD = 0.22). A strong and positive correlation was identified between the consistency score and discrepancy score (correlation coefficient = 0.77), whereas the correlation of either score with the completeness score was low with a correlation coefficient of -0.12 (with consistency score) and -0.36 (with discrepancy score). General HIV indicators were more complete, but less consistent, and less plausible than PMTCT indicators. CONCLUSION: We observed a lack of correlation between the completeness score and the other two scores. As such, for a holistic DQA, completeness assessment should be paired with the measurement of either consistency or discrepancy to reflect distinct dimensions of data quality. Given the complexity of the discrepancy score, we recommend the simpler consistency score, since they were highly correlated. Routine use of composite scores on KHIS data could enhance efficiencies in DQA at scale as digitization of health information expands and could be applied to other health sectors beyondHIV clinics. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12913-023-10133-2.
format Online
Article
Text
id pubmed-10594801
institution National Center for Biotechnology Information
language English
publishDate 2023
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-105948012023-10-25 Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study Odeny, Beryne M. Njoroge, Anne Gloyd, Steve Hughes, James P. Wagenaar, Bradley H. Odhiambo, Jacob Nyagah, Lilly M. Manya, Ayub Oghera, Ooga Wesley Puttkammer, Nancy BMC Health Serv Res Research BACKGROUND: In this evaluation, we aim to strengthen Routine Health Information Systems (RHIS) through the digitization of data quality assessment (DQA) processes. We leverage electronic data from the Kenya Health Information System (KHIS) which is based on the District Health Information System version 2 (DHIS2) to perform DQAs at scale. We provide a systematic guide to developing composite data quality scores and use these scores to assess data quality in Kenya. METHODS: We evaluated 187 HIV care facilities with electronic medical records across Kenya. Using quarterly, longitudinal KHIS data from January 2011 to June 2018 (total N = 30 quarters), we extracted indicators encompassing general HIV services including services to prevent mother-to-child transmission (PMTCT). We assessed the accuracy (the extent to which data were correct and free of error) of these data using three data-driven composite scores: 1) completeness score; 2) consistency score; and 3) discrepancy score. Completeness refers to the presence of the appropriate amount of data. Consistency refers to uniformity of data across multiple indicators. Discrepancy (measured on a Z-scale) refers to the degree of alignment (or lack thereof) of data with rules that defined the possible valid values for the data. RESULTS: A total of 5,610 unique facility-quarters were extracted from KHIS. The mean completeness score was 61.1% [standard deviation (SD) = 27%]. The mean consistency score was 80% (SD = 16.4%). The mean discrepancy score was 0.07 (SD = 0.22). A strong and positive correlation was identified between the consistency score and discrepancy score (correlation coefficient = 0.77), whereas the correlation of either score with the completeness score was low with a correlation coefficient of -0.12 (with consistency score) and -0.36 (with discrepancy score). General HIV indicators were more complete, but less consistent, and less plausible than PMTCT indicators. CONCLUSION: We observed a lack of correlation between the completeness score and the other two scores. As such, for a holistic DQA, completeness assessment should be paired with the measurement of either consistency or discrepancy to reflect distinct dimensions of data quality. Given the complexity of the discrepancy score, we recommend the simpler consistency score, since they were highly correlated. Routine use of composite scores on KHIS data could enhance efficiencies in DQA at scale as digitization of health information expands and could be applied to other health sectors beyondHIV clinics. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12913-023-10133-2. BioMed Central 2023-10-23 /pmc/articles/PMC10594801/ /pubmed/37872540 http://dx.doi.org/10.1186/s12913-023-10133-2 Text en © The Author(s) 2023 https://creativecommons.org/licenses/by/4.0/Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Odeny, Beryne M.
Njoroge, Anne
Gloyd, Steve
Hughes, James P.
Wagenaar, Bradley H.
Odhiambo, Jacob
Nyagah, Lilly M.
Manya, Ayub
Oghera, Ooga Wesley
Puttkammer, Nancy
Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study
title Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study
title_full Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study
title_fullStr Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study
title_full_unstemmed Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study
title_short Development of novel composite data quality scores to evaluate facility-level data quality in electronic data in Kenya: a nationwide retrospective cohort study
title_sort development of novel composite data quality scores to evaluate facility-level data quality in electronic data in kenya: a nationwide retrospective cohort study
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC10594801/
https://www.ncbi.nlm.nih.gov/pubmed/37872540
http://dx.doi.org/10.1186/s12913-023-10133-2
work_keys_str_mv AT odenyberynem developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT njorogeanne developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT gloydsteve developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT hughesjamesp developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT wagenaarbradleyh developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT odhiambojacob developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT nyagahlillym developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT manyaayub developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT ogheraoogawesley developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy
AT puttkammernancy developmentofnovelcompositedataqualityscorestoevaluatefacilityleveldataqualityinelectronicdatainkenyaanationwideretrospectivecohortstudy