Cargando…

Reducing defects in the datasets of clinical research studies: conformance with data quality metrics

BACKGROUND: A dataset is indispensable to answer the research questions of clinical research studies. Inaccurate data lead to ambiguous results, and the removal of errors results in increased cost. The aim of this Quality Improvement Project (QIP) was to improve the Data Quality (DQ) by enhancing co...

Descripción completa

Detalles Bibliográficos
Autores principales: Shaheen, Naila A., Manezhi, Bipin, Thomas, Abin, AlKelya, Mohammed
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511206/
https://www.ncbi.nlm.nih.gov/pubmed/31077148
http://dx.doi.org/10.1186/s12874-019-0735-7
_version_ 1783417541758025728
author Shaheen, Naila A.
Manezhi, Bipin
Thomas, Abin
AlKelya, Mohammed
author_facet Shaheen, Naila A.
Manezhi, Bipin
Thomas, Abin
AlKelya, Mohammed
author_sort Shaheen, Naila A.
collection PubMed
description BACKGROUND: A dataset is indispensable to answer the research questions of clinical research studies. Inaccurate data lead to ambiguous results, and the removal of errors results in increased cost. The aim of this Quality Improvement Project (QIP) was to improve the Data Quality (DQ) by enhancing conformance and minimizing data entry errors. METHODS: This is a QIP which was conducted in the Department of Biostatistics using historical datasets submitted for statistical data analysis from the department’s knowledge base system. Forty-five datasets received for statistical data analysis, were included at baseline. A 12-item checklist based on six DQ domains (i) completeness (ii) uniqueness (iii) timeliness (iv) accuracy (v) validity and (vi) consistency was developed to assess the DQ. The checklist was comprised of 12 items; missing values, un-coded values, miscoded values, embedded values, implausible values, unformatted values, missing codebook, inconsistencies with the codebook, inaccurate format, unanalyzable data structure, missing outcome variables, and missing analytic variables. The outcome was the number of defects per dataset. Quality improvement DMAIC (Define, Measure, Analyze, Improve, Control) framework and sigma improvement tools were used. Pre-Post design was implemented using mode of interventions. Pre-Post change in defects (zero, one, two or more defects) was compared by using chi-square test. RESULTS: At baseline, out of forty-five datasets; six (13.3%) datasets had zero defects, eight (17.8%) had one defect, and 31(69%) had ≥2 defects. The association between the nature of data capture (single vs. multiple data points) and defective data was statistically significant (p = 0.008). Twenty-one datasets were received during post-intervention for statistical data analysis. Seventeen (81%) had zero defects, two (9.5%) had one defect, and two (9.5%) had two or more defects. The proportion of datasets with zero defects had increased from 13.3 to 81%, whereas the proportion of datasets with two or more defects had decreased from 69 to 9.5% (p = < 0.001). CONCLUSION: Clinical research study teams often have limited knowledge of data structuring. Given the need for good quality data, we recommend training programs, consultation with data experts prior to data structuring and use of electronic data capturing methods.
format Online
Article
Text
id pubmed-6511206
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-65112062019-05-20 Reducing defects in the datasets of clinical research studies: conformance with data quality metrics Shaheen, Naila A. Manezhi, Bipin Thomas, Abin AlKelya, Mohammed BMC Med Res Methodol Research Article BACKGROUND: A dataset is indispensable to answer the research questions of clinical research studies. Inaccurate data lead to ambiguous results, and the removal of errors results in increased cost. The aim of this Quality Improvement Project (QIP) was to improve the Data Quality (DQ) by enhancing conformance and minimizing data entry errors. METHODS: This is a QIP which was conducted in the Department of Biostatistics using historical datasets submitted for statistical data analysis from the department’s knowledge base system. Forty-five datasets received for statistical data analysis, were included at baseline. A 12-item checklist based on six DQ domains (i) completeness (ii) uniqueness (iii) timeliness (iv) accuracy (v) validity and (vi) consistency was developed to assess the DQ. The checklist was comprised of 12 items; missing values, un-coded values, miscoded values, embedded values, implausible values, unformatted values, missing codebook, inconsistencies with the codebook, inaccurate format, unanalyzable data structure, missing outcome variables, and missing analytic variables. The outcome was the number of defects per dataset. Quality improvement DMAIC (Define, Measure, Analyze, Improve, Control) framework and sigma improvement tools were used. Pre-Post design was implemented using mode of interventions. Pre-Post change in defects (zero, one, two or more defects) was compared by using chi-square test. RESULTS: At baseline, out of forty-five datasets; six (13.3%) datasets had zero defects, eight (17.8%) had one defect, and 31(69%) had ≥2 defects. The association between the nature of data capture (single vs. multiple data points) and defective data was statistically significant (p = 0.008). Twenty-one datasets were received during post-intervention for statistical data analysis. Seventeen (81%) had zero defects, two (9.5%) had one defect, and two (9.5%) had two or more defects. The proportion of datasets with zero defects had increased from 13.3 to 81%, whereas the proportion of datasets with two or more defects had decreased from 69 to 9.5% (p = < 0.001). CONCLUSION: Clinical research study teams often have limited knowledge of data structuring. Given the need for good quality data, we recommend training programs, consultation with data experts prior to data structuring and use of electronic data capturing methods. BioMed Central 2019-05-10 /pmc/articles/PMC6511206/ /pubmed/31077148 http://dx.doi.org/10.1186/s12874-019-0735-7 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
Shaheen, Naila A.
Manezhi, Bipin
Thomas, Abin
AlKelya, Mohammed
Reducing defects in the datasets of clinical research studies: conformance with data quality metrics
title Reducing defects in the datasets of clinical research studies: conformance with data quality metrics
title_full Reducing defects in the datasets of clinical research studies: conformance with data quality metrics
title_fullStr Reducing defects in the datasets of clinical research studies: conformance with data quality metrics
title_full_unstemmed Reducing defects in the datasets of clinical research studies: conformance with data quality metrics
title_short Reducing defects in the datasets of clinical research studies: conformance with data quality metrics
title_sort reducing defects in the datasets of clinical research studies: conformance with data quality metrics
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6511206/
https://www.ncbi.nlm.nih.gov/pubmed/31077148
http://dx.doi.org/10.1186/s12874-019-0735-7
work_keys_str_mv AT shaheennailaa reducingdefectsinthedatasetsofclinicalresearchstudiesconformancewithdataqualitymetrics
AT manezhibipin reducingdefectsinthedatasetsofclinicalresearchstudiesconformancewithdataqualitymetrics
AT thomasabin reducingdefectsinthedatasetsofclinicalresearchstudiesconformancewithdataqualitymetrics
AT alkelyamohammed reducingdefectsinthedatasetsofclinicalresearchstudiesconformancewithdataqualitymetrics