Cargando…

Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection

BACKGROUND: To evaluate institutional nursing care performance in the context of national comparative statistics (benchmarks), approximately one in every three major healthcare institutions (over 1,800 hospitals) across the United States, have joined the National Database for Nursing Quality Indicat...

Descripción completa

Detalles Bibliográficos
Autores principales: Hou, Qingjiang, Crosser, Brandon, Mahnken, Jonathan D, Gajewski, Byron J, Dunton, Nancy
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2012
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3542164/
https://www.ncbi.nlm.nih.gov/pubmed/22920157
http://dx.doi.org/10.1186/1756-0500-5-456
_version_ 1782255464700968960
author Hou, Qingjiang
Crosser, Brandon
Mahnken, Jonathan D
Gajewski, Byron J
Dunton, Nancy
author_facet Hou, Qingjiang
Crosser, Brandon
Mahnken, Jonathan D
Gajewski, Byron J
Dunton, Nancy
author_sort Hou, Qingjiang
collection PubMed
description BACKGROUND: To evaluate institutional nursing care performance in the context of national comparative statistics (benchmarks), approximately one in every three major healthcare institutions (over 1,800 hospitals) across the United States, have joined the National Database for Nursing Quality Indicators® (NDNQI®). With over 18,000 hospital units contributing data for nearly 200 quantitative measures at present, a reliable and efficient input data screening for all quantitative measures for data quality control is critical to the integrity, validity, and on-time delivery of NDNQI reports. METHODS: With Monte Carlo simulation and quantitative NDNQI indicator examples, we compared two ad-hoc methods using robust scale estimators, Inter Quartile Range (IQR) and Median Absolute Deviation from the Median (MAD), to the classic, theoretically-based Minimum Covariance Determinant (FAST-MCD) approach, for initial univariate outlier detection. RESULTS: While the theoretically based FAST-MCD used in one dimension can be sensitive and is better suited for identifying groups of outliers because of its high breakdown point, the ad-hoc IQR and MAD approaches are fast, easy to implement, and could be more robust and efficient, depending on the distributional property of the underlying measure of interest. CONCLUSION: With highly skewed distributions for most NDNQI indicators within a short data screen window, the FAST-MCD approach, when used in one dimensional raw data setting, could overestimate the false alarm rates for potential outliers than the IQR and MAD with the same pre-set of critical value, thus, overburden data quality control at both the data entry and administrative ends in our setting.
format Online
Article
Text
id pubmed-3542164
institution National Center for Biotechnology Information
language English
publishDate 2012
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-35421642013-01-11 Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection Hou, Qingjiang Crosser, Brandon Mahnken, Jonathan D Gajewski, Byron J Dunton, Nancy BMC Res Notes Research Article BACKGROUND: To evaluate institutional nursing care performance in the context of national comparative statistics (benchmarks), approximately one in every three major healthcare institutions (over 1,800 hospitals) across the United States, have joined the National Database for Nursing Quality Indicators® (NDNQI®). With over 18,000 hospital units contributing data for nearly 200 quantitative measures at present, a reliable and efficient input data screening for all quantitative measures for data quality control is critical to the integrity, validity, and on-time delivery of NDNQI reports. METHODS: With Monte Carlo simulation and quantitative NDNQI indicator examples, we compared two ad-hoc methods using robust scale estimators, Inter Quartile Range (IQR) and Median Absolute Deviation from the Median (MAD), to the classic, theoretically-based Minimum Covariance Determinant (FAST-MCD) approach, for initial univariate outlier detection. RESULTS: While the theoretically based FAST-MCD used in one dimension can be sensitive and is better suited for identifying groups of outliers because of its high breakdown point, the ad-hoc IQR and MAD approaches are fast, easy to implement, and could be more robust and efficient, depending on the distributional property of the underlying measure of interest. CONCLUSION: With highly skewed distributions for most NDNQI indicators within a short data screen window, the FAST-MCD approach, when used in one dimensional raw data setting, could overestimate the false alarm rates for potential outliers than the IQR and MAD with the same pre-set of critical value, thus, overburden data quality control at both the data entry and administrative ends in our setting. BioMed Central 2012-08-25 /pmc/articles/PMC3542164/ /pubmed/22920157 http://dx.doi.org/10.1186/1756-0500-5-456 Text en Copyright ©2012 Hou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Research Article
Hou, Qingjiang
Crosser, Brandon
Mahnken, Jonathan D
Gajewski, Byron J
Dunton, Nancy
Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
title Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
title_full Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
title_fullStr Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
title_full_unstemmed Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
title_short Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
title_sort input data quality control for ndnqi national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3542164/
https://www.ncbi.nlm.nih.gov/pubmed/22920157
http://dx.doi.org/10.1186/1756-0500-5-456
work_keys_str_mv AT houqingjiang inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection
AT crosserbrandon inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection
AT mahnkenjonathand inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection
AT gajewskibyronj inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection
AT duntonnancy inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection