Cargando…
Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection
BACKGROUND: To evaluate institutional nursing care performance in the context of national comparative statistics (benchmarks), approximately one in every three major healthcare institutions (over 1,800 hospitals) across the United States, have joined the National Database for Nursing Quality Indicat...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2012
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3542164/ https://www.ncbi.nlm.nih.gov/pubmed/22920157 http://dx.doi.org/10.1186/1756-0500-5-456 |
_version_ | 1782255464700968960 |
---|---|
author | Hou, Qingjiang Crosser, Brandon Mahnken, Jonathan D Gajewski, Byron J Dunton, Nancy |
author_facet | Hou, Qingjiang Crosser, Brandon Mahnken, Jonathan D Gajewski, Byron J Dunton, Nancy |
author_sort | Hou, Qingjiang |
collection | PubMed |
description | BACKGROUND: To evaluate institutional nursing care performance in the context of national comparative statistics (benchmarks), approximately one in every three major healthcare institutions (over 1,800 hospitals) across the United States, have joined the National Database for Nursing Quality Indicators® (NDNQI®). With over 18,000 hospital units contributing data for nearly 200 quantitative measures at present, a reliable and efficient input data screening for all quantitative measures for data quality control is critical to the integrity, validity, and on-time delivery of NDNQI reports. METHODS: With Monte Carlo simulation and quantitative NDNQI indicator examples, we compared two ad-hoc methods using robust scale estimators, Inter Quartile Range (IQR) and Median Absolute Deviation from the Median (MAD), to the classic, theoretically-based Minimum Covariance Determinant (FAST-MCD) approach, for initial univariate outlier detection. RESULTS: While the theoretically based FAST-MCD used in one dimension can be sensitive and is better suited for identifying groups of outliers because of its high breakdown point, the ad-hoc IQR and MAD approaches are fast, easy to implement, and could be more robust and efficient, depending on the distributional property of the underlying measure of interest. CONCLUSION: With highly skewed distributions for most NDNQI indicators within a short data screen window, the FAST-MCD approach, when used in one dimensional raw data setting, could overestimate the false alarm rates for potential outliers than the IQR and MAD with the same pre-set of critical value, thus, overburden data quality control at both the data entry and administrative ends in our setting. |
format | Online Article Text |
id | pubmed-3542164 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2012 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-35421642013-01-11 Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection Hou, Qingjiang Crosser, Brandon Mahnken, Jonathan D Gajewski, Byron J Dunton, Nancy BMC Res Notes Research Article BACKGROUND: To evaluate institutional nursing care performance in the context of national comparative statistics (benchmarks), approximately one in every three major healthcare institutions (over 1,800 hospitals) across the United States, have joined the National Database for Nursing Quality Indicators® (NDNQI®). With over 18,000 hospital units contributing data for nearly 200 quantitative measures at present, a reliable and efficient input data screening for all quantitative measures for data quality control is critical to the integrity, validity, and on-time delivery of NDNQI reports. METHODS: With Monte Carlo simulation and quantitative NDNQI indicator examples, we compared two ad-hoc methods using robust scale estimators, Inter Quartile Range (IQR) and Median Absolute Deviation from the Median (MAD), to the classic, theoretically-based Minimum Covariance Determinant (FAST-MCD) approach, for initial univariate outlier detection. RESULTS: While the theoretically based FAST-MCD used in one dimension can be sensitive and is better suited for identifying groups of outliers because of its high breakdown point, the ad-hoc IQR and MAD approaches are fast, easy to implement, and could be more robust and efficient, depending on the distributional property of the underlying measure of interest. CONCLUSION: With highly skewed distributions for most NDNQI indicators within a short data screen window, the FAST-MCD approach, when used in one dimensional raw data setting, could overestimate the false alarm rates for potential outliers than the IQR and MAD with the same pre-set of critical value, thus, overburden data quality control at both the data entry and administrative ends in our setting. BioMed Central 2012-08-25 /pmc/articles/PMC3542164/ /pubmed/22920157 http://dx.doi.org/10.1186/1756-0500-5-456 Text en Copyright ©2012 Hou et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Research Article Hou, Qingjiang Crosser, Brandon Mahnken, Jonathan D Gajewski, Byron J Dunton, Nancy Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
title | Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
title_full | Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
title_fullStr | Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
title_full_unstemmed | Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
title_short | Input data quality control for NDNQI national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
title_sort | input data quality control for ndnqi national comparative statistics and quarterly reports: a contrast of three robust scale estimators for multiple outlier detection |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3542164/ https://www.ncbi.nlm.nih.gov/pubmed/22920157 http://dx.doi.org/10.1186/1756-0500-5-456 |
work_keys_str_mv | AT houqingjiang inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection AT crosserbrandon inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection AT mahnkenjonathand inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection AT gajewskibyronj inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection AT duntonnancy inputdataqualitycontrolforndnqinationalcomparativestatisticsandquarterlyreportsacontrastofthreerobustscaleestimatorsformultipleoutlierdetection |