Cargando…
Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
OBJECTIVE: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a “learning health system.” Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstra...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Lippincott Williams & Wilkins
2017
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5510703/ https://www.ncbi.nlm.nih.gov/pubmed/25821898 http://dx.doi.org/10.1097/MLR.0000000000000324 |
_version_ | 1783250217843294208 |
---|---|
author | Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini |
author_facet | Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini |
author_sort | Cadieux, Geneviève |
collection | PubMed |
description | OBJECTIVE: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a “learning health system.” Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstrates a lack of awareness of the need for a stratified sampling design and corresponding statistical methods. We propose a method for validating the measurement of diagnostic groups that have: (1) different prevalences of diagnostic codes within the group; and (2) low prevalence. METHODS: We describe an estimation method whereby: (1) low-prevalence diagnostic codes are oversampled, and the positive predictive value (PPV) of the diagnostic group is estimated as a weighted average of the PPV of each diagnostic code; and (2) claims that fall within a low-prevalence diagnostic group are oversampled relative to claims that are not, and bias-adjusted estimators of sensitivity and specificity are generated. APPLICATION: We illustrate our proposed method using an example from population health surveillance in which diagnostic groups are applied to physician claims to identify cases of acute respiratory illness. CONCLUSIONS: Failure to account for the prevalence of each diagnostic code within a diagnostic group leads to the underestimation of the PPV, because low-prevalence diagnostic codes are more likely to be false positives. Failure to adjust for oversampling of claims that fall within the low-prevalence diagnostic group relative to those that do not leads to the overestimation of sensitivity and underestimation of specificity. |
format | Online Article Text |
id | pubmed-5510703 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | Lippincott Williams & Wilkins |
record_format | MEDLINE/PubMed |
spelling | pubmed-55107032017-07-31 Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini Med Care Online Article: Applied Methods OBJECTIVE: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a “learning health system.” Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstrates a lack of awareness of the need for a stratified sampling design and corresponding statistical methods. We propose a method for validating the measurement of diagnostic groups that have: (1) different prevalences of diagnostic codes within the group; and (2) low prevalence. METHODS: We describe an estimation method whereby: (1) low-prevalence diagnostic codes are oversampled, and the positive predictive value (PPV) of the diagnostic group is estimated as a weighted average of the PPV of each diagnostic code; and (2) claims that fall within a low-prevalence diagnostic group are oversampled relative to claims that are not, and bias-adjusted estimators of sensitivity and specificity are generated. APPLICATION: We illustrate our proposed method using an example from population health surveillance in which diagnostic groups are applied to physician claims to identify cases of acute respiratory illness. CONCLUSIONS: Failure to account for the prevalence of each diagnostic code within a diagnostic group leads to the underestimation of the PPV, because low-prevalence diagnostic codes are more likely to be false positives. Failure to adjust for oversampling of claims that fall within the low-prevalence diagnostic group relative to those that do not leads to the overestimation of sensitivity and underestimation of specificity. Lippincott Williams & Wilkins 2017-08 2015-03-27 /pmc/articles/PMC5510703/ /pubmed/25821898 http://dx.doi.org/10.1097/MLR.0000000000000324 Text en Copyright © 2015 The Author(s). Published by Wolters Kluwer Health, Inc. This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (http://creativecommons.org/licenses/by-nc-nd/4.0/) (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal. http://creativecommons.org/licenses/by-nc-nd/4.0/. |
spellingShingle | Online Article: Applied Methods Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy |
title | Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy |
title_full | Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy |
title_fullStr | Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy |
title_full_unstemmed | Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy |
title_short | Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy |
title_sort | validation of diagnostic groups based on health care utilization data should adjust for sampling strategy |
topic | Online Article: Applied Methods |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5510703/ https://www.ncbi.nlm.nih.gov/pubmed/25821898 http://dx.doi.org/10.1097/MLR.0000000000000324 |
work_keys_str_mv | AT cadieuxgenevieve validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy AT tamblynrobyn validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy AT buckeridgedavidl validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy AT dendukurinandini validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy |