Cargando…

Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy

OBJECTIVE: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a “learning health system.” Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstra...

Descripción completa

Detalles Bibliográficos
Autores principales:	Cadieux, Geneviève, Tamblyn, Robyn, Buckeridge, David L., Dendukuri, Nandini
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Lippincott Williams & Wilkins 2017
Materias:	Online Article: Applied Methods
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5510703/ https://www.ncbi.nlm.nih.gov/pubmed/25821898 http://dx.doi.org/10.1097/MLR.0000000000000324

_version_	1783250217843294208
author	Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini
author_facet	Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini
author_sort	Cadieux, Geneviève
collection	PubMed
description	OBJECTIVE: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a “learning health system.” Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstrates a lack of awareness of the need for a stratified sampling design and corresponding statistical methods. We propose a method for validating the measurement of diagnostic groups that have: (1) different prevalences of diagnostic codes within the group; and (2) low prevalence. METHODS: We describe an estimation method whereby: (1) low-prevalence diagnostic codes are oversampled, and the positive predictive value (PPV) of the diagnostic group is estimated as a weighted average of the PPV of each diagnostic code; and (2) claims that fall within a low-prevalence diagnostic group are oversampled relative to claims that are not, and bias-adjusted estimators of sensitivity and specificity are generated. APPLICATION: We illustrate our proposed method using an example from population health surveillance in which diagnostic groups are applied to physician claims to identify cases of acute respiratory illness. CONCLUSIONS: Failure to account for the prevalence of each diagnostic code within a diagnostic group leads to the underestimation of the PPV, because low-prevalence diagnostic codes are more likely to be false positives. Failure to adjust for oversampling of claims that fall within the low-prevalence diagnostic group relative to those that do not leads to the overestimation of sensitivity and underestimation of specificity.
format	Online Article Text
id	pubmed-5510703
institution	National Center for Biotechnology Information
language	English
publishDate	2017
publisher	Lippincott Williams & Wilkins
record_format	MEDLINE/PubMed
spelling	pubmed-55107032017-07-31 Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini Med Care Online Article: Applied Methods OBJECTIVE: Valid measurement of outcomes such as disease prevalence using health care utilization data is fundamental to the implementation of a “learning health system.” Definitions of such outcomes can be complex, based on multiple diagnostic codes. The literature on validating such data demonstrates a lack of awareness of the need for a stratified sampling design and corresponding statistical methods. We propose a method for validating the measurement of diagnostic groups that have: (1) different prevalences of diagnostic codes within the group; and (2) low prevalence. METHODS: We describe an estimation method whereby: (1) low-prevalence diagnostic codes are oversampled, and the positive predictive value (PPV) of the diagnostic group is estimated as a weighted average of the PPV of each diagnostic code; and (2) claims that fall within a low-prevalence diagnostic group are oversampled relative to claims that are not, and bias-adjusted estimators of sensitivity and specificity are generated. APPLICATION: We illustrate our proposed method using an example from population health surveillance in which diagnostic groups are applied to physician claims to identify cases of acute respiratory illness. CONCLUSIONS: Failure to account for the prevalence of each diagnostic code within a diagnostic group leads to the underestimation of the PPV, because low-prevalence diagnostic codes are more likely to be false positives. Failure to adjust for oversampling of claims that fall within the low-prevalence diagnostic group relative to those that do not leads to the overestimation of sensitivity and underestimation of specificity. Lippincott Williams & Wilkins 2017-08 2015-03-27 /pmc/articles/PMC5510703/ /pubmed/25821898 http://dx.doi.org/10.1097/MLR.0000000000000324 Text en Copyright © 2015 The Author(s). Published by Wolters Kluwer Health, Inc. This is an open-access article distributed under the terms of the Creative Commons Attribution-Non Commercial-No Derivatives License 4.0 (http://creativecommons.org/licenses/by-nc-nd/4.0/) (CCBY-NC-ND), where it is permissible to download and share the work provided it is properly cited. The work cannot be changed in any way or used commercially without permission from the journal. http://creativecommons.org/licenses/by-nc-nd/4.0/.
spellingShingle	Online Article: Applied Methods Cadieux, Geneviève Tamblyn, Robyn Buckeridge, David L. Dendukuri, Nandini Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
title	Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
title_full	Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
title_fullStr	Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
title_full_unstemmed	Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
title_short	Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy
title_sort	validation of diagnostic groups based on health care utilization data should adjust for sampling strategy
topic	Online Article: Applied Methods
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5510703/ https://www.ncbi.nlm.nih.gov/pubmed/25821898 http://dx.doi.org/10.1097/MLR.0000000000000324
work_keys_str_mv	AT cadieuxgenevieve validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy AT tamblynrobyn validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy AT buckeridgedavidl validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy AT dendukurinandini validationofdiagnosticgroupsbasedonhealthcareutilizationdatashouldadjustforsamplingstrategy

Validation of Diagnostic Groups Based on Health Care Utilization Data Should Adjust for Sampling Strategy

Ejemplares similares