Cargando…

Adjusting selection bias in German health insurance records for regional prevalence estimation

BACKGROUND: Regional prevalence estimation requires epidemiologic data with substantial local detail. National health surveys may lack in sufficient local observations due to limited resources. Therefore, corresponding prevalence estimates may not capture regional morbidity patterns with the necessa...

Descripción completa

Detalles Bibliográficos
Autores principales: Münnich, Ralf Thomas, Burgard, Jan Pablo, Krause, Joscha
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2019
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6712777/
https://www.ncbi.nlm.nih.gov/pubmed/31455350
http://dx.doi.org/10.1186/s12963-019-0189-5
_version_ 1783446749258448896
author Münnich, Ralf Thomas
Burgard, Jan Pablo
Krause, Joscha
author_facet Münnich, Ralf Thomas
Burgard, Jan Pablo
Krause, Joscha
author_sort Münnich, Ralf Thomas
collection PubMed
description BACKGROUND: Regional prevalence estimation requires epidemiologic data with substantial local detail. National health surveys may lack in sufficient local observations due to limited resources. Therefore, corresponding prevalence estimates may not capture regional morbidity patterns with the necessary accuracy. Health insurance records represent alternative data sources for this purpose. Fund-specific member populations have more local observations than surveys, which benefits regional prevalence estimation. However, due to national insurance market regulations, insurance membership can be informative for morbidity. Regional fund-specific prevalence proportions are selective in the sense that the morbidity structure of a fund’s member population cannot be extrapolated to the national population. This implies a selection bias that marks a major obstacle for statistical inference. We provide a methodology to adjust fund-specific selectivity and perform regional prevalence estimation from health insurance records. The methodology is applied to estimate regional cohort-referenced diabetes mellitus type 2 prevalence in Germany. METHODS: Records of the German Public Health Insurance Company from 2014 and Diagnosis-Related Group Statistics data are combined within a benchmarked multi-level model. The fund-specific selectivity is adjusted in a two-step procedure. Firstly, the conditional expectation of the insurance company’s regional prevalence given related inpatient diagnosis frequencies of its members is quantified. Secondly, the regional prevalence is estimated by extrapolating the conditional expectation using corresponding inpatient diagnosis frequencies of the Diagnosis-Related Group Statistics as benchmarks. Model assumptions are validated via Monte Carlo simulation. Variable selection is performed via multivariate methods. The optimal model fit is determined by analysis of variance. 95% confidence intervals for the estimates are constructed via semiparametric bootstrapping. RESULTS: The national diabetes mellitus type 2 prevalence is estimated at 8.70% with a 95% confidence interval of [8.48%, 9.35%]. This indicates an adjustment of the original fund-specific prevalence from − 32.79 to − 25.93%. The estimated disease distribution shows significant morbidity differences between regions, especially between eastern and western Germany. However, the cohort-referenced estimates suggest that these differences can be partially explained by regional demography. CONCLUSIONS: The proposed methodology allows regional prevalence estimation in remarkable detail despite fund-specific selectivity. This enhances and encourages the use of health insurance records for future epidemiologic studies.
format Online
Article
Text
id pubmed-6712777
institution National Center for Biotechnology Information
language English
publishDate 2019
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-67127772019-08-29 Adjusting selection bias in German health insurance records for regional prevalence estimation Münnich, Ralf Thomas Burgard, Jan Pablo Krause, Joscha Popul Health Metr Research BACKGROUND: Regional prevalence estimation requires epidemiologic data with substantial local detail. National health surveys may lack in sufficient local observations due to limited resources. Therefore, corresponding prevalence estimates may not capture regional morbidity patterns with the necessary accuracy. Health insurance records represent alternative data sources for this purpose. Fund-specific member populations have more local observations than surveys, which benefits regional prevalence estimation. However, due to national insurance market regulations, insurance membership can be informative for morbidity. Regional fund-specific prevalence proportions are selective in the sense that the morbidity structure of a fund’s member population cannot be extrapolated to the national population. This implies a selection bias that marks a major obstacle for statistical inference. We provide a methodology to adjust fund-specific selectivity and perform regional prevalence estimation from health insurance records. The methodology is applied to estimate regional cohort-referenced diabetes mellitus type 2 prevalence in Germany. METHODS: Records of the German Public Health Insurance Company from 2014 and Diagnosis-Related Group Statistics data are combined within a benchmarked multi-level model. The fund-specific selectivity is adjusted in a two-step procedure. Firstly, the conditional expectation of the insurance company’s regional prevalence given related inpatient diagnosis frequencies of its members is quantified. Secondly, the regional prevalence is estimated by extrapolating the conditional expectation using corresponding inpatient diagnosis frequencies of the Diagnosis-Related Group Statistics as benchmarks. Model assumptions are validated via Monte Carlo simulation. Variable selection is performed via multivariate methods. The optimal model fit is determined by analysis of variance. 95% confidence intervals for the estimates are constructed via semiparametric bootstrapping. RESULTS: The national diabetes mellitus type 2 prevalence is estimated at 8.70% with a 95% confidence interval of [8.48%, 9.35%]. This indicates an adjustment of the original fund-specific prevalence from − 32.79 to − 25.93%. The estimated disease distribution shows significant morbidity differences between regions, especially between eastern and western Germany. However, the cohort-referenced estimates suggest that these differences can be partially explained by regional demography. CONCLUSIONS: The proposed methodology allows regional prevalence estimation in remarkable detail despite fund-specific selectivity. This enhances and encourages the use of health insurance records for future epidemiologic studies. BioMed Central 2019-08-27 /pmc/articles/PMC6712777/ /pubmed/31455350 http://dx.doi.org/10.1186/s12963-019-0189-5 Text en © The Author(s). 2019 Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research
Münnich, Ralf Thomas
Burgard, Jan Pablo
Krause, Joscha
Adjusting selection bias in German health insurance records for regional prevalence estimation
title Adjusting selection bias in German health insurance records for regional prevalence estimation
title_full Adjusting selection bias in German health insurance records for regional prevalence estimation
title_fullStr Adjusting selection bias in German health insurance records for regional prevalence estimation
title_full_unstemmed Adjusting selection bias in German health insurance records for regional prevalence estimation
title_short Adjusting selection bias in German health insurance records for regional prevalence estimation
title_sort adjusting selection bias in german health insurance records for regional prevalence estimation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC6712777/
https://www.ncbi.nlm.nih.gov/pubmed/31455350
http://dx.doi.org/10.1186/s12963-019-0189-5
work_keys_str_mv AT munnichralfthomas adjustingselectionbiasingermanhealthinsurancerecordsforregionalprevalenceestimation
AT burgardjanpablo adjustingselectionbiasingermanhealthinsurancerecordsforregionalprevalenceestimation
AT krausejoscha adjustingselectionbiasingermanhealthinsurancerecordsforregionalprevalenceestimation