
Tweedie distributions for fitting semicontinuous health care utilization cost data

BACKGROUND: The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear models based on the Gaussian or Gamma distribution. A common way to counter this is the us...


Bibliographic Details
Main author: Kurz, Christoph F.
Format: Online Article Text
Language: English
Published: BioMed Central 2017
Subjects:
Online access: https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735804/
https://www.ncbi.nlm.nih.gov/pubmed/29258428
http://dx.doi.org/10.1186/s12874-017-0445-y
author Kurz, Christoph F.
collection PubMed
description BACKGROUND: The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear models based on the Gaussian or Gamma distribution. A common way to counter this is the use of two-part or Tobit models, which make interpretation of the results more difficult. In this study, I explore a statistical distribution from the Tweedie family of distributions that can simultaneously model the probability of a zero outcome, i.e. of being a non-user of health care, and the continuous costs for users. METHODS: I assess the usefulness of the Tweedie model in a Monte Carlo simulation study that addresses two common situations: low and high correlation between the users and the non-users of health care. Furthermore, I compare the Tweedie model with several other models using a real data set from the RAND health insurance experiment. RESULTS: I show that the Tweedie distribution fits cost data very well and provides a better fit, especially when the number of non-users is low and the correlation between users and non-users is high. CONCLUSION: The Tweedie distribution provides an interesting solution to many statistical problems in health economic analyses.
format Online
Article
Text
id pubmed-5735804
institution National Center for Biotechnology Information
language English
publishDate 2017
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-5735804 2017-12-21 Tweedie distributions for fitting semicontinuous health care utilization cost data Kurz, Christoph F. BMC Med Res Methodol Research Article
BioMed Central 2017-12-19 /pmc/articles/PMC5735804/ /pubmed/29258428 http://dx.doi.org/10.1186/s12874-017-0445-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
title Tweedie distributions for fitting semicontinuous health care utilization cost data
topic Research Article
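
The abstract's central claim, that a single Tweedie distribution covers both the probability of zero cost and the continuous costs of users, can be illustrated with a minimal sketch. It assumes the compound Poisson-gamma case of the Tweedie family (power parameter p strictly between 1 and 2), in which the outcome is a Poisson-distributed number of gamma-distributed amounts and the probability of an exact zero is exp(-lambda). The function names and the values mu = 100, phi = 50 and p = 1.5 are illustrative assumptions, not the simulation design of the article.

```python
import math
import random


def tweedie_params(mu, phi, p):
    """Map mean mu, dispersion phi and power p (1 < p < 2) to the
    compound Poisson-gamma parameters (rate, gamma shape, gamma scale)."""
    if not 1.0 < p < 2.0:
        raise ValueError("power p must lie strictly between 1 and 2")
    lam = mu ** (2.0 - p) / (phi * (2.0 - p))   # Poisson rate of cost events
    shape = (2.0 - p) / (p - 1.0)               # gamma shape of one event
    scale = phi * (p - 1.0) * mu ** (p - 1.0)   # gamma scale of one event
    return lam, shape, scale


def poisson_draw(lam, rng):
    """Poisson draw by Knuth's multiplication method (fine for small rates)."""
    threshold = math.exp(-lam)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def tweedie_draw(mu, phi, p, rng):
    """One draw: exactly zero with probability exp(-lam), else a positive cost."""
    lam, shape, scale = tweedie_params(mu, phi, p)
    n_events = poisson_draw(lam, rng)
    if n_events == 0:
        return 0.0                              # non-user: no cost at all
    # A sum of n_events gamma(shape, scale) amounts is gamma(n_events * shape, scale).
    return rng.gammavariate(n_events * shape, scale)


if __name__ == "__main__":
    rng = random.Random(2017)                   # hypothetical seed
    mu, phi, p = 100.0, 50.0, 1.5               # illustrative values only
    costs = [tweedie_draw(mu, phi, p, rng) for _ in range(20000)]
    lam, _, _ = tweedie_params(mu, phi, p)
    print("theoretical P(cost = 0):", math.exp(-lam))
    print("simulated share of zeros:", sum(c == 0.0 for c in costs) / len(costs))
    print("simulated mean cost:", sum(costs) / len(costs))
```

With these values the Poisson rate is 0.4, so roughly two thirds of simulated individuals are non-users (exp(-0.4) is about 0.67), while the overall mean cost stays at mu. One mean parameter therefore governs both parts of the data, which is the contrast the abstract draws with two-part models.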