Tweedie distributions for fitting semicontinuous health care utilization cost data
BACKGROUND: The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear models based on the Gaussian or Gamma distribution. A common way to counter this is the us...
Main author: | Kurz, Christoph F. |
---|---|
Format: | Online Article Text |
Language: | English |
Published: | BioMed Central 2017 |
Subjects: | |
Online access: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735804/ https://www.ncbi.nlm.nih.gov/pubmed/29258428 http://dx.doi.org/10.1186/s12874-017-0445-y |
_version_ | 1783287272541519872 |
---|---|
author | Kurz, Christoph F. |
author_facet | Kurz, Christoph F. |
author_sort | Kurz, Christoph F. |
collection | PubMed |
description | BACKGROUND: The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear models based on the Gaussian or Gamma distribution. A common way to counter this is the use of Two-part or Tobit models, which makes interpretation of the results more difficult. In this study, I explore a statistical distribution from the Tweedie family of distributions that can simultaneously model the probability of zero outcome, i.e. of being a non-user of health care utilization and continuous costs for users. METHODS: I assess the usefulness of the Tweedie model in a Monte Carlo simulation study that addresses two common situations of low and high correlation of the users and the non-users of health care utilization. Furthermore, I compare the Tweedie model with several other models using a real data set from the RAND health insurance experiment. RESULTS: I show that the Tweedie distribution fits cost data very well and provides better fit, especially when the number of non-users is low and the correlation between users and non-users is high. CONCLUSION: The Tweedie distribution provides an interesting solution to many statistical problems in health economic analyses. |
format | Online Article Text |
id | pubmed-5735804 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2017 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-5735804 2017-12-21 Tweedie distributions for fitting semicontinuous health care utilization cost data Kurz, Christoph F. BMC Med Res Methodol Research Article BACKGROUND: The statistical analysis of health care cost data is often problematic because these data are usually non-negative, right-skewed and have excess zeros for non-users. This prevents the use of linear models based on the Gaussian or Gamma distribution. A common way to counter this is the use of Two-part or Tobit models, which makes interpretation of the results more difficult. In this study, I explore a statistical distribution from the Tweedie family of distributions that can simultaneously model the probability of zero outcome, i.e. of being a non-user of health care utilization and continuous costs for users. METHODS: I assess the usefulness of the Tweedie model in a Monte Carlo simulation study that addresses two common situations of low and high correlation of the users and the non-users of health care utilization. Furthermore, I compare the Tweedie model with several other models using a real data set from the RAND health insurance experiment. RESULTS: I show that the Tweedie distribution fits cost data very well and provides better fit, especially when the number of non-users is low and the correlation between users and non-users is high. CONCLUSION: The Tweedie distribution provides an interesting solution to many statistical problems in health economic analyses. BioMed Central 2017-12-19 /pmc/articles/PMC5735804/ /pubmed/29258428 http://dx.doi.org/10.1186/s12874-017-0445-y Text en © The Author(s) 2017 Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated. |
spellingShingle | Research Article Kurz, Christoph F. Tweedie distributions for fitting semicontinuous health care utilization cost data |
title | Tweedie distributions for fitting semicontinuous health care utilization cost data |
title_full | Tweedie distributions for fitting semicontinuous health care utilization cost data |
title_fullStr | Tweedie distributions for fitting semicontinuous health care utilization cost data |
title_full_unstemmed | Tweedie distributions for fitting semicontinuous health care utilization cost data |
title_short | Tweedie distributions for fitting semicontinuous health care utilization cost data |
title_sort | tweedie distributions for fitting semicontinuous health care utilization cost data |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5735804/ https://www.ncbi.nlm.nih.gov/pubmed/29258428 http://dx.doi.org/10.1186/s12874-017-0445-y |
work_keys_str_mv | AT kurzchristophf tweediedistributionsforfittingsemicontinuoushealthcareutilizationcostdata |