Cargando…
Count data models for outpatient health services utilisation
BACKGROUND: Count data from the national survey captures healthcare utilisation within a specific reference period, resulting in excess zeros and skewed positive tails. Often, it is modelled using count data models. This study aims to identify the best-fitting model for outpatient healthcare utilisa...
Autores principales: | , , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
BioMed Central
2022
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533534/ https://www.ncbi.nlm.nih.gov/pubmed/36199028 http://dx.doi.org/10.1186/s12874-022-01733-3 |
_version_ | 1784802367101730816 |
---|---|
author | Abu Bakar, Nurul Salwana Ab Hamid, Jabrullah Mohd Nor Sham, Mohd Shaiful Jefri Sham, Mohd Nor Jailani, Anis Syakira |
author_facet | Abu Bakar, Nurul Salwana Ab Hamid, Jabrullah Mohd Nor Sham, Mohd Shaiful Jefri Sham, Mohd Nor Jailani, Anis Syakira |
author_sort | Abu Bakar, Nurul Salwana |
collection | PubMed |
description | BACKGROUND: Count data from the national survey captures healthcare utilisation within a specific reference period, resulting in excess zeros and skewed positive tails. Often, it is modelled using count data models. This study aims to identify the best-fitting model for outpatient healthcare utilisation using data from the Malaysian National Health and Morbidity Survey 2019 (NHMS 2019) and utilisation factors among adults in Malaysia. METHODS: The frequency of outpatient visits is the dependent variable, and instrumental variable selection is based on Andersen’s model. Six different models were used: ordinary least squares (OLS), Poisson regression, negative binomial regression (NB), inflated models: zero-inflated Poisson, marginalized-zero-inflated negative binomial (MZINB), and hurdle model. Identification of the best-fitting model was based on model selection criteria, goodness-of-fit and statistical test of the factors associated with outpatient visits. RESULTS: The frequency of zero was 90%. Of the sample, 8.35% of adults utilized healthcare services only once, and 1.04% utilized them twice. The mean-variance value varied between 0.14 and 0.39. Across six models, the zero-inflated model (ZIM) possesses the smallest log-likelihood, Akaike information criterion, Bayesian information criterion, and a positive Vuong corrected value. Fourteen instrumental variables, five predisposing factors, six enablers, and three need factors were identified. Data overdispersion is characterized by excess zeros, a large mean to variance value, and skewed positive tails. We assumed frequency and true zeros throughout the study reference period. ZIM is the best-fitting model based on the model selection criteria, smallest Root Mean Square Error (RMSE) and higher R2. Both Vuong corrected and uncorrected values with different Stata commands yielded positive values with small differences. CONCLUSION: State as a place of residence, ethnicity, household income quintile, and health needs were significantly associated with healthcare utilisation. Our findings suggest using ZIM over traditional OLS. This study encourages the use of this count data model as it has a better fit, is easy to interpret, and has appropriate assumptions based on the survey methodology. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01733-3. |
format | Online Article Text |
id | pubmed-9533534 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2022 |
publisher | BioMed Central |
record_format | MEDLINE/PubMed |
spelling | pubmed-95335342022-10-06 Count data models for outpatient health services utilisation Abu Bakar, Nurul Salwana Ab Hamid, Jabrullah Mohd Nor Sham, Mohd Shaiful Jefri Sham, Mohd Nor Jailani, Anis Syakira BMC Med Res Methodol Research BACKGROUND: Count data from the national survey captures healthcare utilisation within a specific reference period, resulting in excess zeros and skewed positive tails. Often, it is modelled using count data models. This study aims to identify the best-fitting model for outpatient healthcare utilisation using data from the Malaysian National Health and Morbidity Survey 2019 (NHMS 2019) and utilisation factors among adults in Malaysia. METHODS: The frequency of outpatient visits is the dependent variable, and instrumental variable selection is based on Andersen’s model. Six different models were used: ordinary least squares (OLS), Poisson regression, negative binomial regression (NB), inflated models: zero-inflated Poisson, marginalized-zero-inflated negative binomial (MZINB), and hurdle model. Identification of the best-fitting model was based on model selection criteria, goodness-of-fit and statistical test of the factors associated with outpatient visits. RESULTS: The frequency of zero was 90%. Of the sample, 8.35% of adults utilized healthcare services only once, and 1.04% utilized them twice. The mean-variance value varied between 0.14 and 0.39. Across six models, the zero-inflated model (ZIM) possesses the smallest log-likelihood, Akaike information criterion, Bayesian information criterion, and a positive Vuong corrected value. Fourteen instrumental variables, five predisposing factors, six enablers, and three need factors were identified. Data overdispersion is characterized by excess zeros, a large mean to variance value, and skewed positive tails. We assumed frequency and true zeros throughout the study reference period. ZIM is the best-fitting model based on the model selection criteria, smallest Root Mean Square Error (RMSE) and higher R2. Both Vuong corrected and uncorrected values with different Stata commands yielded positive values with small differences. CONCLUSION: State as a place of residence, ethnicity, household income quintile, and health needs were significantly associated with healthcare utilisation. Our findings suggest using ZIM over traditional OLS. This study encourages the use of this count data model as it has a better fit, is easy to interpret, and has appropriate assumptions based on the survey methodology. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01733-3. BioMed Central 2022-10-05 /pmc/articles/PMC9533534/ /pubmed/36199028 http://dx.doi.org/10.1186/s12874-022-01733-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data. |
spellingShingle | Research Abu Bakar, Nurul Salwana Ab Hamid, Jabrullah Mohd Nor Sham, Mohd Shaiful Jefri Sham, Mohd Nor Jailani, Anis Syakira Count data models for outpatient health services utilisation |
title |
Count data models for outpatient health services utilisation
|
title_full |
Count data models for outpatient health services utilisation
|
title_fullStr |
Count data models for outpatient health services utilisation
|
title_full_unstemmed |
Count data models for outpatient health services utilisation
|
title_short |
Count data models for outpatient health services utilisation
|
title_sort | count data models for outpatient health services utilisation |
topic | Research |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533534/ https://www.ncbi.nlm.nih.gov/pubmed/36199028 http://dx.doi.org/10.1186/s12874-022-01733-3 |
work_keys_str_mv | AT abubakarnurulsalwana countdatamodelsforoutpatienthealthservicesutilisation AT abhamidjabrullah countdatamodelsforoutpatienthealthservicesutilisation AT mohdnorshammohdshaifuljefri countdatamodelsforoutpatienthealthservicesutilisation AT shammohdnor countdatamodelsforoutpatienthealthservicesutilisation AT jailanianissyakira countdatamodelsforoutpatienthealthservicesutilisation |