Cargando…

Count data models for outpatient health services utilisation

BACKGROUND: Count data from the national survey captures healthcare utilisation within a specific reference period, resulting in excess zeros and skewed positive tails. Often, it is modelled using count data models. This study aims to identify the best-fitting model for outpatient healthcare utilisa...

Descripción completa

Detalles Bibliográficos
Autores principales: Abu Bakar, Nurul Salwana, Ab Hamid, Jabrullah, Mohd Nor Sham, Mohd Shaiful Jefri, Sham, Mohd Nor, Jailani, Anis Syakira
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533534/
https://www.ncbi.nlm.nih.gov/pubmed/36199028
http://dx.doi.org/10.1186/s12874-022-01733-3
_version_ 1784802367101730816
author Abu Bakar, Nurul Salwana
Ab Hamid, Jabrullah
Mohd Nor Sham, Mohd Shaiful Jefri
Sham, Mohd Nor
Jailani, Anis Syakira
author_facet Abu Bakar, Nurul Salwana
Ab Hamid, Jabrullah
Mohd Nor Sham, Mohd Shaiful Jefri
Sham, Mohd Nor
Jailani, Anis Syakira
author_sort Abu Bakar, Nurul Salwana
collection PubMed
description BACKGROUND: Count data from the national survey captures healthcare utilisation within a specific reference period, resulting in excess zeros and skewed positive tails. Often, it is modelled using count data models. This study aims to identify the best-fitting model for outpatient healthcare utilisation using data from the Malaysian National Health and Morbidity Survey 2019 (NHMS 2019) and utilisation factors among adults in Malaysia. METHODS: The frequency of outpatient visits is the dependent variable, and instrumental variable selection is based on Andersen’s model. Six different models were used: ordinary least squares (OLS), Poisson regression, negative binomial regression (NB), inflated models: zero-inflated Poisson, marginalized-zero-inflated negative binomial (MZINB), and hurdle model. Identification of the best-fitting model was based on model selection criteria, goodness-of-fit and statistical test of the factors associated with outpatient visits. RESULTS: The frequency of zero was 90%. Of the sample, 8.35% of adults utilized healthcare services only once, and 1.04% utilized them twice. The mean-variance value varied between 0.14 and 0.39. Across six models, the zero-inflated model (ZIM) possesses the smallest log-likelihood, Akaike information criterion, Bayesian information criterion, and a positive Vuong corrected value. Fourteen instrumental variables, five predisposing factors, six enablers, and three need factors were identified. Data overdispersion is characterized by excess zeros, a large mean to variance value, and skewed positive tails. We assumed frequency and true zeros throughout the study reference period. ZIM is the best-fitting model based on the model selection criteria, smallest Root Mean Square Error (RMSE) and higher R2. Both Vuong corrected and uncorrected values with different Stata commands yielded positive values with small differences. CONCLUSION: State as a place of residence, ethnicity, household income quintile, and health needs were significantly associated with healthcare utilisation. Our findings suggest using ZIM over traditional OLS. This study encourages the use of this count data model as it has a better fit, is easy to interpret, and has appropriate assumptions based on the survey methodology. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01733-3.
format Online
Article
Text
id pubmed-9533534
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-95335342022-10-06 Count data models for outpatient health services utilisation Abu Bakar, Nurul Salwana Ab Hamid, Jabrullah Mohd Nor Sham, Mohd Shaiful Jefri Sham, Mohd Nor Jailani, Anis Syakira BMC Med Res Methodol Research BACKGROUND: Count data from the national survey captures healthcare utilisation within a specific reference period, resulting in excess zeros and skewed positive tails. Often, it is modelled using count data models. This study aims to identify the best-fitting model for outpatient healthcare utilisation using data from the Malaysian National Health and Morbidity Survey 2019 (NHMS 2019) and utilisation factors among adults in Malaysia. METHODS: The frequency of outpatient visits is the dependent variable, and instrumental variable selection is based on Andersen’s model. Six different models were used: ordinary least squares (OLS), Poisson regression, negative binomial regression (NB), inflated models: zero-inflated Poisson, marginalized-zero-inflated negative binomial (MZINB), and hurdle model. Identification of the best-fitting model was based on model selection criteria, goodness-of-fit and statistical test of the factors associated with outpatient visits. RESULTS: The frequency of zero was 90%. Of the sample, 8.35% of adults utilized healthcare services only once, and 1.04% utilized them twice. The mean-variance value varied between 0.14 and 0.39. Across six models, the zero-inflated model (ZIM) possesses the smallest log-likelihood, Akaike information criterion, Bayesian information criterion, and a positive Vuong corrected value. Fourteen instrumental variables, five predisposing factors, six enablers, and three need factors were identified. Data overdispersion is characterized by excess zeros, a large mean to variance value, and skewed positive tails. We assumed frequency and true zeros throughout the study reference period. ZIM is the best-fitting model based on the model selection criteria, smallest Root Mean Square Error (RMSE) and higher R2. Both Vuong corrected and uncorrected values with different Stata commands yielded positive values with small differences. CONCLUSION: State as a place of residence, ethnicity, household income quintile, and health needs were significantly associated with healthcare utilisation. Our findings suggest using ZIM over traditional OLS. This study encourages the use of this count data model as it has a better fit, is easy to interpret, and has appropriate assumptions based on the survey methodology. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12874-022-01733-3. BioMed Central 2022-10-05 /pmc/articles/PMC9533534/ /pubmed/36199028 http://dx.doi.org/10.1186/s12874-022-01733-3 Text en © The Author(s) 2022 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research
Abu Bakar, Nurul Salwana
Ab Hamid, Jabrullah
Mohd Nor Sham, Mohd Shaiful Jefri
Sham, Mohd Nor
Jailani, Anis Syakira
Count data models for outpatient health services utilisation
title Count data models for outpatient health services utilisation
title_full Count data models for outpatient health services utilisation
title_fullStr Count data models for outpatient health services utilisation
title_full_unstemmed Count data models for outpatient health services utilisation
title_short Count data models for outpatient health services utilisation
title_sort count data models for outpatient health services utilisation
topic Research
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9533534/
https://www.ncbi.nlm.nih.gov/pubmed/36199028
http://dx.doi.org/10.1186/s12874-022-01733-3
work_keys_str_mv AT abubakarnurulsalwana countdatamodelsforoutpatienthealthservicesutilisation
AT abhamidjabrullah countdatamodelsforoutpatienthealthservicesutilisation
AT mohdnorshammohdshaifuljefri countdatamodelsforoutpatienthealthservicesutilisation
AT shammohdnor countdatamodelsforoutpatienthealthservicesutilisation
AT jailanianissyakira countdatamodelsforoutpatienthealthservicesutilisation