Cargando…

Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare

BACKGROUND: Sampling a small number of participants from an entire country is not straightforward. In this case, researchers reluctantly sample from a single setting or few settings, which limits the generalizability of findings. Therefore, there is a need to design efficient sampling method for sma...

Descripción completa

Detalles Bibliográficos
Autores principales: Parsaeian, Mahboubeh, Mahdavi, Mahdi, Saadati, Mojdeh, Mehdipour, Parinaz, Sheidaei, Ali, Khatibzadeh, Shahab, Farzadfar, Farshad, Shahraz, Saeid
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2021
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8285867/
https://www.ncbi.nlm.nih.gov/pubmed/34273940
http://dx.doi.org/10.1186/s12889-021-11441-0
_version_ 1783723634679873536
author Parsaeian, Mahboubeh
Mahdavi, Mahdi
Saadati, Mojdeh
Mehdipour, Parinaz
Sheidaei, Ali
Khatibzadeh, Shahab
Farzadfar, Farshad
Shahraz, Saeid
author_facet Parsaeian, Mahboubeh
Mahdavi, Mahdi
Saadati, Mojdeh
Mehdipour, Parinaz
Sheidaei, Ali
Khatibzadeh, Shahab
Farzadfar, Farshad
Shahraz, Saeid
author_sort Parsaeian, Mahboubeh
collection PubMed
description BACKGROUND: Sampling a small number of participants from an entire country is not straightforward. In this case, researchers reluctantly sample from a single setting or few settings, which limits the generalizability of findings. Therefore, there is a need to design efficient sampling method for small sample size surveys that can produce generalizable results at the country level. METHODS: Data comprised of twenty proxy variables to measure health services demands, structures, and outcomes of 413 districts of Iran. We used two data mining methods (hierarchical clustering method (HCM) and model-based clustering method (MCM)) to create homogenous groups of districts, i.e., strata based on these variables. We compared the internal and stability validity of the methods by statistical indices. An expert group checked the face validity of the methods, particularly regarding the total number of strata and the combination of districts in each stratum. The efficiency of selected method, which is measured by the inverse of variance, was compared with a simple random sampling (SRS) through simulation. The sampling design was tested in a national study in Iran, which aimed to evaluate the quality and costs of medical care for eight selected diseases by only recruiting 300 participants per disease at the country level. RESULTS: MCM and HCM divided the districts into eight and two clusters, respectively. The measures of internal and stability validity showed that clusters created by MCM were more separated, compact, and stable, thus forming our optimum strata. The probability of death from stroke, chronic obstructive pulmonary disease, and in-hospital mortality rate were the most important indicators that distinguished the eight strata. Based on the simulation results, MCM increased the efficiency of the sampling design up to 1.7 times compared to SRS. CONCLUSIONS: The use of data mining improved the efficiency of sampling up to 1.7 times greater than SRS and markedly reduced the number of strata to eight in the entire country. The proposed sampling design also identified key variables that could be used to classify districts in Iran for sampling from these target populations in the future studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-021-11441-0.
format Online
Article
Text
id pubmed-8285867
institution National Center for Biotechnology Information
language English
publishDate 2021
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-82858672021-07-19 Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare Parsaeian, Mahboubeh Mahdavi, Mahdi Saadati, Mojdeh Mehdipour, Parinaz Sheidaei, Ali Khatibzadeh, Shahab Farzadfar, Farshad Shahraz, Saeid BMC Public Health Research Article BACKGROUND: Sampling a small number of participants from an entire country is not straightforward. In this case, researchers reluctantly sample from a single setting or few settings, which limits the generalizability of findings. Therefore, there is a need to design efficient sampling method for small sample size surveys that can produce generalizable results at the country level. METHODS: Data comprised of twenty proxy variables to measure health services demands, structures, and outcomes of 413 districts of Iran. We used two data mining methods (hierarchical clustering method (HCM) and model-based clustering method (MCM)) to create homogenous groups of districts, i.e., strata based on these variables. We compared the internal and stability validity of the methods by statistical indices. An expert group checked the face validity of the methods, particularly regarding the total number of strata and the combination of districts in each stratum. The efficiency of selected method, which is measured by the inverse of variance, was compared with a simple random sampling (SRS) through simulation. The sampling design was tested in a national study in Iran, which aimed to evaluate the quality and costs of medical care for eight selected diseases by only recruiting 300 participants per disease at the country level. RESULTS: MCM and HCM divided the districts into eight and two clusters, respectively. The measures of internal and stability validity showed that clusters created by MCM were more separated, compact, and stable, thus forming our optimum strata. The probability of death from stroke, chronic obstructive pulmonary disease, and in-hospital mortality rate were the most important indicators that distinguished the eight strata. Based on the simulation results, MCM increased the efficiency of the sampling design up to 1.7 times compared to SRS. CONCLUSIONS: The use of data mining improved the efficiency of sampling up to 1.7 times greater than SRS and markedly reduced the number of strata to eight in the entire country. The proposed sampling design also identified key variables that could be used to classify districts in Iran for sampling from these target populations in the future studies. SUPPLEMENTARY INFORMATION: The online version contains supplementary material available at 10.1186/s12889-021-11441-0. BioMed Central 2021-07-17 /pmc/articles/PMC8285867/ /pubmed/34273940 http://dx.doi.org/10.1186/s12889-021-11441-0 Text en © The Author(s) 2021 https://creativecommons.org/licenses/by/4.0/Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/ (https://creativecommons.org/licenses/by/4.0/) . The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/ (https://creativecommons.org/publicdomain/zero/1.0/) ) applies to the data made available in this article, unless otherwise stated in a credit line to the data.
spellingShingle Research Article
Parsaeian, Mahboubeh
Mahdavi, Mahdi
Saadati, Mojdeh
Mehdipour, Parinaz
Sheidaei, Ali
Khatibzadeh, Shahab
Farzadfar, Farshad
Shahraz, Saeid
Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
title Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
title_full Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
title_fullStr Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
title_full_unstemmed Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
title_short Introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
title_sort introducing an efficient sampling method for national surveys with limited sample sizes: application to a national study to determine quality and cost of healthcare
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC8285867/
https://www.ncbi.nlm.nih.gov/pubmed/34273940
http://dx.doi.org/10.1186/s12889-021-11441-0
work_keys_str_mv AT parsaeianmahboubeh introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT mahdavimahdi introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT saadatimojdeh introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT mehdipourparinaz introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT sheidaeiali introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT khatibzadehshahab introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT farzadfarfarshad introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare
AT shahrazsaeid introducinganefficientsamplingmethodfornationalsurveyswithlimitedsamplesizesapplicationtoanationalstudytodeterminequalityandcostofhealthcare