Cargando…

The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments

BACKGROUND: Traditional Lot Quality Assurance Sampling (LQAS) designs assume observations are collected using simple random sampling. Alternatively, randomly sampling clusters of observations and then individuals within clusters reduces costs but decreases the precision of the classifications. In th...

Descripción completa

Detalles Bibliográficos
Autores principales: Hedt-Gauthier, Bethany L, Mitsunaga, Tisha, Hund, Lauren, Olives, Casey, Pagano, Marcello
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2013
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3819670/
https://www.ncbi.nlm.nih.gov/pubmed/24160725
http://dx.doi.org/10.1186/1742-7622-10-11
_version_ 1782290018431139840
author Hedt-Gauthier, Bethany L
Mitsunaga, Tisha
Hund, Lauren
Olives, Casey
Pagano, Marcello
author_facet Hedt-Gauthier, Bethany L
Mitsunaga, Tisha
Hund, Lauren
Olives, Casey
Pagano, Marcello
author_sort Hedt-Gauthier, Bethany L
collection PubMed
description BACKGROUND: Traditional Lot Quality Assurance Sampling (LQAS) designs assume observations are collected using simple random sampling. Alternatively, randomly sampling clusters of observations and then individuals within clusters reduces costs but decreases the precision of the classifications. In this paper, we develop a general framework for designing the cluster(C)-LQAS system and illustrate the method with the design of data quality assessments for the community health worker program in Rwanda. RESULTS: To determine sample size and decision rules for C-LQAS, we use the beta-binomial distribution to account for inflated risk of errors introduced by sampling clusters at the first stage. We present general theory and code for sample size calculations. The C-LQAS sample sizes provided in this paper constrain misclassification risks below user-specified limits. Multiple C-LQAS systems meet the specified risk requirements, but numerous considerations, including per-cluster versus per-individual sampling costs, help identify optimal systems for distinct applications. CONCLUSIONS: We show the utility of C-LQAS for data quality assessments, but the method generalizes to numerous applications. This paper provides the necessary technical detail and supplemental code to support the design of C-LQAS for specific programs.
format Online
Article
Text
id pubmed-3819670
institution National Center for Biotechnology Information
language English
publishDate 2013
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-38196702013-11-11 The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments Hedt-Gauthier, Bethany L Mitsunaga, Tisha Hund, Lauren Olives, Casey Pagano, Marcello Emerg Themes Epidemiol Methodology BACKGROUND: Traditional Lot Quality Assurance Sampling (LQAS) designs assume observations are collected using simple random sampling. Alternatively, randomly sampling clusters of observations and then individuals within clusters reduces costs but decreases the precision of the classifications. In this paper, we develop a general framework for designing the cluster(C)-LQAS system and illustrate the method with the design of data quality assessments for the community health worker program in Rwanda. RESULTS: To determine sample size and decision rules for C-LQAS, we use the beta-binomial distribution to account for inflated risk of errors introduced by sampling clusters at the first stage. We present general theory and code for sample size calculations. The C-LQAS sample sizes provided in this paper constrain misclassification risks below user-specified limits. Multiple C-LQAS systems meet the specified risk requirements, but numerous considerations, including per-cluster versus per-individual sampling costs, help identify optimal systems for distinct applications. CONCLUSIONS: We show the utility of C-LQAS for data quality assessments, but the method generalizes to numerous applications. This paper provides the necessary technical detail and supplemental code to support the design of C-LQAS for specific programs. BioMed Central 2013-10-26 /pmc/articles/PMC3819670/ /pubmed/24160725 http://dx.doi.org/10.1186/1742-7622-10-11 Text en Copyright © 2013 Hedt-Gauthier et al.; licensee BioMed Central Ltd. http://creativecommons.org/licenses/by/2.0 This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Methodology
Hedt-Gauthier, Bethany L
Mitsunaga, Tisha
Hund, Lauren
Olives, Casey
Pagano, Marcello
The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
title The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
title_full The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
title_fullStr The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
title_full_unstemmed The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
title_short The effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
title_sort effect of clustering on lot quality assurance sampling: a probabilistic model to calculate sample sizes for quality assessments
topic Methodology
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC3819670/
https://www.ncbi.nlm.nih.gov/pubmed/24160725
http://dx.doi.org/10.1186/1742-7622-10-11
work_keys_str_mv AT hedtgauthierbethanyl theeffectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT mitsunagatisha theeffectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT hundlauren theeffectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT olivescasey theeffectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT paganomarcello theeffectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT hedtgauthierbethanyl effectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT mitsunagatisha effectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT hundlauren effectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT olivescasey effectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments
AT paganomarcello effectofclusteringonlotqualityassurancesamplingaprobabilisticmodeltocalculatesamplesizesforqualityassessments