Cargando…

A grid-based sample design framework for household surveys

Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic...

Descripción completa

Detalles Bibliográficos
Autores principales: Boo, Gianluca, Darin, Edith, Thomson, Dana R., Tatem, Andrew J.
Formato: Online Artículo Texto
Lenguaje:English
Publicado: F1000 Research Limited 2020
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7076148/
https://www.ncbi.nlm.nih.gov/pubmed/32211596
http://dx.doi.org/10.12688/gatesopenres.13107.1
_version_ 1783507166607441920
author Boo, Gianluca
Darin, Edith
Thomson, Dana R.
Tatem, Andrew J.
author_facet Boo, Gianluca
Darin, Edith
Thomson, Dana R.
Tatem, Andrew J.
author_sort Boo, Gianluca
collection PubMed
description Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic settings. To tackle the need for representative sampling frames, we propose an original grid-based sample design framework introducing essential concepts of spatial sampling in household surveys. In this framework, the sampling frame is defined based on gridded population estimates and formalized as a bi-dimensional random field, characterized by spatial trends, spatial autocorrelation, and stratification. The sampling design reflects the characteristics of the random field by combining contextual stratification and proportional to population size sampling. A nonparametric estimator is applied to evaluate the sampling design and inform sample size estimation. We demonstrate an application of the proposed framework through a case study developed in two provinces located in the western part of the Democratic Republic of the Congo. We define a sampling frame consisting of settled cells with associated population estimates. We then perform a contextual stratification by applying a principal component analysis (PCA) and k-means clustering to a set of gridded geospatial covariates, and sample settled cells proportionally to population size. Lastly, we evaluate the sampling design by contrasting the empirical cumulative distribution function for the entire population of interest and its weighted counterpart across different sample sizes and identify an adequate sample size using the Kolmogorov-Smirnov distance between the two functions. The results of the case study underscore the strengths and limitations of the proposed grid-based sample design framework and foster further research into the application of spatial sampling concepts in household surveys.
format Online
Article
Text
id pubmed-7076148
institution National Center for Biotechnology Information
language English
publishDate 2020
publisher F1000 Research Limited
record_format MEDLINE/PubMed
spelling pubmed-70761482020-03-23 A grid-based sample design framework for household surveys Boo, Gianluca Darin, Edith Thomson, Dana R. Tatem, Andrew J. Gates Open Res Method Article Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic settings. To tackle the need for representative sampling frames, we propose an original grid-based sample design framework introducing essential concepts of spatial sampling in household surveys. In this framework, the sampling frame is defined based on gridded population estimates and formalized as a bi-dimensional random field, characterized by spatial trends, spatial autocorrelation, and stratification. The sampling design reflects the characteristics of the random field by combining contextual stratification and proportional to population size sampling. A nonparametric estimator is applied to evaluate the sampling design and inform sample size estimation. We demonstrate an application of the proposed framework through a case study developed in two provinces located in the western part of the Democratic Republic of the Congo. We define a sampling frame consisting of settled cells with associated population estimates. We then perform a contextual stratification by applying a principal component analysis (PCA) and k-means clustering to a set of gridded geospatial covariates, and sample settled cells proportionally to population size. Lastly, we evaluate the sampling design by contrasting the empirical cumulative distribution function for the entire population of interest and its weighted counterpart across different sample sizes and identify an adequate sample size using the Kolmogorov-Smirnov distance between the two functions. The results of the case study underscore the strengths and limitations of the proposed grid-based sample design framework and foster further research into the application of spatial sampling concepts in household surveys. F1000 Research Limited 2020-01-27 /pmc/articles/PMC7076148/ /pubmed/32211596 http://dx.doi.org/10.12688/gatesopenres.13107.1 Text en Copyright: © 2020 Boo G et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
spellingShingle Method Article
Boo, Gianluca
Darin, Edith
Thomson, Dana R.
Tatem, Andrew J.
A grid-based sample design framework for household surveys
title A grid-based sample design framework for household surveys
title_full A grid-based sample design framework for household surveys
title_fullStr A grid-based sample design framework for household surveys
title_full_unstemmed A grid-based sample design framework for household surveys
title_short A grid-based sample design framework for household surveys
title_sort grid-based sample design framework for household surveys
topic Method Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7076148/
https://www.ncbi.nlm.nih.gov/pubmed/32211596
http://dx.doi.org/10.12688/gatesopenres.13107.1
work_keys_str_mv AT boogianluca agridbasedsampledesignframeworkforhouseholdsurveys
AT darinedith agridbasedsampledesignframeworkforhouseholdsurveys
AT thomsondanar agridbasedsampledesignframeworkforhouseholdsurveys
AT tatemandrewj agridbasedsampledesignframeworkforhouseholdsurveys
AT boogianluca gridbasedsampledesignframeworkforhouseholdsurveys
AT darinedith gridbasedsampledesignframeworkforhouseholdsurveys
AT thomsondanar gridbasedsampledesignframeworkforhouseholdsurveys
AT tatemandrewj gridbasedsampledesignframeworkforhouseholdsurveys