Cargando…
A grid-based sample design framework for household surveys
Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
F1000 Research Limited
2020
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7076148/ https://www.ncbi.nlm.nih.gov/pubmed/32211596 http://dx.doi.org/10.12688/gatesopenres.13107.1 |
_version_ | 1783507166607441920 |
---|---|
author | Boo, Gianluca Darin, Edith Thomson, Dana R. Tatem, Andrew J. |
author_facet | Boo, Gianluca Darin, Edith Thomson, Dana R. Tatem, Andrew J. |
author_sort | Boo, Gianluca |
collection | PubMed |
description | Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic settings. To tackle the need for representative sampling frames, we propose an original grid-based sample design framework introducing essential concepts of spatial sampling in household surveys. In this framework, the sampling frame is defined based on gridded population estimates and formalized as a bi-dimensional random field, characterized by spatial trends, spatial autocorrelation, and stratification. The sampling design reflects the characteristics of the random field by combining contextual stratification and proportional to population size sampling. A nonparametric estimator is applied to evaluate the sampling design and inform sample size estimation. We demonstrate an application of the proposed framework through a case study developed in two provinces located in the western part of the Democratic Republic of the Congo. We define a sampling frame consisting of settled cells with associated population estimates. We then perform a contextual stratification by applying a principal component analysis (PCA) and k-means clustering to a set of gridded geospatial covariates, and sample settled cells proportionally to population size. Lastly, we evaluate the sampling design by contrasting the empirical cumulative distribution function for the entire population of interest and its weighted counterpart across different sample sizes and identify an adequate sample size using the Kolmogorov-Smirnov distance between the two functions. The results of the case study underscore the strengths and limitations of the proposed grid-based sample design framework and foster further research into the application of spatial sampling concepts in household surveys. |
format | Online Article Text |
id | pubmed-7076148 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2020 |
publisher | F1000 Research Limited |
record_format | MEDLINE/PubMed |
spelling | pubmed-70761482020-03-23 A grid-based sample design framework for household surveys Boo, Gianluca Darin, Edith Thomson, Dana R. Tatem, Andrew J. Gates Open Res Method Article Traditional sample designs for household surveys are contingent upon the availability of a representative primary sampling frame. This is defined using enumeration units and population counts retrieved from decennial national censuses that can become rapidly inaccurate in highly dynamic demographic settings. To tackle the need for representative sampling frames, we propose an original grid-based sample design framework introducing essential concepts of spatial sampling in household surveys. In this framework, the sampling frame is defined based on gridded population estimates and formalized as a bi-dimensional random field, characterized by spatial trends, spatial autocorrelation, and stratification. The sampling design reflects the characteristics of the random field by combining contextual stratification and proportional to population size sampling. A nonparametric estimator is applied to evaluate the sampling design and inform sample size estimation. We demonstrate an application of the proposed framework through a case study developed in two provinces located in the western part of the Democratic Republic of the Congo. We define a sampling frame consisting of settled cells with associated population estimates. We then perform a contextual stratification by applying a principal component analysis (PCA) and k-means clustering to a set of gridded geospatial covariates, and sample settled cells proportionally to population size. Lastly, we evaluate the sampling design by contrasting the empirical cumulative distribution function for the entire population of interest and its weighted counterpart across different sample sizes and identify an adequate sample size using the Kolmogorov-Smirnov distance between the two functions. The results of the case study underscore the strengths and limitations of the proposed grid-based sample design framework and foster further research into the application of spatial sampling concepts in household surveys. F1000 Research Limited 2020-01-27 /pmc/articles/PMC7076148/ /pubmed/32211596 http://dx.doi.org/10.12688/gatesopenres.13107.1 Text en Copyright: © 2020 Boo G et al. http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution Licence, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. |
spellingShingle | Method Article Boo, Gianluca Darin, Edith Thomson, Dana R. Tatem, Andrew J. A grid-based sample design framework for household surveys |
title | A grid-based sample design framework for household surveys |
title_full | A grid-based sample design framework for household surveys |
title_fullStr | A grid-based sample design framework for household surveys |
title_full_unstemmed | A grid-based sample design framework for household surveys |
title_short | A grid-based sample design framework for household surveys |
title_sort | grid-based sample design framework for household surveys |
topic | Method Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7076148/ https://www.ncbi.nlm.nih.gov/pubmed/32211596 http://dx.doi.org/10.12688/gatesopenres.13107.1 |
work_keys_str_mv | AT boogianluca agridbasedsampledesignframeworkforhouseholdsurveys AT darinedith agridbasedsampledesignframeworkforhouseholdsurveys AT thomsondanar agridbasedsampledesignframeworkforhouseholdsurveys AT tatemandrewj agridbasedsampledesignframeworkforhouseholdsurveys AT boogianluca gridbasedsampledesignframeworkforhouseholdsurveys AT darinedith gridbasedsampledesignframeworkforhouseholdsurveys AT thomsondanar gridbasedsampledesignframeworkforhouseholdsurveys AT tatemandrewj gridbasedsampledesignframeworkforhouseholdsurveys |