Cargando…
One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity
Size estimation is particularly important for populations whose members experience disproportionate health issues or pose elevated health risks to the ambient social structures in which they are embedded. Efforts to derive size estimates are often frustrated when the population is hidden or hard-to-...
Autores principales: | , , , |
---|---|
Formato: | Online Artículo Texto |
Lenguaje: | English |
Publicado: |
Public Library of Science
2018
|
Materias: | |
Acceso en línea: | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919671/ https://www.ncbi.nlm.nih.gov/pubmed/29698493 http://dx.doi.org/10.1371/journal.pone.0195959 |
_version_ | 1783317681038950400 |
---|---|
author | Khan, Bilal Lee, Hsuan-Wei Fellows, Ian Dombrowski, Kirk |
author_facet | Khan, Bilal Lee, Hsuan-Wei Fellows, Ian Dombrowski, Kirk |
author_sort | Khan, Bilal |
collection | PubMed |
description | Size estimation is particularly important for populations whose members experience disproportionate health issues or pose elevated health risks to the ambient social structures in which they are embedded. Efforts to derive size estimates are often frustrated when the population is hidden or hard-to-reach in ways that preclude conventional survey strategies, as is the case when social stigma is associated with group membership or when group members are involved in illegal activities. This paper extends prior research on the problem of network population size estimation, building on established survey/sampling methodologies commonly used with hard-to-reach groups. Three novel one-step, network-based population size estimators are presented, for use in the context of uniform random sampling, respondent-driven sampling, and when networks exhibit significant clustering effects. We give provably sufficient conditions for the consistency of these estimators in large configuration networks. Simulation experiments across a wide range of synthetic network topologies validate the performance of the estimators, which also perform well on a real-world location-based social networking data set with significant clustering. Finally, the proposed schemes are extended to allow them to be used in settings where participant anonymity is required. Systematic experiments show favorable tradeoffs between anonymity guarantees and estimator performance. Taken together, we demonstrate that reasonable population size estimates are derived from anonymous respondent driven samples of 250-750 individuals, within ambient populations of 5,000-40,000. The method thus represents a novel and cost-effective means for health planners and those agencies concerned with health and disease surveillance to estimate the size of hidden populations. We discuss limitations and future work in the concluding section. |
format | Online Article Text |
id | pubmed-5919671 |
institution | National Center for Biotechnology Information |
language | English |
publishDate | 2018 |
publisher | Public Library of Science |
record_format | MEDLINE/PubMed |
spelling | pubmed-59196712018-05-11 One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity Khan, Bilal Lee, Hsuan-Wei Fellows, Ian Dombrowski, Kirk PLoS One Research Article Size estimation is particularly important for populations whose members experience disproportionate health issues or pose elevated health risks to the ambient social structures in which they are embedded. Efforts to derive size estimates are often frustrated when the population is hidden or hard-to-reach in ways that preclude conventional survey strategies, as is the case when social stigma is associated with group membership or when group members are involved in illegal activities. This paper extends prior research on the problem of network population size estimation, building on established survey/sampling methodologies commonly used with hard-to-reach groups. Three novel one-step, network-based population size estimators are presented, for use in the context of uniform random sampling, respondent-driven sampling, and when networks exhibit significant clustering effects. We give provably sufficient conditions for the consistency of these estimators in large configuration networks. Simulation experiments across a wide range of synthetic network topologies validate the performance of the estimators, which also perform well on a real-world location-based social networking data set with significant clustering. Finally, the proposed schemes are extended to allow them to be used in settings where participant anonymity is required. Systematic experiments show favorable tradeoffs between anonymity guarantees and estimator performance. Taken together, we demonstrate that reasonable population size estimates are derived from anonymous respondent driven samples of 250-750 individuals, within ambient populations of 5,000-40,000. The method thus represents a novel and cost-effective means for health planners and those agencies concerned with health and disease surveillance to estimate the size of hidden populations. We discuss limitations and future work in the concluding section. Public Library of Science 2018-04-26 /pmc/articles/PMC5919671/ /pubmed/29698493 http://dx.doi.org/10.1371/journal.pone.0195959 Text en © 2018 Khan et al http://creativecommons.org/licenses/by/4.0/ This is an open access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0/) , which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. |
spellingShingle | Research Article Khan, Bilal Lee, Hsuan-Wei Fellows, Ian Dombrowski, Kirk One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity |
title | One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity |
title_full | One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity |
title_fullStr | One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity |
title_full_unstemmed | One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity |
title_short | One-step estimation of networked population size: Respondent-driven capture-recapture with anonymity |
title_sort | one-step estimation of networked population size: respondent-driven capture-recapture with anonymity |
topic | Research Article |
url | https://www.ncbi.nlm.nih.gov/pmc/articles/PMC5919671/ https://www.ncbi.nlm.nih.gov/pubmed/29698493 http://dx.doi.org/10.1371/journal.pone.0195959 |
work_keys_str_mv | AT khanbilal onestepestimationofnetworkedpopulationsizerespondentdrivencapturerecapturewithanonymity AT leehsuanwei onestepestimationofnetworkedpopulationsizerespondentdrivencapturerecapturewithanonymity AT fellowsian onestepestimationofnetworkedpopulationsizerespondentdrivencapturerecapturewithanonymity AT dombrowskikirk onestepestimationofnetworkedpopulationsizerespondentdrivencapturerecapturewithanonymity |