Cargando…

Comparative performance of multiple-list estimators of key population size

Estimates of the sizes of key populations (KPs) affected by HIV, including men who have sex with men, female sex workers and people who inject drugs, are required for targeting epidemic control efforts where they are most needed. Unfortunately, different estimators often produce discrepant results,...

Descripción completa

Detalles Bibliográficos
Autor principal: Gutreuter, Steve
Formato: Online Artículo Texto
Lenguaje:English
Publicado: Public Library of Science 2022
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345571/
https://www.ncbi.nlm.nih.gov/pubmed/35928219
http://dx.doi.org/10.1371/journal.pgph.0000155
_version_ 1784761463532945408
author Gutreuter, Steve
author_facet Gutreuter, Steve
author_sort Gutreuter, Steve
collection PubMed
description Estimates of the sizes of key populations (KPs) affected by HIV, including men who have sex with men, female sex workers and people who inject drugs, are required for targeting epidemic control efforts where they are most needed. Unfortunately, different estimators often produce discrepant results, and an objective basis for choice is lacking. This simulation study provides the first comparison of information-theoretic selection of loglinear models (LLM-AIC), Bayesian model averaging of loglinear models (LLM-BMA) and Bayesian nonparametric latent-class modeling (BLCM) for estimation of population size from multiple lists. Four hundred random samples from populations of size 1,000, 10,000 and 20,000, each including five encounter opportunities, were independently simulated using each of 30 data-generating models obtained from combinations of six patterns of variation in encounter probabilities and five expected per-list encounter probabilities, producing a total of 36,000 samples. Population size was estimated for each combination of sample and sequentially cumulative sets of 2–5 lists using LLM-AIC, LLM-BMA and BLCM. LLM-BMA and BLCM were quite robust and performed comparably in terms of root mean-squared error and bias, and outperformed LLM-AIC. All estimation methods produced uncertainty intervals which failed to achieve the nominal coverage, but LLM-BMA, as implemented in the dga R package produced the best balance of accuracy and interval coverage. The results also indicate that two-list estimation is unnecessarily vulnerable, and it is better to estimate the sizes of KPs based on at least three lists.
format Online
Article
Text
id pubmed-9345571
institution National Center for Biotechnology Information
language English
publishDate 2022
publisher Public Library of Science
record_format MEDLINE/PubMed
spelling pubmed-93455712023-03-10 Comparative performance of multiple-list estimators of key population size Gutreuter, Steve PLOS Glob Public Health Research Article Estimates of the sizes of key populations (KPs) affected by HIV, including men who have sex with men, female sex workers and people who inject drugs, are required for targeting epidemic control efforts where they are most needed. Unfortunately, different estimators often produce discrepant results, and an objective basis for choice is lacking. This simulation study provides the first comparison of information-theoretic selection of loglinear models (LLM-AIC), Bayesian model averaging of loglinear models (LLM-BMA) and Bayesian nonparametric latent-class modeling (BLCM) for estimation of population size from multiple lists. Four hundred random samples from populations of size 1,000, 10,000 and 20,000, each including five encounter opportunities, were independently simulated using each of 30 data-generating models obtained from combinations of six patterns of variation in encounter probabilities and five expected per-list encounter probabilities, producing a total of 36,000 samples. Population size was estimated for each combination of sample and sequentially cumulative sets of 2–5 lists using LLM-AIC, LLM-BMA and BLCM. LLM-BMA and BLCM were quite robust and performed comparably in terms of root mean-squared error and bias, and outperformed LLM-AIC. All estimation methods produced uncertainty intervals which failed to achieve the nominal coverage, but LLM-BMA, as implemented in the dga R package produced the best balance of accuracy and interval coverage. The results also indicate that two-list estimation is unnecessarily vulnerable, and it is better to estimate the sizes of KPs based on at least three lists. Public Library of Science 2022-03-10 /pmc/articles/PMC9345571/ /pubmed/35928219 http://dx.doi.org/10.1371/journal.pgph.0000155 Text en https://creativecommons.org/publicdomain/zero/1.0/This is an open access article, free of all copyright, and may be freely reproduced, distributed, transmitted, modified, built upon, or otherwise used by anyone for any lawful purpose. The work is made available under the Creative Commons CC0 (https://creativecommons.org/publicdomain/zero/1.0/) public domain dedication.
spellingShingle Research Article
Gutreuter, Steve
Comparative performance of multiple-list estimators of key population size
title Comparative performance of multiple-list estimators of key population size
title_full Comparative performance of multiple-list estimators of key population size
title_fullStr Comparative performance of multiple-list estimators of key population size
title_full_unstemmed Comparative performance of multiple-list estimators of key population size
title_short Comparative performance of multiple-list estimators of key population size
title_sort comparative performance of multiple-list estimators of key population size
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC9345571/
https://www.ncbi.nlm.nih.gov/pubmed/35928219
http://dx.doi.org/10.1371/journal.pgph.0000155
work_keys_str_mv AT gutreutersteve comparativeperformanceofmultiplelistestimatorsofkeypopulationsize