Cargando…

Using interviewer random effects to remove selection bias from HIV prevalence estimates

BACKGROUND: Selection bias in HIV prevalence estimates occurs if non-participation in testing is correlated with HIV status. Longitudinal data suggests that individuals who know or suspect they are HIV positive are less likely to participate in testing in HIV surveys, in which case methods to correc...

Descripción completa

Detalles Bibliográficos
Autores principales: McGovern, Mark E, Bärnighausen, Till, Salomon, Joshua A, Canning, David
Formato: Online Artículo Texto
Lenguaje:English
Publicado: BioMed Central 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4429465/
https://www.ncbi.nlm.nih.gov/pubmed/25656226
http://dx.doi.org/10.1186/1471-2288-15-8
_version_ 1782371039669387264
author McGovern, Mark E
Bärnighausen, Till
Salomon, Joshua A
Canning, David
author_facet McGovern, Mark E
Bärnighausen, Till
Salomon, Joshua A
Canning, David
author_sort McGovern, Mark E
collection PubMed
description BACKGROUND: Selection bias in HIV prevalence estimates occurs if non-participation in testing is correlated with HIV status. Longitudinal data suggests that individuals who know or suspect they are HIV positive are less likely to participate in testing in HIV surveys, in which case methods to correct for missing data which are based on imputation and observed characteristics will produce biased results. METHODS: The identity of the HIV survey interviewer is typically associated with HIV testing participation, but is unlikely to be correlated with HIV status. Interviewer identity can thus be used as a selection variable allowing estimation of Heckman-type selection models. These models produce asymptotically unbiased HIV prevalence estimates, even when non-participation is correlated with unobserved characteristics, such as knowledge of HIV status. We introduce a new random effects method to these selection models which overcomes non-convergence caused by collinearity, small sample bias, and incorrect inference in existing approaches. Our method is easy to implement in standard statistical software, and allows the construction of bootstrapped standard errors which adjust for the fact that the relationship between testing and HIV status is uncertain and needs to be estimated. RESULTS: Using nationally representative data from the Demographic and Health Surveys, we illustrate our approach with new point estimates and confidence intervals (CI) for HIV prevalence among men in Ghana (2003) and Zambia (2007). In Ghana, we find little evidence of selection bias as our selection model gives an HIV prevalence estimate of 1.4% (95% CI 1.2% – 1.6%), compared to 1.6% among those with a valid HIV test. In Zambia, our selection model gives an HIV prevalence estimate of 16.3% (95% CI 11.0% - 18.4%), compared to 12.1% among those with a valid HIV test. Therefore, those who decline to test in Zambia are found to be more likely to be HIV positive. CONCLUSIONS: Our approach corrects for selection bias in HIV prevalence estimates, is possible to implement even when HIV prevalence or non-participation is very high or very low, and provides a practical solution to account for both sampling and parameter uncertainty in the estimation of confidence intervals. The wide confidence intervals estimated in an example with high HIV prevalence indicate that it is difficult to correct statistically for the bias that may occur when a large proportion of people refuse to test. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2288-15-8) contains supplementary material, which is available to authorized users.
format Online
Article
Text
id pubmed-4429465
institution National Center for Biotechnology Information
language English
publishDate 2015
publisher BioMed Central
record_format MEDLINE/PubMed
spelling pubmed-44294652015-05-14 Using interviewer random effects to remove selection bias from HIV prevalence estimates McGovern, Mark E Bärnighausen, Till Salomon, Joshua A Canning, David BMC Med Res Methodol Research Article BACKGROUND: Selection bias in HIV prevalence estimates occurs if non-participation in testing is correlated with HIV status. Longitudinal data suggests that individuals who know or suspect they are HIV positive are less likely to participate in testing in HIV surveys, in which case methods to correct for missing data which are based on imputation and observed characteristics will produce biased results. METHODS: The identity of the HIV survey interviewer is typically associated with HIV testing participation, but is unlikely to be correlated with HIV status. Interviewer identity can thus be used as a selection variable allowing estimation of Heckman-type selection models. These models produce asymptotically unbiased HIV prevalence estimates, even when non-participation is correlated with unobserved characteristics, such as knowledge of HIV status. We introduce a new random effects method to these selection models which overcomes non-convergence caused by collinearity, small sample bias, and incorrect inference in existing approaches. Our method is easy to implement in standard statistical software, and allows the construction of bootstrapped standard errors which adjust for the fact that the relationship between testing and HIV status is uncertain and needs to be estimated. RESULTS: Using nationally representative data from the Demographic and Health Surveys, we illustrate our approach with new point estimates and confidence intervals (CI) for HIV prevalence among men in Ghana (2003) and Zambia (2007). In Ghana, we find little evidence of selection bias as our selection model gives an HIV prevalence estimate of 1.4% (95% CI 1.2% – 1.6%), compared to 1.6% among those with a valid HIV test. In Zambia, our selection model gives an HIV prevalence estimate of 16.3% (95% CI 11.0% - 18.4%), compared to 12.1% among those with a valid HIV test. Therefore, those who decline to test in Zambia are found to be more likely to be HIV positive. CONCLUSIONS: Our approach corrects for selection bias in HIV prevalence estimates, is possible to implement even when HIV prevalence or non-participation is very high or very low, and provides a practical solution to account for both sampling and parameter uncertainty in the estimation of confidence intervals. The wide confidence intervals estimated in an example with high HIV prevalence indicate that it is difficult to correct statistically for the bias that may occur when a large proportion of people refuse to test. ELECTRONIC SUPPLEMENTARY MATERIAL: The online version of this article (doi:10.1186/1471-2288-15-8) contains supplementary material, which is available to authorized users. BioMed Central 2015-02-05 /pmc/articles/PMC4429465/ /pubmed/25656226 http://dx.doi.org/10.1186/1471-2288-15-8 Text en © McGovern et al.; licensee BioMed Central. 2015 This article is published under license to BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly credited. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.
spellingShingle Research Article
McGovern, Mark E
Bärnighausen, Till
Salomon, Joshua A
Canning, David
Using interviewer random effects to remove selection bias from HIV prevalence estimates
title Using interviewer random effects to remove selection bias from HIV prevalence estimates
title_full Using interviewer random effects to remove selection bias from HIV prevalence estimates
title_fullStr Using interviewer random effects to remove selection bias from HIV prevalence estimates
title_full_unstemmed Using interviewer random effects to remove selection bias from HIV prevalence estimates
title_short Using interviewer random effects to remove selection bias from HIV prevalence estimates
title_sort using interviewer random effects to remove selection bias from hiv prevalence estimates
topic Research Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4429465/
https://www.ncbi.nlm.nih.gov/pubmed/25656226
http://dx.doi.org/10.1186/1471-2288-15-8
work_keys_str_mv AT mcgovernmarke usinginterviewerrandomeffectstoremoveselectionbiasfromhivprevalenceestimates
AT barnighausentill usinginterviewerrandomeffectstoremoveselectionbiasfromhivprevalenceestimates
AT salomonjoshuaa usinginterviewerrandomeffectstoremoveselectionbiasfromhivprevalenceestimates
AT canningdavid usinginterviewerrandomeffectstoremoveselectionbiasfromhivprevalenceestimates