Cargando…

Network Structure and Biased Variance Estimation in Respondent Driven Sampling

This paper explores bias in the estimation of sampling variance in Respondent Driven Sampling (RDS). Prior methodological work on RDS has focused on its problematic assumptions and the biases and inefficiencies of its estimators of the population mean. Nonetheless, researchers have given only slight...

Descripción completa

Detalles Bibliográficos
Autores principales:	Verdery, Ashton M., Mouw, Ted, Bauldry, Shawn, Mucha, Peter J.
Formato:	Online Artículo Texto
Lenguaje:	English
Publicado:	Public Library of Science 2015
Materias:	Research Article
Acceso en línea:	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682989/ https://www.ncbi.nlm.nih.gov/pubmed/26679927 http://dx.doi.org/10.1371/journal.pone.0145296

_version_	1782405956373577728
author	Verdery, Ashton M. Mouw, Ted Bauldry, Shawn Mucha, Peter J.
author_facet	Verdery, Ashton M. Mouw, Ted Bauldry, Shawn Mucha, Peter J.
author_sort	Verdery, Ashton M.
collection	PubMed
description	This paper explores bias in the estimation of sampling variance in Respondent Driven Sampling (RDS). Prior methodological work on RDS has focused on its problematic assumptions and the biases and inefficiencies of its estimators of the population mean. Nonetheless, researchers have given only slight attention to the topic of estimating sampling variance in RDS, despite the importance of variance estimation for the construction of confidence intervals and hypothesis tests. In this paper, we show that the estimators of RDS sampling variance rely on a critical assumption that the network is First Order Markov (FOM) with respect to the dependent variable of interest. We demonstrate, through intuitive examples, mathematical generalizations, and computational experiments that current RDS variance estimators will always underestimate the population sampling variance of RDS in empirical networks that do not conform to the FOM assumption. Analysis of 215 observed university and school networks from Facebook and Add Health indicates that the FOM assumption is violated in every empirical network we analyze, and that these violations lead to substantially biased RDS estimators of sampling variance. We propose and test two alternative variance estimators that show some promise for reducing biases, but which also illustrate the limits of estimating sampling variance with only partial information on the underlying population social network.
format	Online Article Text
id	pubmed-4682989
institution	National Center for Biotechnology Information
language	English
publishDate	2015
publisher	Public Library of Science
record_format	MEDLINE/PubMed
spelling	pubmed-46829892015-12-31 Network Structure and Biased Variance Estimation in Respondent Driven Sampling Verdery, Ashton M. Mouw, Ted Bauldry, Shawn Mucha, Peter J. PLoS One Research Article This paper explores bias in the estimation of sampling variance in Respondent Driven Sampling (RDS). Prior methodological work on RDS has focused on its problematic assumptions and the biases and inefficiencies of its estimators of the population mean. Nonetheless, researchers have given only slight attention to the topic of estimating sampling variance in RDS, despite the importance of variance estimation for the construction of confidence intervals and hypothesis tests. In this paper, we show that the estimators of RDS sampling variance rely on a critical assumption that the network is First Order Markov (FOM) with respect to the dependent variable of interest. We demonstrate, through intuitive examples, mathematical generalizations, and computational experiments that current RDS variance estimators will always underestimate the population sampling variance of RDS in empirical networks that do not conform to the FOM assumption. Analysis of 215 observed university and school networks from Facebook and Add Health indicates that the FOM assumption is violated in every empirical network we analyze, and that these violations lead to substantially biased RDS estimators of sampling variance. We propose and test two alternative variance estimators that show some promise for reducing biases, but which also illustrate the limits of estimating sampling variance with only partial information on the underlying population social network. Public Library of Science 2015-12-17 /pmc/articles/PMC4682989/ /pubmed/26679927 http://dx.doi.org/10.1371/journal.pone.0145296 Text en © 2015 Verdery et al http://creativecommons.org/licenses/by/4.0/ This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are properly credited.
spellingShingle	Research Article Verdery, Ashton M. Mouw, Ted Bauldry, Shawn Mucha, Peter J. Network Structure and Biased Variance Estimation in Respondent Driven Sampling
title	Network Structure and Biased Variance Estimation in Respondent Driven Sampling
title_full	Network Structure and Biased Variance Estimation in Respondent Driven Sampling
title_fullStr	Network Structure and Biased Variance Estimation in Respondent Driven Sampling
title_full_unstemmed	Network Structure and Biased Variance Estimation in Respondent Driven Sampling
title_short	Network Structure and Biased Variance Estimation in Respondent Driven Sampling
title_sort	network structure and biased variance estimation in respondent driven sampling
topic	Research Article
url	https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4682989/ https://www.ncbi.nlm.nih.gov/pubmed/26679927 http://dx.doi.org/10.1371/journal.pone.0145296
work_keys_str_mv	AT verderyashtonm networkstructureandbiasedvarianceestimationinrespondentdrivensampling AT mouwted networkstructureandbiasedvarianceestimationinrespondentdrivensampling AT bauldryshawn networkstructureandbiasedvarianceestimationinrespondentdrivensampling AT muchapeterj networkstructureandbiasedvarianceestimationinrespondentdrivensampling

Network Structure and Biased Variance Estimation in Respondent Driven Sampling

Ejemplares similares