Cargando…

Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees

This paper presents a new method for obtaining network properties from incomplete data sets. Problems associated with missing data represent well-known stumbling blocks in Social Network Analysis. The method of “estimating connectivity from spanning tree completions” (ECSTC) is specifically designed...

Descripción completa

Detalles Bibliográficos
Autores principales: Khan, Bilal, Dombrowski, Kirk, Curtis, Ric, Wendel, Travis
Formato: Online Artículo Texto
Lenguaje:English
Publicado: 2015
Materias:
Acceso en línea:https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380167/
https://www.ncbi.nlm.nih.gov/pubmed/25838988
http://dx.doi.org/10.4236/sn.2015.41001
_version_ 1782364298362748928
author Khan, Bilal
Dombrowski, Kirk
Curtis, Ric
Wendel, Travis
author_facet Khan, Bilal
Dombrowski, Kirk
Curtis, Ric
Wendel, Travis
author_sort Khan, Bilal
collection PubMed
description This paper presents a new method for obtaining network properties from incomplete data sets. Problems associated with missing data represent well-known stumbling blocks in Social Network Analysis. The method of “estimating connectivity from spanning tree completions” (ECSTC) is specifically designed to address situations where only spanning tree(s) of a network are known, such as those obtained through respondent driven sampling (RDS). Using repeated random completions derived from degree information, this method forgoes the usual step of trying to obtain final edge or vertex rosters, and instead aims to estimate network-centric properties of vertices probabilistically from the spanning trees themselves. In this paper, we discuss the problem of missing data and describe the protocols of our completion method, and finally the results of an experiment where ECSTC was used to estimate graph dependent vertex properties from spanning trees sampled from a graph whose characteristics were known ahead of time. The results show that ECSTC methods hold more promise for obtaining network-centric properties of individuals from a limited set of data than researchers may have previously assumed. Such an approach represents a break with past strategies of working with missing data which have mainly sought means to complete the graph, rather than ECSTC's approach, which is to estimate network properties themselves without deciding on the final edge set.
format Online
Article
Text
id pubmed-4380167
institution National Center for Biotechnology Information
language English
publishDate 2015
record_format MEDLINE/PubMed
spelling pubmed-43801672015-03-31 Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees Khan, Bilal Dombrowski, Kirk Curtis, Ric Wendel, Travis Soc Netw Article This paper presents a new method for obtaining network properties from incomplete data sets. Problems associated with missing data represent well-known stumbling blocks in Social Network Analysis. The method of “estimating connectivity from spanning tree completions” (ECSTC) is specifically designed to address situations where only spanning tree(s) of a network are known, such as those obtained through respondent driven sampling (RDS). Using repeated random completions derived from degree information, this method forgoes the usual step of trying to obtain final edge or vertex rosters, and instead aims to estimate network-centric properties of vertices probabilistically from the spanning trees themselves. In this paper, we discuss the problem of missing data and describe the protocols of our completion method, and finally the results of an experiment where ECSTC was used to estimate graph dependent vertex properties from spanning trees sampled from a graph whose characteristics were known ahead of time. The results show that ECSTC methods hold more promise for obtaining network-centric properties of individuals from a limited set of data than researchers may have previously assumed. Such an approach represents a break with past strategies of working with missing data which have mainly sought means to complete the graph, rather than ECSTC's approach, which is to estimate network properties themselves without deciding on the final edge set. 2015-01-13 2015-01-01 /pmc/articles/PMC4380167/ /pubmed/25838988 http://dx.doi.org/10.4236/sn.2015.41001 Text en Copyright © 2015 by authors and Scientific Research Publishing Inc. This work is licensed under the Creative Commons Attribution International License (CC BY). http://creativecommons.org/licenses/by/4.0/
spellingShingle Article
Khan, Bilal
Dombrowski, Kirk
Curtis, Ric
Wendel, Travis
Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees
title Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees
title_full Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees
title_fullStr Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees
title_full_unstemmed Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees
title_short Estimating Vertex Measures in Social Networks by Sampling Completions of RDS Trees
title_sort estimating vertex measures in social networks by sampling completions of rds trees
topic Article
url https://www.ncbi.nlm.nih.gov/pmc/articles/PMC4380167/
https://www.ncbi.nlm.nih.gov/pubmed/25838988
http://dx.doi.org/10.4236/sn.2015.41001
work_keys_str_mv AT khanbilal estimatingvertexmeasuresinsocialnetworksbysamplingcompletionsofrdstrees
AT dombrowskikirk estimatingvertexmeasuresinsocialnetworksbysamplingcompletionsofrdstrees
AT curtisric estimatingvertexmeasuresinsocialnetworksbysamplingcompletionsofrdstrees
AT wendeltravis estimatingvertexmeasuresinsocialnetworksbysamplingcompletionsofrdstrees