Start Over

Revisiting random walk based sampling in networks: Evasion of burn-in period and frequent regenerations

Authors :: Vivek S. Borkar
Arun Kadavankandy
Jithin K. Sreedharan
Konstantin Avrachenkov
Network Engineering and Operations (NEO )
Inria Sophia Antipolis - Méditerranée (CRISAM)
Institut National de Recherche en Informatique et en Automatique (Inria)-Institut National de Recherche en Informatique et en Automatique (Inria)
Department of Electrical Engineering [IIT-Bombay] (EE-IIT)
Indian Institute of Technology Kanpur (IIT Kanpur)
Department of Computer Science [Purdue]
Purdue University [West Lafayette]
Source :: Computational Social Networks, Computational Social Networks, Springer, 2018, 5 (4), ⟨10.1186/s40649-018-0051-0⟩, Computational Social Networks, Vol 5, Iss 1, Pp 1-19 (2018), Computational Social Networks, 2018, 5 (4), ⟨10.1186/s40649-018-0051-0⟩
Publication Year :: 2018
Publisher :: HAL CCSD, 2018.
Abstract: International audience; Background: In the framework of network sampling, random walk (RW) based estimation techniques provide many pragmatic solutions while uncovering the unknown network as little as possible. Despite several theoretical advances in this area, RW based sampling techniques usually make a strong assumption that the samples are in stationary regime, and hence are impelled to leave out the samples collected during the burn-in period. Methods: This work proposes two sampling schemes without burn-in time constraint to estimate the average of an arbitrary function defined on the network nodes, for example, the average age of users in a social network. The central idea of the algorithms lies in exploiting regeneration of RWs at revisits to an aggregated super-node or to a set of nodes, and in strategies to enhance the frequency of such regenerations either by contracting the graph or by making the hitting set larger. Our first algorithm, which is based on reinforcement learning (RL), uses stochastic approximation to derive an estimator. This method can be seen as intermediate between purely stochastic Markov chain Monte Carlo iterations and deterministic relative value iterations. The second algorithm, which we call the Ratio with Tours (RT)-estimator, is a modified form of respondent-driven sampling (RDS) that accommodates the idea of regeneration. Results: We study the methods via simulations on real networks. We observe that the trajectories of RL-estimator are much more stable than those of standard random walk based estimation procedures, and its error performance is comparable to that of respondent-driven sampling (RDS) which has a smaller asymptotic variance than many other estimators. Simulation studies also show that the mean squared error of RT-esti-mator decays much faster than that of RDS with time. Conclusion: The newly developed RW based estimators (RL-and RT-estimators) allow to avoid burn-in period, provide better control of stability along the sample path, and overall reduce the estimation time. Our estimators can be applied in social and complex networks.

Subjects :: Mean squared error
Respondent-driven sampling
Computer science
Stability (learning theory)
02 engineering and technology
Stochastic approximation
01 natural sciences
lcsh:QA75.5-76.95
[INFO.INFO-SI]Computer Science [cs]/Social and Information Networks [cs.SI]
010104 statistics & probability
symbols.namesake
[INFO.INFO-LG]Computer Science [cs]/Machine Learning [cs.LG]
[MATH.MATH-ST]Mathematics [math]/Statistics [math.ST]
Reinforcement learning
0202 electrical engineering, electronic engineering, information engineering
0101 mathematics
Network sampling
Random walks on graph
lcsh:T58.5-58.64
lcsh:Information technology
Research
Sampling (statistics)
Estimator
020206 networking & telecommunications
Markov chain Monte Carlo
Complex network
Random walk
Computer Science Applications
Human-Computer Interaction
Modeling and Simulation
symbols
lcsh:Electronic computers. Computer science
Algorithm
Information Systems

Details

Language :: English
ISSN :: 21974314
Database :: OpenAIRE
Journal :: Computational Social Networks, Computational Social Networks, Springer, 2018, 5 (4), ⟨10.1186/s40649-018-0051-0⟩, Computational Social Networks, Vol 5, Iss 1, Pp 1-19 (2018), Computational Social Networks, 2018, 5 (4), ⟨10.1186/s40649-018-0051-0⟩
Accession number :: edsair.doi.dedup.....60f4c99a5ca6344bb04c7758893fadb6
Full Text :: https://doi.org/10.1186/s40649-018-0051-0⟩

Full Text Access

View/download PDF

Tools

Email
Cite

Printer

Authors Abstract Subjects Details

Searchworks

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources

Revisiting random walk based sampling in networks: Evasion of burn-in period and frequent regenerations

Abstract

Subjects

Details

Tools

Searchworks

Select search scope, currently: Articles Catalog books, media & more in Jio Institute collections Articles journal articles & other e-resources

Revisiting random walk based sampling in networks: Evasion of burn-in period and frequent regenerations

Abstract

Subjects

Details

Tools

Select search scope, currently: Articles

Catalog

books, media & more in Jio Institute collections

Articles

journal articles & other e-resources