FrogWild! -- Fast PageRank Approximations on Graph Engines

Mitliagkas, Ioannis; Borokhovich, Michael; Dimakis, Alexandros G.; Caramanis, Constantine

Computer Science > Distributed, Parallel, and Cluster Computing

arXiv:1502.04281 (cs)

[Submitted on 15 Feb 2015]

Title:FrogWild! -- Fast PageRank Approximations on Graph Engines

Authors:Ioannis Mitliagkas, Michael Borokhovich, Alexandros G. Dimakis, Constantine Caramanis

View PDF

Abstract:We propose FrogWild, a novel algorithm for fast approximation of high PageRank vertices, geared towards reducing network costs of running traditional PageRank algorithms. Our algorithm can be seen as a quantized version of power iteration that performs multiple parallel random walks over a directed graph. One important innovation is that we introduce a modification to the GraphLab framework that only partially synchronizes mirror vertices. This partial synchronization vastly reduces the network traffic generated by traditional PageRank algorithms, thus greatly reducing the per-iteration cost of PageRank. On the other hand, this partial synchronization also creates dependencies between the random walks used to estimate PageRank. Our main theoretical innovation is the analysis of the correlations introduced by this partial synchronization process and a bound establishing that our approximation is close to the true PageRank vector.
We implement our algorithm in GraphLab and compare it against the default PageRank implementation. We show that our algorithm is very fast, performing each iteration in less than one second on the Twitter graph and can be up to 7x faster compared to the standard GraphLab PageRank implementation.

Subjects:	Distributed, Parallel, and Cluster Computing (cs.DC); Information Theory (cs.IT); Social and Information Networks (cs.SI)
Cite as:	arXiv:1502.04281 [cs.DC]
	(or arXiv:1502.04281v1 [cs.DC] for this version)
	https://doi.org/10.48550/arXiv.1502.04281

Submission history

From: Michael Borokhovich [view email]
[v1] Sun, 15 Feb 2015 04:33:00 UTC (851 KB)

Computer Science > Distributed, Parallel, and Cluster Computing

Title:FrogWild! -- Fast PageRank Approximations on Graph Engines

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Distributed, Parallel, and Cluster Computing

Title:FrogWild! -- Fast PageRank Approximations on Graph Engines

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators