NetMF+: Network Embedding Based on Fast and Effective Single-Pass Randomized Matrix Factorization

Xie, Yuyang; Qiu, Jiezhong; Yu, Wenjian; Feng, Xu; Chen, Yuxiang; Tang, Jie

Computer Science > Social and Information Networks

arXiv:2110.12782v1 (cs)

[Submitted on 25 Oct 2021 (this version), latest version 1 Feb 2024 (v3)]

Title:NetMF+: Network Embedding Based on Fast and Effective Single-Pass Randomized Matrix Factorization

Authors:Yuyang Xie, Jiezhong Qiu, Wenjian Yu, Xu Feng, Yuxiang Chen, Jie Tang

View PDF

Abstract:In this work, we propose NetMF+, a fast, memory-efficient, scalable, and effective network embedding algorithm developed for a single machine with CPU only. NetMF+ is based on the theoretically grounded embedding method NetMF and leverages the theories from randomized matrix factorization to learn embedding efficiently. We firstly propose a fast randomized eigen-decomposition algorithm for the modified Laplacian matrix. Then, sparse-sign randomized single-pass singular value decomposition (SVD) is utilized to avoid constructing dense matrix and generate promising embedding. To enhance the performance of embedding, we apply spectral propagation in NetMF+. Finally, A high-performance parallel graph processing stack GBBS is used to achieve memory-efficiency. Experiment results show that NetMF+ can learn a powerful embedding from a network with more than 10^11 edges within 1.5 hours at lower memory cost than state-of-the-art methods. The result on ClueWeb with 0.9 billion vertices and 75 billion edges shows that NetMF+ saves more than half of the memory and runtime than the state-of-the-art and has better performance. The source code of NetMF+ will be publicly available after the anonymous peer review.

Subjects:	Social and Information Networks (cs.SI); Numerical Analysis (math.NA)
Cite as:	arXiv:2110.12782 [cs.SI]
	(or arXiv:2110.12782v1 [cs.SI] for this version)
	https://doi.org/10.48550/arXiv.2110.12782

Submission history

From: Yuyang Xie [view email]
[v1] Mon, 25 Oct 2021 10:21:25 UTC (1,153 KB)
[v2] Tue, 26 Oct 2021 13:31:06 UTC (241 KB)
[v3] Thu, 1 Feb 2024 09:02:28 UTC (5,081 KB)

Computer Science > Social and Information Networks

Title:NetMF+: Network Embedding Based on Fast and Effective Single-Pass Randomized Matrix Factorization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Social and Information Networks

Title:NetMF+: Network Embedding Based on Fast and Effective Single-Pass Randomized Matrix Factorization

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators