Free Energy Node Embedding via Generalized Skip-gram with Negative Sampling

Zhu, Yu; Swami, Ananthram; Segarra, Santiago

arXiv:2105.09182 (cs)

[Submitted on 19 May 2021 (v1), last revised 10 Sep 2022 (this version, v2)]

Abstract: A widely established set of unsupervised node embedding methods can be interpreted as consisting of two distinct steps: i) the definition of a similarity matrix based on the graph of interest, followed by ii) an explicit or implicit factorization of this matrix. Inspired by this viewpoint, we propose improvements in both steps of the framework. On the one hand, we propose to encode node similarities based on the free energy distance, which interpolates between the shortest path and the commute time distances, thus providing an additional degree of flexibility. On the other hand, we propose a matrix factorization method based on a loss function that generalizes that of the skip-gram model with negative sampling to arbitrary similarity matrices. Compared with factorizations based on the widely used $\ell_2$ loss, the proposed method can better preserve node pairs associated with higher similarity scores. Moreover, it can be easily implemented using advanced automatic differentiation toolkits and computed efficiently by leveraging GPU resources. Node clustering, node classification, and link prediction experiments on real-world datasets demonstrate the effectiveness of incorporating free-energy-based similarities as well as the proposed matrix factorization compared with state-of-the-art alternatives.
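
The factorization step described in the abstract can be illustrated with a small sketch. This is not the authors' implementation: it assumes one plausible reading of "generalizing the skip-gram with negative sampling loss to arbitrary similarity matrices", namely taking the matrix form of the standard SGNS objective and substituting a non-negative similarity matrix `S` for the co-occurrence counts. The function names, the toy graph, the hand-written gradients (used here in place of an automatic differentiation toolkit), and all hyperparameters are hypothetical choices made for illustration.

```python
import numpy as np

def sigmoid(x):
    # Numerically stable logistic function.
    return 0.5 * (1.0 + np.tanh(0.5 * x))

def log_sigmoid(x):
    # log(sigmoid(x)) = -log(1 + exp(-x)), computed stably.
    return -np.logaddexp(0.0, -x)

def sgns_loss_and_grads(S, U, V, k=1.0):
    """SGNS-style loss for a non-negative similarity matrix S (assumed form).

    loss = -sum_ij [ S_ij * log sigmoid(u_i . v_j)
                     + N_ij * log sigmoid(-u_i . v_j) ],
    with N_ij = k * rowsum_i * colsum_j / sum(S), mirroring how negative
    samples are weighted in the standard SGNS objective.
    """
    row = S.sum(axis=1, keepdims=True)
    col = S.sum(axis=0, keepdims=True)
    N = k * (row @ col) / S.sum()
    X = U @ V.T
    loss = -(S * log_sigmoid(X) + N * log_sigmoid(-X)).sum()
    # Gradient of the loss with respect to the score matrix X.
    G = -S * sigmoid(-X) + N * sigmoid(X)
    return loss, G @ V, G.T @ U

def fit(S, dim=2, k=1.0, lr=0.05, steps=500, seed=0):
    # Plain gradient descent on both embedding matrices.
    rng = np.random.default_rng(seed)
    n = S.shape[0]
    U = 0.1 * rng.standard_normal((n, dim))
    V = 0.1 * rng.standard_normal((n, dim))
    history = []
    for _ in range(steps):
        loss, grad_U, grad_V = sgns_loss_and_grads(S, U, V, k)
        history.append(loss)
        U -= lr * grad_U
        V -= lr * grad_V
    return U, V, history

# Toy similarity matrix: adjacency of two triangles joined by one edge.
# A free-energy-based similarity could be substituted here; the adjacency
# matrix is only a stand-in.
S = np.array([
    [0, 1, 1, 0, 0, 0],
    [1, 0, 1, 0, 0, 0],
    [1, 1, 0, 1, 0, 0],
    [0, 0, 1, 0, 1, 1],
    [0, 0, 0, 1, 0, 1],
    [0, 0, 0, 1, 1, 0],
], dtype=float)

U, V, history = fit(S)
```

Under this assumed loss, entries of `S` with larger values contribute more strongly to the positive term, which is one way to read the abstract's claim that high-similarity pairs are better preserved than under an $\ell_2$ fit. The rows of `U` would serve as the node embeddings.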

Subjects:	Machine Learning (cs.LG); Social and Information Networks (cs.SI)
Cite as:	arXiv:2105.09182 [cs.LG]
	(or arXiv:2105.09182v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2105.09182

Submission history

From: Yu Zhu
[v1] Wed, 19 May 2021 14:58:13 UTC (387 KB)
[v2] Sat, 10 Sep 2022 00:52:36 UTC (408 KB)
