Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

Xie, Han; Zheng, Da; Ma, Jun; Zhang, Houyu; Ioannidis, Vassilis N.; Song, Xiang; Ping, Qing; Wang, Sheng; Yang, Carl; Xu, Yi; Zeng, Belinda; Chilimbi, Trishul

Computer Science > Computation and Language

arXiv:2306.02592 (cs)

[Submitted on 5 Jun 2023]

Title:Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

Authors:Han Xie, Da Zheng, Jun Ma, Houyu Zhang, Vassilis N. Ioannidis, Xiang Song, Qing Ping, Sheng Wang, Carl Yang, Yi Xu, Belinda Zeng, Trishul Chilimbi

View PDF

Abstract:Model pre-training on large text corpora has been demonstrated effective for various downstream applications in the NLP domain. In the graph mining domain, a similar analogy can be drawn for pre-training graph models on large graphs in the hope of benefiting downstream graph applications, which has also been explored by several recent studies. However, no existing study has ever investigated the pre-training of text plus graph models on large heterogeneous graphs with abundant textual information (a.k.a. large graph corpora) and then fine-tuning the model on different related downstream applications with different graph schemas. To address this problem, we propose a framework of graph-aware language model pre-training (GALM) on a large graph corpus, which incorporates large language models and graph neural networks, and a variety of fine-tuning methods on downstream applications. We conduct extensive experiments on Amazon's real internal datasets and large public datasets. Comprehensive empirical results and in-depth analysis demonstrate the effectiveness of our proposed methods along with lessons learned.

Comments:	To be published in the KDD 2023 proceedings as a full paper
Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2306.02592 [cs.CL]
	(or arXiv:2306.02592v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2306.02592

Submission history

From: Han Xie [view email]
[v1] Mon, 5 Jun 2023 04:46:44 UTC (3,410 KB)

Computer Science > Computation and Language

Title:Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Graph-Aware Language Model Pre-Training on a Large Graph Corpus Can Help Multiple Graph Applications

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators