GLBench: A Comprehensive Benchmark for Graph with Large Language Models

Li, Yuhan; Wang, Peisong; Zhu, Xiao; Chen, Aochuan; Jiang, Haiyun; Cai, Deng; Chan, Victor Wai Kin; Li, Jia

Computer Science > Machine Learning

arXiv:2407.07457 (cs)

[Submitted on 10 Jul 2024 (v1), last revised 29 Oct 2024 (this version, v4)]

Title:GLBench: A Comprehensive Benchmark for Graph with Large Language Models

Authors:Yuhan Li, Peisong Wang, Xiao Zhu, Aochuan Chen, Haiyun Jiang, Deng Cai, Victor Wai Kin Chan, Jia Li

View PDF HTML (experimental)

Abstract:The emergence of large language models (LLMs) has revolutionized the way we interact with graphs, leading to a new paradigm called GraphLLM. Despite the rapid development of GraphLLM methods in recent years, the progress and understanding of this field remain unclear due to the lack of a benchmark with consistent experimental protocols. To bridge this gap, we introduce GLBench, the first comprehensive benchmark for evaluating GraphLLM methods in both supervised and zero-shot scenarios. GLBench provides a fair and thorough evaluation of different categories of GraphLLM methods, along with traditional baselines such as graph neural networks. Through extensive experiments on a collection of real-world datasets with consistent data processing and splitting strategies, we have uncovered several key findings. Firstly, GraphLLM methods outperform traditional baselines in supervised settings, with LLM-as-enhancers showing the most robust performance. However, using LLMs as predictors is less effective and often leads to uncontrollable output issues. We also notice that no clear scaling laws exist for current GraphLLM methods. In addition, both structures and semantics are crucial for effective zero-shot transfer, and our proposed simple baseline can even outperform several models tailored for zero-shot scenarios. The data and code of the benchmark can be found at this https URL.

Subjects:	Machine Learning (cs.LG); Computation and Language (cs.CL)
Cite as:	arXiv:2407.07457 [cs.LG]
	(or arXiv:2407.07457v4 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.07457

Submission history

From: Yuhan Li [view email]
[v1] Wed, 10 Jul 2024 08:20:47 UTC (150 KB)
[v2] Thu, 11 Jul 2024 06:06:33 UTC (150 KB)
[v3] Tue, 22 Oct 2024 10:54:15 UTC (162 KB)
[v4] Tue, 29 Oct 2024 08:49:11 UTC (162 KB)

Computer Science > Machine Learning

Title:GLBench: A Comprehensive Benchmark for Graph with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:GLBench: A Comprehensive Benchmark for Graph with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators