WEPO: Web Element Preference Optimization for LLM-based Web Navigation

Liu, Jiarun; Hao, Jia; Zhang, Chunhong; Hu, Zheng

arXiv:2412.10742 (cs)

[Submitted on 14 Dec 2024]

Abstract: The rapid advancement of autonomous web navigation has benefited significantly from grounding pretrained Large Language Models (LLMs) as agents. However, current research has yet to fully leverage the redundancy of HTML elements for contrastive training. This paper introduces a novel approach to LLM-based web navigation tasks, called Web Element Preference Optimization (WEPO). WEPO utilizes unsupervised preference learning by sampling distance-based non-salient web elements as negative samples, optimizing a maximum likelihood objective within Direct Preference Optimization (DPO). We evaluate WEPO on the Mind2Web benchmark and empirically demonstrate that WEPO aligns user high-level intent with output actions more effectively. The results show that our method achieves state-of-the-art performance, with an improvement of 13.8% over WebAgent and 5.3% over the visual language model CogAgent baseline. Our findings underscore the potential of preference optimization to enhance web navigation and other web-page-based tasks, suggesting a promising direction for future research.
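
The two ingredients named in the abstract, distance-based negative sampling and the DPO objective, can be illustrated with a minimal Python sketch. This is not the authors' implementation: the abstract does not specify how distance is measured or weighted, so the function `sample_negatives`, its use of DOM-order index distance, and the choice to favor farther elements are all assumptions made for illustration. The `dpo_loss` function follows the standard per-pair DPO loss, -log sigmoid(beta * ((log pi(y_w|x) - log pi_ref(y_w|x)) - (log pi(y_l|x) - log pi_ref(y_l|x)))), where y_w is the target element's action and y_l is a sampled negative.

```python
import math
import random


def sample_negatives(num_elements, target_idx, k, rng):
    """Draw up to k distinct non-target element indices as negatives.

    Assumption (not from the paper): distance is the absolute difference
    of DOM-order indices, and farther elements are more likely to be drawn.
    """
    candidates = [i for i in range(num_elements) if i != target_idx]
    weights = [abs(i - target_idx) for i in candidates]  # always >= 1
    negatives = []
    for _ in range(min(k, len(candidates))):
        # Weighted draw without replacement: pick one, then remove it.
        pick = rng.choices(range(len(candidates)), weights=weights, k=1)[0]
        negatives.append(candidates.pop(pick))
        weights.pop(pick)
    return negatives


def dpo_loss(pol_chosen, pol_rejected, ref_chosen, ref_rejected, beta=0.1):
    """Standard per-pair DPO loss from sequence log-probabilities.

    pol_* are log-probabilities under the policy being trained,
    ref_* are log-probabilities under the frozen reference model.
    """
    margin = beta * ((pol_chosen - ref_chosen) - (pol_rejected - ref_rejected))
    # -log(sigmoid(margin)) == softplus(-margin), computed stably.
    return max(-margin, 0.0) + math.log1p(math.exp(-abs(margin)))


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical page with 10 candidate elements; element 3 is the target.
    print(sample_negatives(num_elements=10, target_idx=3, k=4, rng=rng))
    # Policy identical to the reference gives a loss of log(2).
    print(dpo_loss(-1.0, -2.0, -1.0, -2.0))
```

Each sampled negative would be paired with the ground-truth element to form one (chosen, rejected) preference pair; the loss falls below log(2) once the policy favors the chosen action more strongly than the reference model does.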

Comments:	Published at AAAI 2025
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2412.10742 [cs.CL]
	(or arXiv:2412.10742v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2412.10742

Submission history

From: Jiarun Liu
[v1] Sat, 14 Dec 2024 08:25:28 UTC (21,251 KB)
