Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture

Xiao, Yubin; Wang, Di; Wu, Xuan; Wu, Yuesong; Li, Boyang; Du, Wei; Wang, Liupu; Zhou, You

Computer Science > Machine Learning

arXiv:2406.06652 (cs)

[Submitted on 10 Jun 2024 (v1), last revised 17 Jun 2024 (this version, v2)]

Title:Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture

Authors:Yubin Xiao, Di Wang, Xuan Wu, Yuesong Wu, Boyang Li, Wei Du, Liupu Wang, You Zhou

View PDF HTML (experimental)

Abstract:Neural models produce promising results when solving Vehicle Routing Problems (VRPs), but often fall short in generalization. Recent attempts to enhance model generalization often incur unnecessarily large training cost or cannot be directly applied to other models solving different VRP variants. To address these issues, we take a novel perspective on model architecture in this study. Specifically, we propose a plug-and-play Entropy-based Scaling Factor (ESF) and a Distribution-Specific (DS) decoder to enhance the size and distribution generalization, respectively. ESF adjusts the attention weight pattern of the model towards familiar ones discovered during training when solving VRPs of varying sizes. The DS decoder explicitly models VRPs of multiple training distribution patterns through multiple auxiliary light decoders, expanding the model representation space to encompass a broader range of distributional scenarios. We conduct extensive experiments on both synthetic and widely recognized real-world benchmarking datasets and compare the performance with seven baseline models. The results demonstrate the effectiveness of using ESF and DS decoder to obtain a more generalizable model and showcase their applicability to solve different VRP variants, i.e., travelling salesman problem and capacitated VRP. Notably, our proposed generic components require minimal computational resources, and can be effortlessly integrated into conventional generalization strategies to further elevate model generalization.

Comments:	13 pages, 6 figures, and 6 tables
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2406.06652 [cs.LG]
	(or arXiv:2406.06652v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2406.06652

Submission history

From: Yubin Xiao [view email]
[v1] Mon, 10 Jun 2024 09:03:17 UTC (6,343 KB)
[v2] Mon, 17 Jun 2024 14:02:57 UTC (6,344 KB)

Computer Science > Machine Learning

Title:Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Improving Generalization of Neural Vehicle Routing Problem Solvers Through the Lens of Model Architecture

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators