On Using Admissible Bounds for Learning Forward Search Heuristics

Núñez-Molina, Carlos; Asai, Masataro; Mesejo, Pablo; Fernández-Olivares, Juan

Computer Science > Artificial Intelligence

arXiv:2308.11905 (cs)

[Submitted on 23 Aug 2023 (v1), last revised 7 May 2024 (this version, v3)]

Title:On Using Admissible Bounds for Learning Forward Search Heuristics

Authors:Carlos Núñez-Molina, Masataro Asai, Pablo Mesejo, Juan Fernández-Olivares

View PDF HTML (experimental)

Abstract:In recent years, there has been growing interest in utilizing modern machine learning techniques to learn heuristic functions for forward search algorithms. Despite this, there has been little theoretical understanding of what they should learn, how to train them, and why we do so. This lack of understanding has resulted in the adoption of diverse training targets (suboptimal vs optimal costs vs admissible heuristics) and loss functions (e.g., square vs absolute errors) in the literature. In this work, we focus on how to effectively utilize the information provided by admissible heuristics in heuristic learning. We argue that learning from poly-time admissible heuristics by minimizing mean square errors (MSE) is not the correct approach, since its result is merely a noisy, inadmissible copy of an efficiently computable heuristic. Instead, we propose to model the learned heuristic as a truncated gaussian, where admissible heuristics are used not as training targets but as lower bounds of this distribution. This results in a different loss function from the MSE commonly employed in the literature, which implicitly models the learned heuristic as a gaussian distribution. We conduct experiments where both MSE and our novel loss function are applied to learning a heuristic from optimal plan costs. Results show that our proposed method converges faster during training and yields better heuristics.

Comments:	19 pages, 2 figures
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
MSC classes:	I.2.8
Cite as:	arXiv:2308.11905 [cs.AI]
	(or arXiv:2308.11905v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2308.11905

Submission history

From: Carlos Núñez Molina [view email]
[v1] Wed, 23 Aug 2023 04:14:45 UTC (184 KB)
[v2] Mon, 2 Oct 2023 20:15:56 UTC (169 KB)
[v3] Tue, 7 May 2024 11:11:47 UTC (179 KB)

Computer Science > Artificial Intelligence

Title:On Using Admissible Bounds for Learning Forward Search Heuristics

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:On Using Admissible Bounds for Learning Forward Search Heuristics

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators