Learning Heuristic Selection with Dynamic Algorithm Configuration

Speck, David; Biedenkapp, André; Hutter, Frank; Mattmüller, Robert; Lindauer, Marius

Computer Science > Artificial Intelligence

arXiv:2006.08246 (cs)

[Submitted on 15 Jun 2020 (v1), last revised 12 Apr 2021 (this version, v3)]

Title:Learning Heuristic Selection with Dynamic Algorithm Configuration

Authors:David Speck, André Biedenkapp, Frank Hutter, Robert Mattmüller, Marius Lindauer

View PDF

Abstract:A key challenge in satisficing planning is to use multiple heuristics within one heuristic search. An aggregation of multiple heuristic estimates, for example by taking the maximum, has the disadvantage that bad estimates of a single heuristic can negatively affect the whole search. Since the performance of a heuristic varies from instance to instance, approaches such as algorithm selection can be successfully applied. In addition, alternating between multiple heuristics during the search makes it possible to use all heuristics equally and improve performance. However, all these approaches ignore the internal search dynamics of a planning system, which can help to select the most useful heuristics for the current expansion step. We show that dynamic algorithm configuration can be used for dynamic heuristic selection which takes into account the internal search dynamics of a planning system. Furthermore, we prove that this approach generalizes over existing approaches and that it can exponentially improve the performance of the heuristic search. To learn dynamic heuristic selection, we propose an approach based on reinforcement learning and show empirically that domain-wise learned policies, which take the internal search dynamics of a planning system into account, can exceed existing approaches.

Comments:	Long version of the paper at the International Conference on Automated Planning and Scheduling (ICAPS) 2021
Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2006.08246 [cs.AI]
	(or arXiv:2006.08246v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2006.08246

Submission history

From: David Speck [view email]
[v1] Mon, 15 Jun 2020 09:35:07 UTC (48 KB)
[v2] Wed, 9 Dec 2020 09:25:42 UTC (545 KB)
[v3] Mon, 12 Apr 2021 14:32:32 UTC (310 KB)

Computer Science > Artificial Intelligence

Title:Learning Heuristic Selection with Dynamic Algorithm Configuration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Learning Heuristic Selection with Dynamic Algorithm Configuration

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators