CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

Bordne, Philipp; Hasan, M. Asif; Bergman, Eddie; Awad, Noor; Biedenkapp, André

Computer Science > Machine Learning

arXiv:2407.05789 (cs)

[Submitted on 8 Jul 2024]

Title:CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

Authors:Philipp Bordne, M. Asif Hasan, Eddie Bergman, Noor Awad, André Biedenkapp

View PDF HTML (experimental)

Abstract:High-dimensional action spaces remain a challenge for dynamic algorithm configuration (DAC). Interdependencies and varying importance between action dimensions are further known key characteristics of DAC problems. We argue that these Coupled Action Dimensions with Importance Differences (CANDID) represent aspects of the DAC problem that are not yet fully explored. To address this gap, we introduce a new white-box benchmark within the DACBench suite that simulates the properties of CANDID. Further, we propose sequential policies as an effective strategy for managing these properties. Such policies factorize the action space and mitigate exponential growth by learning a policy per action dimension. At the same time, these policies accommodate the interdependence of action dimensions by fostering implicit coordination. We show this in an experimental study of value-based policies on our new benchmark. This study demonstrates that sequential policies significantly outperform independent learning of factorized policies in CANDID action spaces. In addition, they overcome the scalability limitations associated with learning a single policy across all action dimensions. The code used for our experiments is available under this https URL.

Comments:	16 pages, 9 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2407.05789 [cs.LG]
	(or arXiv:2407.05789v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.05789

Submission history

From: Philipp Bordne [view email]
[v1] Mon, 8 Jul 2024 09:51:02 UTC (2,490 KB)

Computer Science > Machine Learning

Title:CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:CANDID DAC: Leveraging Coupled Action Dimensions with Importance Differences in DAC

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators