Adversarial Environment Design via Regret-Guided Diffusion Models

Chung, Hojun; Lee, Junseo; Kim, Minsoo; Kim, Dohyeong; Oh, Songhwai

arXiv:2410.19715 (cs)

[Submitted on 25 Oct 2024 (v1), last revised 15 Nov 2024 (this version, v2)]

Abstract: Training agents that are robust to environmental changes remains a significant challenge in deep reinforcement learning (RL). Unsupervised environment design (UED) has recently emerged to address this issue by generating a set of training environments tailored to the agent's capabilities. While prior works demonstrate that UED has the potential to learn a robust policy, their performance is constrained by the capabilities of the environment generation. To this end, we propose a novel UED algorithm, adversarial environment design via regret-guided diffusion models (ADD). The proposed method guides the diffusion-based environment generator with the regret of the agent to produce environments that the agent finds challenging but conducive to further improvement. By exploiting the representation power of diffusion models, ADD can directly generate adversarial environments while maintaining the diversity of training environments, enabling the agent to effectively learn a robust policy. Our experimental results demonstrate that the proposed method successfully generates an instructive curriculum of environments, outperforming UED baselines in zero-shot generalization across novel, out-of-distribution environments.
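The idea of steering a generator with regret can be illustrated with a toy sketch. This is not the authors' implementation. It assumes a one-dimensional environment parameter, replaces the learned diffusion model with a fixed N(0, 1) prior whose score is known in closed form, and replaces the diffusion reverse process with plain Langevin dynamics. The names `prior_score`, `regret_grad`, `guided_sample`, `mean_parameter`, the quadratic regret estimate, and the guidance weight `omega` are all hypothetical choices made for illustration.

```python
import math
import random


def prior_score(theta):
    """Score (gradient of the log-density) of a toy N(0, 1) prior over environments."""
    return -theta


def regret_grad(theta, hardest=2.0):
    """Gradient of a hypothetical regret estimate -(theta - hardest)**2 / 2."""
    return -(theta - hardest)


def guided_sample(omega, steps=300, step_size=0.01, rng=None):
    """Draw one environment parameter with regret-weighted Langevin dynamics.

    The update follows the prior score plus omega times the regret gradient,
    so it targets a density proportional to prior(theta) * exp(omega * regret(theta)).
    """
    rng = rng or random.Random()
    theta = rng.gauss(0.0, 1.0)  # start from the unguided prior
    for _ in range(steps):
        score = prior_score(theta) + omega * regret_grad(theta)
        noise = rng.gauss(0.0, 1.0)
        theta += step_size * score + math.sqrt(2.0 * step_size) * noise
    return theta


def mean_parameter(omega, n=1000, seed=0):
    """Average sampled parameter over n independent guided samples."""
    rng = random.Random(seed)
    return sum(guided_sample(omega, rng=rng) for _ in range(n)) / n
```

With `omega = 0` the samples stay centered on the prior. For this toy setup the guided target is Gaussian with mean `omega * hardest / (1 + omega)`, so a larger `omega` moves samples toward the high-regret region while the injected noise keeps them spread out rather than collapsing onto a single environment.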

Comments:	38th Conference on Neural Information Processing Systems
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2410.19715 [cs.LG]
	(or arXiv:2410.19715v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2410.19715

Submission history

From: Hojun Chung
[v1] Fri, 25 Oct 2024 17:35:03 UTC (5,731 KB)
[v2] Fri, 15 Nov 2024 01:01:44 UTC (5,732 KB)
