Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Bonnet, Clément; Luo, Daniel; Byrne, Donal; Surana, Shikha; Abramowitz, Sasha; Duckworth, Paul; Coyette, Vincent; Midgley, Laurence I.; Tegegn, Elshadai; Kalloniatis, Tristan; Mahjoub, Omayma; Macfarlane, Matthew; Smit, Andries P.; Grinsztajn, Nathan; Boige, Raphael; Waters, Cemlyn N.; Mimouni, Mohamed A.; Sob, Ulrich A. Mbou; de Kock, Ruan; Singh, Siddarth; Furelos-Blanco, Daniel; Le, Victor; Pretorius, Arnu; Laterre, Alexandre

Computer Science > Machine Learning

arXiv:2306.09884 (cs)

[Submitted on 16 Jun 2023 (v1), last revised 16 Mar 2024 (this version, v2)]

Title:Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Abstract:Open-source reinforcement learning (RL) environments have played a crucial role in driving progress in the development of AI algorithms. In modern RL research, there is a need for simulated environments that are performant, scalable, and modular to enable their utilization in a wider range of potential real-world applications. Therefore, we present Jumanji, a suite of diverse RL environments specifically designed to be fast, flexible, and scalable. Jumanji provides a suite of environments focusing on combinatorial problems frequently encountered in industry, as well as challenging general decision-making tasks. By leveraging the efficiency of JAX and hardware accelerators like GPUs and TPUs, Jumanji enables rapid iteration of research ideas and large-scale experimentation, ultimately empowering more capable agents. Unlike existing RL environment suites, Jumanji is highly customizable, allowing users to tailor the initial state distribution and problem complexity to their needs. Furthermore, we provide actor-critic baselines for each environment, accompanied by preliminary findings on scaling and generalization scenarios. Jumanji aims to set a new standard for speed, adaptability, and scalability of RL environments.

Comments:	9 pages + 21 pages of appendices and references. Published at ICLR 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2306.09884 [cs.LG]
	(or arXiv:2306.09884v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2306.09884

Submission history

From: Clément Bonnet [view email]
[v1] Fri, 16 Jun 2023 14:52:24 UTC (4,823 KB)
[v2] Sat, 16 Mar 2024 00:02:49 UTC (5,523 KB)

Computer Science > Machine Learning

Title:Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Jumanji: a Diverse Suite of Scalable Reinforcement Learning Environments in JAX

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators