Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

Matthews, Michael; Beukman, Michael; Ellis, Benjamin; Samvelyan, Mikayel; Jackson, Matthew; Coward, Samuel; Foerster, Jakob

Computer Science > Machine Learning

arXiv:2402.16801 (cs)

[Submitted on 26 Feb 2024 (v1), last revised 3 Jun 2024 (this version, v2)]

Title:Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

Authors:Michael Matthews, Michael Beukman, Benjamin Ellis, Mikayel Samvelyan, Matthew Jackson, Samuel Coward, Jakob Foerster

View PDF HTML (experimental)

Abstract:Benchmarks play a crucial role in the development and analysis of reinforcement learning (RL) algorithms. We identify that existing benchmarks used for research into open-ended learning fall into one of two categories. Either they are too slow for meaningful research to be performed without enormous computational resources, like Crafter, NetHack and Minecraft, or they are not complex enough to pose a significant challenge, like Minigrid and Procgen. To remedy this, we first present Craftax-Classic: a ground-up rewrite of Crafter in JAX that runs up to 250x faster than the Python-native original. A run of PPO using 1 billion environment interactions finishes in under an hour using only a single GPU and averages 90% of the optimal reward. To provide a more compelling challenge we present the main Craftax benchmark, a significant extension of the Crafter mechanics with elements inspired from NetHack. Solving Craftax requires deep exploration, long term planning and memory, as well as continual adaptation to novel situations as more of the world is discovered. We show that existing methods including global and episodic exploration, as well as unsupervised environment design fail to make material progress on the benchmark. We believe that Craftax can for the first time allow researchers to experiment in a complex, open-ended environment with limited computational resources.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2402.16801 [cs.LG]
	(or arXiv:2402.16801v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2402.16801

Submission history

From: Michael Matthews [view email]
[v1] Mon, 26 Feb 2024 18:19:07 UTC (6,242 KB)
[v2] Mon, 3 Jun 2024 14:12:27 UTC (10,789 KB)

Computer Science > Machine Learning

Title:Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Craftax: A Lightning-Fast Benchmark for Open-Ended Reinforcement Learning

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators