Building reliable sim driving agents by scaling self-play

Cornelisse, Daphne; Pandya, Aarav; Joseph, Kevin; Suárez, Joseph; Vinitsky, Eugene

Computer Science > Artificial Intelligence

arXiv:2502.14706 (cs)

[Submitted on 20 Feb 2025]

Title:Building reliable sim driving agents by scaling self-play

Authors:Daphne Cornelisse, Aarav Pandya, Kevin Joseph, Joseph Suárez, Eugene Vinitsky

View PDF HTML (experimental)

Abstract:Simulation agents are essential for designing and testing systems that interact with humans, such as autonomous vehicles (AVs). These agents serve various purposes, from benchmarking AV performance to stress-testing the system's limits, but all use cases share a key requirement: reliability. A simulation agent should behave as intended by the designer, minimizing unintended actions like collisions that can compromise the signal-to-noise ratio of analyses. As a foundation for reliable sim agents, we propose scaling self-play to thousands of scenarios on the Waymo Open Motion Dataset under semi-realistic limits on human perception and control. Training from scratch on a single GPU, our agents nearly solve the full training set within a day. They generalize effectively to unseen test scenes, achieving a 99.8% goal completion rate with less than 0.8% combined collision and off-road incidents across 10,000 held-out scenarios. Beyond in-distribution generalization, our agents show partial robustness to out-of-distribution scenes and can be fine-tuned in minutes to reach near-perfect performance in those cases. Demonstrations of agent behaviors can be found at this link. We open-source both the pre-trained agents and the complete code base. Demonstrations of agent behaviors can be found at \url{this https URL}.

Comments:	First version
Subjects:	Artificial Intelligence (cs.AI); Robotics (cs.RO)
Cite as:	arXiv:2502.14706 [cs.AI]
	(or arXiv:2502.14706v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2502.14706

Submission history

From: Daphne Cornelisse [view email]
[v1] Thu, 20 Feb 2025 16:30:45 UTC (6,981 KB)

Computer Science > Artificial Intelligence

Title:Building reliable sim driving agents by scaling self-play

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:Building reliable sim driving agents by scaling self-play

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators