BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving

Winter, Katharina; Azer, Mark; Flohr, Fabian B.

Computer Science > Robotics

arXiv:2503.03074 (cs)

[Submitted on 5 Mar 2025]

Title:BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving

Authors:Katharina Winter, Mark Azer, Fabian B. Flohr

View PDF HTML (experimental)

Abstract:Autonomous driving has the potential to set the stage for more efficient future mobility, requiring the research domain to establish trust through safe, reliable and transparent driving. Large Language Models (LLMs) possess reasoning capabilities and natural language understanding, presenting the potential to serve as generalized decision-makers for ego-motion planning that can interact with humans and navigate environments designed for human drivers. While this research avenue is promising, current autonomous driving approaches are challenged by combining 3D spatial grounding and the reasoning and language capabilities of LLMs. We introduce BEVDriver, an LLM-based model for end-to-end closed-loop driving in CARLA that utilizes latent BEV features as perception input. BEVDriver includes a BEV encoder to efficiently process multi-view images and 3D LiDAR point clouds. Within a common latent space, the BEV features are propagated through a Q-Former to align with natural language instructions and passed to the LLM that predicts and plans precise future trajectories while considering navigation instructions and critical scenarios. On the LangAuto benchmark, our model reaches up to 18.9% higher performance on the Driving Score compared to SoTA methods.

Comments:	This work has been submitted to the IEEE for possible publication
Subjects:	Robotics (cs.RO); Computer Vision and Pattern Recognition (cs.CV)
Cite as:	arXiv:2503.03074 [cs.RO]
	(or arXiv:2503.03074v1 [cs.RO] for this version)
	https://doi.org/10.48550/arXiv.2503.03074

Submission history

From: Fabian Flohr [view email]
[v1] Wed, 5 Mar 2025 00:27:32 UTC (4,843 KB)

Computer Science > Robotics

Title:BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Robotics

Title:BEVDriver: Leveraging BEV Maps in LLMs for Robust Closed-Loop Driving

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators