Driving on Registers

About

We present DrivoR, a simple and efficient transformer-based architecture for end-to-end autonomous driving. Our approach builds on pretrained Vision Transformers (ViTs) and introduces camera-aware register tokens that compress multi-camera features into a compact scene representation, significantly reducing downstream computation without sacrificing accuracy. These tokens drive two lightweight transformer decoders that generate and then score candidate trajectories. The scoring decoder learns to mimic an oracle and predicts interpretable sub-scores representing aspects such as safety, comfort, and efficiency, enabling behavior-conditioned driving at inference. Despite its minimal design, DrivoR outperforms or matches strong contemporary baselines across NAVSIM-v1, NAVSIM-v2, and the photorealistic closed-loop HUGSIM benchmark. Our results show that a pure-transformer architecture, combined with targeted token compression, is sufficient for accurate, efficient, and adaptive end-to-end driving. Code and checkpoints will be made available via the project page.

Ellington Kirby, Alexandre Boulch, Yihong Xu, Yuan Yin, Gilles Puy, \'Eloi Zablocki, Andrei Bursuc, Spyros Gidaris, Renaud Marlet, Florent Bartoccioni, Anh-Quan Cao, Nermin Samet, Tuan-Hung VU, Matthieu Cord• 2026

Related benchmarks

Task	Dataset	Result
Autonomous Driving	NAVSIM v1 (test)	NC99.1	147
Autonomous Driving Planning	NAVSIM v1	NC99	126
Autonomous Driving Planning	NAVSIM navhard v2	NC98.8	88
Planning Evaluation	NAVSIM navhard v2	NC99.1	28
Autonomous Driving Trajectory Planning	NAVSIM navhard-two-stage v2 (test)	Stage 1 NC99.1	23
Planning	NAVSIM v1	PDMS93.7	23
End-to-end Autonomous Driving	BridgeSim NavHard	DS42.79	15
Autonomous Driving Planning	navhard Snapshot from 03/2026 (Stage 2)	NC Rate92.3	11
Autonomous Driving Planning	navhard Snapshot from 03/2026 (Stage 1)	NC99.1	11
Autonomous Driving Planning	navhard Snapshot from 03/2026 (Overall)	EPDMS54.5	11

Showing 10 of 12 rows

Other info

Follow for update

@wizwand_team Discord