Drive My Way: Preference Alignment of Vision-Language-Action Model for Personalized Driving

About

Human driving behavior is inherently personal, which is shaped by long-term habits and influenced by short-term intentions. Individuals differ in how they accelerate, brake, merge, yield, and overtake across diverse situations. However, existing end-to-end autonomous driving systems either optimize for generic objectives or rely on fixed driving modes, lacking the ability to adapt to individual preferences or interpret natural language intent. To address this gap, we propose Drive My Way (DMW), a personalized Vision-Language-Action (VLA) driving framework that aligns with users' long-term driving habits and adapts to real-time user instructions. DMW learns a user embedding from our personalized driving dataset collected across multiple real drivers and conditions the policy on this embedding during planning, while natural language instructions provide additional short-term guidance. Closed-loop evaluation on the Bench2Drive benchmark demonstrates that DMW improves style instruction adaptation, and user studies show that its generated behaviors are recognizable as each driver's own style, highlighting personalization as a key capability for human-centered autonomous driving. Our data and code are available at https://dmw-cvpr.github.io/.

Zehao Wang, Huaide Jiang, Shuaiwu Dong, Yuping Wang, Hang Qiu, Jiachen Li• 2026

Related benchmarks

Task	Dataset	Result
Closed-loop Autonomous Driving	Bench2Drive	Driving Score (DS)82.72	74
Closed-loop Autonomous Driving	Bench2Drive closed-loop	DS82.72	46
Instruction-following Trajectory Evaluation	Personalized Driving Dataset Merging Scenario v1.0 (test)	Evaluation Score 18.9	9
Instruction-following Trajectory Evaluation	Personalized Driving Dataset Overtaking Scenario v1.0 (test)	E1 Score9.1	9
Instruction-following Trajectory Evaluation	Personalized Driving Dataset Traffic Sign Scenario v1.0 (test)	Trajectory Score E18.9	9
Instruction-following Trajectory Evaluation	Personalized Driving Dataset Emergency Brake Scenario v1.0 (test)	Error Metric 17.8	9
Long-term preference alignment	StyleDrive (ID)	Alignment Score (D1)92	2
Long-term preference alignment	StyleDrive (OOD)	Alignment Score (D3)83	2

Showing 8 of 8 rows

Other info

Follow for update

@wizwand_team Discord