Semantic Reasoning on DriveBench
[Chart: Score by date. XEmbodied: 53.18 (Apr 20, 2026).]
Evaluation Results
| Method | Attributes | Date | Score |
|---|---|---|---|
| XEmbodied | version=Best | 2026.04 | 53.18 |
| Qwen2.5-VL-32B | Category=Open-Source M... | 2026.04 | 53.06 |
| Mimo-Embodied | Category=Embodied Models | 2026.04 | 52.95 |
| PR1-Grounding-2B | Category=Spatial Models | 2026.04 | 52.03 |
| Qwen3-VL-32B | Category=Open-Source M... | 2026.04 | 51.7 |
| PR1-OCR-2B | Category=Spatial Models | 2026.04 | 50.73 |
| UniVG-R1-7B | Category=Spatial Models | 2026.04 | 48.75 |
| GPT-4o | Category=Proprietary M... | 2026.04 | 47.97 |
| Qwen3-VL-A3B-30B | Category=Open-Source M... | 2026.04 | 46.22 |
| DriveMM | Category=Embodied Models | 2026.04 | 44.5 |
| Qwen2.5-VL-7B | Category=Open-Source M... | 2026.04 | 43.29 |
| PR1-Detection-3B | Category=Spatial Models | 2026.04 | 41.35 |
| PR1-Counting-2B | Category=Spatial Models | 2026.04 | 39.72 |
| Cosmos-R1 | Category=Embodied Models | 2026.04 | 35.8 |