Semantic Reasoning on Omnidrive
[Chart: Score by date. XEmbodied: 25.43 on Apr 20, 2026.]
Evaluation Results

Method              Tag                          Date      Score
XEmbodied           version=Best                 2026.04   25.43
Mimo-Embodied       Category=Embodied Models     2026.04   4.9
PR1-Grounding-2B    Category=Spatial Models      2026.04   4.85
GPT-4o              Category=Proprietary M...    2026.04   4.54
PR1-OCR-2B          Category=Spatial Models      2026.04   4.07
Gemini-1.5          Category=Proprietary M...    2026.04   2.73
Qwen2.5-VL-7B       Category=Open-Source M...    2026.04   2.53
PR1-Counting-2B     Category=Spatial Models      2026.04   2
Cosmos-R1           Category=Embodied Models     2026.04   1.23
UniVG-R1-7B         Category=Spatial Models      2026.04   1.2
DriveMM             Category=Embodied Models     2026.04   1.1
Qwen3-VL-A3B-30B    Category=Open-Source M...    2026.04   0.31
PR1-Detection-3B    Category=Spatial Models      2026.04   0.1
Qwen2.5-VL-32B      Category=Open-Source M...    2026.04   0
Qwen3-VL-32B        Category=Open-Source M...    2026.04   0