Multi-task Driving Scene Understanding Robustness on The Dolphins Lvl. 1
[Chart: Final Score by date. Top entry: NutVLM, 45.71, Feb 9, 2026.]
Evaluation Results
Method        Date      Final Score
NutVLM        2026.02   45.71
NRP           2026.02   45.68
AAA           2026.02   45.64
Bit-Red       2026.02   45.59
Scene–CADA    2026.02   45.52
MS            2026.02   44.55
JPEG          2026.02   42.85
TVM           2026.02   37.6