Multi-task Driving Scene Understanding Robustness on The Dolphins (Lvl. 0)
Top result: NutVLM, Final Score 46.83
[Chart: Final Score over time; data point dated Feb 9, 2026]
Evaluation Results
| Method | Date | Final Score |
|---|---|---|
| NutVLM | 2026.02 | 46.83 |
| NRP | 2026.02 | 45.55 |
| MS | 2026.02 | 45.25 |
| Bit-Red | 2026.02 | 45.1 |
| AAA | 2026.02 | 45.05 |
| Scene–CADA | 2026.02 | 45.03 |
| JPEG | 2026.02 | 43.2 |
| TVM | 2026.02 | 38.15 |