Viewpoint Robustness on Progress-Bench 1.0 (test)

-1.1 ΔNSE

Intern3.5-VL-8B

-2.1164.74211.618.458
Jan 21, 2026
Updated 1mo ago

Evaluation Results

Method  Links
2026.01    -1.1    10.5    -0.1
2026.01    -0.9   -21      -0.5
2026.01     1.3     5       0
2026.01     4.2   -14.1    -3.4
2026.01     4.3   -64.5     0
2026.01     4.9   -16.2     8.2
2026.01     4.9    -4.7    11.6
2026.01     5.9     0       1.8
2026.01     6     -10.5     0.4
2026.01     6.1   -20.4     0
2026.01     6.2    -4.1    -0.1
2026.01     9.2   -25.2    -4.6
2026.01    10.5   -25.1    50.8
2026.01    12.5   -20.7     0
2026.01    14.1   -33.2    13
2026.01    24.3   -43.5     0