Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Task 1 on BlenderBench
Loading...
60.82
PL
Qwen3-VL-8B
6.4696
20.5798
34.69
48.8002
Jan 16, 2026
PL
N-CLIP
VLM Score
Improvement (%)
Updated 4d ago
Evaluation Results
Method
Method
Links
PL
N-CLIP
VLM Score
Improvement (%)
Qwen3-VL-8B
Setting=One-Shot
2026.01
60.82
78.16
28
-
GPT-4o
Setting=One-Shot
2026.01
48.16
64.17
58
-
Gemini-2.5-Pro
Setting=One-Shot
2026.01
40.65
49.57
175
-
Claude-Sonnet-4
Setting=One-Shot
2026.01
20.26
33.82
136
-
Gemini-2.5-Pro
Setting=VIGA
2026.01
17.77
20.58
233
41.12
Qwen3-VL-8B
Setting=VIGA
2026.01
10.54
17.27
131
112.79
Claude-Sonnet-4
Setting=VIGA
2026.01
9.42
11.62
247
53.07
GPT-4o
Setting=VIGA
2026.01
8.56
18.19
144
113.96
Feedback
Search any
task
Search any
task