Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Alignment & Instruction Following on Scale AI Multi-Challenge
Loading...
61.5
Pass@1
Qwen3.5-122B-A10B
37.58
43.79
50
56.21
Mar 19, 2026
Mar 23, 2026
Mar 27, 2026
Apr 1, 2026
Apr 5, 2026
Apr 9, 2026
Apr 14, 2026
Pass@1
Updated 4d ago
Evaluation Results
Method
Method
Links
Pass@1
Qwen3.5-122B-A10B
2026.04
61.5
Qwen3.5 35B-A3B
2026.03
60
GPT-OSS-120B
2026.04
58.29
Nemotron 3 Super
2026.04
55.23
Nemotron-3-Super 120B-A12B
2026.03
55.2
Nemotron-Cascade-2 30B-A3B
2026.03
45.3
Nemotron-3-Nano 30B-A3B
2026.03
38.5
Feedback
Search any
task
Search any
task