Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Logic Reasoning on ZebraLogic
Loading...
0.817
Avg Accuracy @1
NPR
0.3334
0.45895
0.5845
0.71005
Dec 8, 2025
Avg Accuracy @1
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg Accuracy @1
NPR
Data=orz-8k, Train=Par...
2025.12
0.817
Qwen3-4B-Instruct-2507
Data=N/A, Train=N/A, B...
2025.12
0.802
SR
Data=orz-8k, Train=Seq...
2025.12
0.789
NPR-BETA
Data=orz-8k, Train=Par...
2025.12
0.761
NPR (Variant)
Data=orz-8k, Train=Par...
2025.12
0.758
SR-BETA
Data=orz-8k, Train=Seq...
2025.12
0.728
NPR-BETA (Variant)
Data=orz-8k, Train=Par...
2025.12
0.7
Multiverse-4B
Data=s1.1-8k, Train=S→...
2025.12
0.602
Multiverse-32B
Data=s1.1-8k, Train=S→...
2025.12
0.471
Qwen2.5-32B-Instruct
Data=N/A, Train=N/A, B...
2025.12
0.436
Qwen3-4B (Non-Thinking)
Data=N/A, Train=N/A, B...
2025.12
0.352
Feedback
Search any
task
Search any
task