Code on CRUX
[Chart: Accuracy over time. Best result: Qwen2.5-14B-Instruct-1M, 66.4 accuracy, Aug 28, 2025]
Updated 4d ago
Evaluation Results
Method                    Details                      Date      Accuracy
Qwen2.5-14B-Instruct-1M   Size=14B, Type=Instruc...    2025.08   66.4
NPG-Muse-8B               Backbone=Qwen3-8B-Base       2025.08   65.3
Qwen3-14B-Base            Size=14B, Type=Base          2025.08   63.9
NPG-Muse-7B               Backbone=Qwen2.5-7B-In...    2025.08   59
Qwen2.5-7B-Instruct-1M    Size=7B, Type=Instruct...    2025.08   55.6
Qwen3-8B-Base             Size=8B, Type=Base           2025.08   54.1