Code Reasoning on CRUXEval I
[Chart: Accuracy by date. Kimi-K2 Base: 74 (Jan 6, 2026)]
Updated 4d ago
Evaluation Results
Method                  Settings                    Date     Accuracy
Kimi-K2 Base            # Shots=1-shot, # Acti...   2026.01  74
MiMo-V2-Flash Base      # Shots=1-shot, # Acti...   2026.01  67.5
DeepSeek-V3.2 Exp Base  # Shots=1-shot, # Acti...   2026.01  63.9
DeepSeek-V3.1 Base      # Shots=1-shot, # Acti...   2026.01  62.1