Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Coding Reasoning on LCB
Loading...
53.4
Avg@4
RePro
49.656
50.628
51.6
52.572
Dec 1, 2025
Avg@4
Updated 4d ago
Evaluation Results
Method
Method
Links
Avg@4
RePro
Backbone=Qwen3-8B, Bas...
2025.12
53.4
GRPO
Backbone=Qwen3-8B
2025.12
52.2
Original
Backbone=Qwen3-8B
2025.12
49.8
Feedback
Search any
task
Search any
task