Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Chat on AE2 LC
Loading...
61.7
Win Rate
Qwen3-8B (Non-thinking)
5.436
20.043
34.65
49.257
May 14, 2026
Win Rate
Updated 16d ago
Evaluation Results
Method
Method
Links
Win Rate
Qwen3-8B (Non-thinking)
Backbone=Qwen3-8B, Tra...
2026.05
61.7
Qwen3-8B-GRLO
Backbone=Qwen3-8B, Tra...
2026.05
57.8
Qwen3-8B-GRLO+RLVR
Backbone=Qwen3-8B, Tra...
2026.05
55.9
Qwen3-8B-RLVR
Backbone=Qwen3-8B, Tra...
2026.05
23.6
Qwen3-8B-Base
Backbone=Qwen3-8B, Tra...
2026.05
12.8
Qwen3-8B-MathSFT
Backbone=Qwen3-8B, Tra...
2026.05
7.6
Feedback
Search any
task
Search any
task