Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Human Preference Alignment on Multi-Challenge

49.4Avg@3

Qwen3-30A3-2507

35.25638.92842.646.272Dec 6, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
49.4
2025.12
41.8
2025.12
41.2
2025.12
39.2
2025.12
36.4
2025.12
35.8