Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Human Preference Alignment on Multi-Challenge

49.4Avg@3

Qwen3-30A3-2507

35.25638.92842.646.272Dec 6, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
49.4
2025.12
41.8
2025.12
41.2
2025.12
39.2
2025.12
36.4
2025.12
35.8