Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Personalized LLM Alignment Evaluation on PersonalRewardBench (test)

3.354Mean Score

Llama3.1-8B-Instruct-GRPO

2.9383.0463.1543.262Feb 12, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
3.3540.0102-
2026.02
3.3160.0068-
2026.02
3.2140.0089-
2026.02
3.1560.0093-
2026.02
2.970.0089-
2026.02
2.9540.0074-