Personalized Reward Modeling on LaMP 4

17.9 ROUGE@N=30

Probabilistic User RM

Updated 2mo ago

Evaluation Results

Method	Links	ROUGE@N=30
Probabilistic User RM 2026.05		17.9	-	15.5
LLM tournament (Qwen3-4B) 2026.05		15.7	49.9	1.3
Random 2026.05		15.5	50	-
LLM pointwise (GPT-class) 2026.05		14.4	-	7.1