Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Personalized Reward Modeling on Reddit TLDR 150 examples Unseen

69.8User-level Accuracy

MRM

62.31264.25666.268.144Jan 26, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
69.8
2026.01
69.5
2026.01
68.8
2026.01
68.6
2026.01
68.2
2026.01
68
2026.01
67.9
2026.01
66.7
2026.01
66.3
2026.01
64.5
2026.01
62.6