Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Personalized Reward Modeling on Reddit TLDR 100 examples Unseen

69.6User-level Accuracy

MRM

62.3264.2166.167.99Jan 26, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.01
69.6
2026.01
69
2026.01
68.6
2026.01
68
2026.01
68
2026.01
67.8
2026.01
67.7
2026.01
66.3
2026.01
65.9
2026.01
64.5
2026.01
62.6