Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Personalized Reward Modeling on Reddit TLDR 150 examples Unseen
Loading...
69.8
User-level Accuracy
MRM
62.312
64.256
66.2
68.144
Jan 26, 2026
User-level Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
User-level Accuracy
MRM
Base Reward Model=Skyw...
2026.01
69.8
MRM
Base Reward Model=Skyw...
2026.01
69.5
LoRe
2026.01
68.8
GPO
2026.01
68.6
BT
2026.01
68.2
SynthesizeMe
Adaptation Protocol=FT
2026.01
68
VPL
2026.01
67.9
PAL
2026.01
66.7
SynthesizeMe
Adaptation Protocol=ICL
2026.01
66.3
Skywork-Reward V2
Base Reward Model=V2
2026.01
64.5
Skywork-Reward V1
Base Reward Model=V1
2026.01
62.6
Feedback
Search any
task
Search any
task