Personalized Reward Modeling on Reddit TLDR (150 examples, Seen)
[Chart: User-level Accuracy over time. Top result: MRM, 69.7 (Jan 26, 2026).]
Updated 4d ago
Evaluation Results

Method             Configuration                Date      User-level Accuracy
MRM                Base Reward Model=Skyw...    2026.01   69.7
MRM                Base Reward Model=Skyw...    2026.01   69
GPO                                             2026.01   68.5
LoRe                                            2026.01   68.5
SynthesizeMe       Adaptation Protocol=FT       2026.01   68.2
BT                                              2026.01   68.1
VPL                                             2026.01   67.7
SynthesizeMe       Adaptation Protocol=ICL      2026.01   66.7
PAL                                             2026.01   66.3
Skywork-Reward V2  Base Reward Model=V2         2026.01   64.4
Skywork-Reward V1  Base Reward Model=V1         2026.01   62.6