Reward Modeling on WebGPT
[Chart: Accuracy by date. Highlighted result: UMM-RM, 58.4 accuracy. Date shown: Nov 30, 2025.]
Updated 4d ago
Evaluation Results
Method     Details                      Date      Accuracy
UMM-RM     Base Model=Qwen2.5-0.5...    2025.11   58.4
UMM-RM     Base Model=Qwen2.5-0.5...    2025.11   58.2
UMM-RM     Base Model=Qwen2.5-0.5...    2025.11   57.8
UMM-RM     Base Model=Pythia-1.4B...    2025.11   57.8
Dense RM   Base Model=Qwen2.5-0.5B      2025.11   57.2
UMM-RM     Base Model=Pythia-1.4B...    2025.11   54.2
UMM-RM     Base Model=Pythia-1.4B...    2025.11   54
Dense RM   Base Model=Pythia-1.4B       2025.11   50.8