Preference Classification on WebGPT comparisons (test)
[Chart: Accuracy by date. Top entry: UMM-RM, 60.8 accuracy (Nov 30, 2025).]
Evaluation Results
Method                             Details                    Date     Accuracy
UMM-RM                             Backbone=TinyLlama-1.1...  2025.11  60.8
Worst-Case Optimization            Backbone=TinyLlama-1.1...  2025.11  60.6
Uncertainty-Weighted Optimization  Backbone=TinyLlama-1.1...  2025.11  59.6
UMM-RM                             Backbone=TinyLlama-1.1...  2025.11  58.6
UMM-RM                             Backbone=TinyLlama-1.1...  2025.11  57.8
Dense RM                           Backbone=TinyLlama-1.1B    2025.11  52.2
Mean Optimization                  Backbone=TinyLlama-1.1...  2025.11  51.4