Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reddit Summary (Faithful vs Preference 1) on Reddit (test)
Loading...
1.23
Hypervolume
Rewards-in-Context
0.5852
0.7526
0.92
1.0874
Feb 15, 2025
Hypervolume
Inner Product
Controllability
Length of Front
Sparsity
Spacing
Updated 4d ago
Evaluation Results
Method
Method
Links
Hypervolume
Inner Product
Controllability
Length of Front
Sparsity
Spacing
Rewards-in-Context
Backbone=LLama-2 7B
2025.02
1.23
2.03
82
6
39
0.08
Bone Soup
Backbone=LLama-2 7B
2025.02
1.12
1.89
100
11
29
0.03
MOD
Backbone=LLama-2 7B
2025.02
0.62
1.17
100
11
18
0.02
Rewarded Soups
Backbone=LLama-2 7B
2025.02
0.61
1.13
100
11
17
0.01
Feedback
Search any
task
Search any
task