Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Joke Generation on Joke
Loading...
12.8
Quality
SLR
11.24
11.645
12.05
12.455
Feb 6, 2026
Quality
Diversity
Updated 4d ago
Evaluation Results
Method
Method
Links
Quality
Diversity
SLR
Backbone=Llama
2026.02
12.8
33.2
Post-trained
Backbone=Llama
2026.02
12.6
25.1
Proxy-Soup
Backbone=Llama
2026.02
12.1
29.9
Post-trained
Backbone=Gemma
2026.02
12
20.8
Post-trained
Backbone=Qwen
2026.02
11.6
9.5
SLR
Backbone=Qwen
2026.02
11.6
19
Proxy-Soup
Backbone=Gemma
2026.02
11.6
19
SLR
Backbone=Gemma
2026.02
11.5
29.4
Proxy-Soup
Backbone=Qwen
2026.02
11.3
22
Feedback
Search any
task
Search any
task