Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Creative Writing on WildBench (test)
Loading...
64.4
WildBench Score
Full
-2.576
14.812
32.2
49.588
Mar 19, 2026
WildBench Score
Updated 29d ago
Evaluation Results
Method
Method
Links
WildBench Score
Full
Model=Qwen3, Ratio=0%,...
2026.03
64.4
Frequency
Model=Qwen3, Ratio=25%...
2026.03
63.2
EAN
Model=Qwen3, Ratio=25%...
2026.03
62.6
SEER
Model=Qwen3, Ratio=25%...
2026.03
61.2
AIMER
Model=Qwen3, Ratio=25%...
2026.03
60.4
REAP
Model=Qwen3, Ratio=25%...
2026.03
56.8
EAN
Model=Qwen3, Ratio=50%...
2026.03
52.2
Random
Model=Qwen3, Ratio=25%...
2026.03
46.1
AIMER
Model=Qwen3, Ratio=50%...
2026.03
41.3
REAP
Model=Qwen3, Ratio=50%...
2026.03
22.6
Magnitude
Model=Qwen3, Ratio=25%...
2026.03
11.9
Random
Model=Qwen3, Ratio=50%...
2026.03
2.7
SEER
Model=Qwen3, Ratio=50%...
2026.03
1.8
Frequency
Model=Qwen3, Ratio=50%...
2026.03
1.5
Magnitude
Model=Qwen3, Ratio=50%...
2026.03
0
Feedback
Search any
task
Search any
task