Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Creative Writing on HelloBench
Loading...
80.4
Accuracy
Qwen3-30B-A3B-Thinking
71.56
73.855
76.15
78.445
Apr 3, 2026
Accuracy
Delta (%)
Updated 12d ago
Evaluation Results
Method
Method
Links
Accuracy
Delta (%)
Qwen3-30B-A3B-Thinking
Thinking Mode=Thinking
2026.04
80.4
0.7
Qwen3-30B-A3B-Instruct
Thinking Mode=No Thinking
2026.04
79.8
-
DeepSeek-R1-0528
Thinking Mode=Thinking
2026.04
78
0.7
DeepSeek-V3.1
Thinking Mode=No Thinking
2026.04
77.4
-
Qwen3-8B
Thinking Mode=Thinking
2026.04
72.2
0.4
Qwen3-8B
Thinking Mode=No Thinking
2026.04
71.9
-
Feedback
Search any
task
Search any
task