Creative Writing Generation on Creative Writing (Score)
[Chart: Score over time. Top result: Gemini-2.5-pro, 71.58 (May 28, 2026).]
Updated 5d ago
Evaluation Results

Method                  Backbone     Date     Score
Gemini-2.5-pro          Proprietary  2026.05  71.58
EvoRubric               Qwen3-14B    2026.05  69.88
External Evolving-RL    Qwen3-14B    2026.05  68.67
GPT-4o                  Proprietary  2026.05  68.66
Static Rubric-RL        Qwen3-14B    2026.05  68.64
EvoRubric               Qwen3-8B     2026.05  66.99
Base Model              Qwen3-14B    2026.05  66.53
External Evolving-RL    Qwen3-8B     2026.05  65.75
Static Rubric-RL        Qwen3-8B     2026.05  65.69
Base Model              Qwen3-8B     2026.05  61.9