Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Creative Writing on ROCStories + WritingPrompts Llama-3-70B (test)
Loading...
63.3
QD-Score
QD-LLM
35.324
42.587
49.85
57.113
May 10, 2026
QD-Score
Median Score [IQR]
Covariance
SB
Metric A
Updated 21d ago
Evaluation Results
Method
Method
Links
QD-Score
Median Score [IQR]
Covariance
SB
Metric A
QD-LLM
Backbone=Llama-3-70B-I...
2026.05
63.3
63.1
0.52
0.28
0.96
CMA-ME (ad.)
Backbone=Llama-3-70B-I...
2026.05
46.2
46
0.39
0.38
-
QDAIF
Backbone=Llama-3-70B-I...
2026.05
44.8
44.6
0.37
0.4
-
Best-of-N+MMR
Backbone=Llama-3-70B-I...
2026.05
40.1
39.9
0.31
0.46
-
Diverse Beam
Backbone=Llama-3-70B-I...
2026.05
38.2
38
0.28
0.52
-
Nucleus
Backbone=Llama-3-70B-I...
2026.05
36.4
36.2
0.26
0.56
-
Feedback
Search any
task
Search any
task