Story Generation on DecTest story_gen no_hds (1000 samples)
Top result: a_n, Spearman ρ = 0.779 (Jun 1, 2026)
[Chart: Spearman ρ (y-axis) by date (x-axis), a_n series]
Evaluation Results
Method      Configuration                Date      Spearman ρ
a_n         model=Qwen2.5-3B, perm...    2026.06    0.779
C x a_n     model=Qwen2.5-3B, perm...    2026.06    0.763
distinct-n                               2026.06    0.758
cos-sim                                  2026.06    0.712
BERTScore                                2026.06    0.694
SentBERT                                 2026.06    0.645
C           model=Qwen2.5-3B, perm...    2026.06   -0.547