Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Open-ended Generation on NoveltyBench, WildChat, and Narrative-Discourse Average (test)

49Lexical Dominance

Aligned

-1.64811.50124.6537.799Nov 7, 2025
Updated 22h ago

Evaluation Results

MethodLinks
2025.11
4926.910.429.218.639
2025.11
24.944.53640.540.332.7
2025.11
12.79.89.8169.814.3
2025.11
9.327.624.79.926.19.6
2025.11
2.7--2.2-2.4
2025.11
1.1--1.9-1.5
2025.11
0.3--0.3-0.3