Open-ended Generation on NoveltyBench, WildChat, and Narrative-Discourse Average (test)

Aligned: Lexical Dominance 49

Updated 1mo ago

Evaluation Results

Method	Date	Lexical Dominance					
Aligned	2025.11	49	26.9	10.4	29.2	18.6	39
BACO	2025.11	24.9	44.5	36	40.5	40.3	32.7
Base	2025.11	12.7	9.8	9.8	16	9.8	14.3
Nudging	2025.11	9.3	27.6	24.7	9.9	26.1	9.6
Prompting	2025.11	2.7	-	-	2.2	-	2.4
Ensemble	2025.11	1.1	-	-	1.9	-	1.5
Decoding	2025.11	0.3	-	-	0.3	-	0.3