Share your thoughts, 1 month free Claude Pro on usSee more

Plan Generation on RecipeNLG (test)

91.3BLEU

Self-Ask

Updated 5mo ago

Evaluation Results

Method	Links
Self-Ask 2026.01		91.3	15.7
ReAct 2026.01		91.2	59.9
SQ-BCP 2026.01		90.7	5.8
CoT 2026.01		90	64.1
ToS 2026.01		89.8	64.2
Direct Prompt 2026.01		89.7	65.7
ToT 2026.01		89.2	66.5