Share your thoughts, 1 month free Claude Pro on usSee more

Sequential Decision Making on WebShop

0.88Score

SYMPHONY-L

Updated 4mo ago

Evaluation Results

Method	Links
SYMPHONY-L 2026.01		0.88	72
Human Expert 2026.01		0.82	60
SYMPHONY-S 2026.01		0.82	56
MASTER 2026.01		0.8	-
LATS 2026.01		0.76	38
AgentKit 2026.01		0.7	-
Fine-tuning 2026.01		0.68	45
Reflexion 2026.01		0.64	35
IL+RL 2026.01		0.62	29
IL 2026.01		0.6	29
ReAct 2026.01		0.54	32