Share your thoughts, 1 month free Claude Pro on usSee more

Embodied AI Reasoning on ALFWorld

100CoT Match Rate

OPT-13B

Updated 4mo ago

Evaluation Results

Method	Links
OPT-13B 2025.05		100
LLaMA-13B 2025.05		100
Structured Agent Distillation 2025.05		77.2
Structured Agent Distillation 2025.05		76.4
Token-level 2025.05		73
Token-level 2025.05		72.2
Structured Agent Distillation 2025.05		71.6
SeqKD 2025.05		70.1
SeqKD 2025.05		68.7
KD 2025.05		68.3
Token-level 2025.05		67.3
Structured Agent Distillation 2025.05		67.2
KD 2025.05		66.4
SeqKD 2025.05		63.4
Token-level 2025.05		61.5
KD 2025.05		61.5
SeqKD 2025.05		58
KD 2025.05		56.2