Share your thoughts, 1 month free Claude Pro on usSee more

Text-based Task Completion on AlfWorld

2.78Mean Normalised Score

ReAct

Updated 3mo ago

Evaluation Results

Method	Links
ReAct 2026.04		2.78
DORA 2026.04		2.78
DORA 2026.04		2.78
DORA 2026.04		2.78
Zero-shot 2026.04		0
Chain of Thought 2026.04		0
Tree of Thought 2026.04		0
Prompt Explore 2026.04		0
Zero-shot 2026.04		0
Chain of Thought 2026.04		0
Tree of Thought 2026.04		0
Prompt Explore 2026.04		0
ReAct 2026.04		0
Zero-shot 2026.04		0
Chain of Thought 2026.04		0
Tree of Thought 2026.04		0
Prompt Explore 2026.04		0
ReAct 2026.04		0