Share your thoughts, 1 month free Claude Pro on usSee more

Text-based Task Completion on Jericho

3.37Mean Normalised Score

ReAct

Updated 3mo ago

Evaluation Results

Method	Links
ReAct 2026.04		3.37
DORA 2026.04		2.92
ReAct 2026.04		2.88
Chain of Thought 2026.04		2.22
Zero-shot 2026.04		2.21
DORA 2026.04		2.17
Tree of Thought 2026.04		2.05
ReAct 2026.04		1.87
Prompt Explore 2026.04		1.7
Prompt Explore 2026.04		1.6
Chain of Thought 2026.04		1.58
Chain of Thought 2026.04		1.43
DORA 2026.04		1.42
Tree of Thought 2026.04		1.4
Zero-shot 2026.04		1.35
Prompt Explore 2026.04		1.27
Tree of Thought 2026.04		1.27
Zero-shot 2026.04		0.66