
Agentic Tasks on Frames

70.45 Accuracy

Debate

Updated 4mo ago

Evaluation Results

Method	Date	Accuracy
Debate	2025.05	70.45
Self-Refine	2025.05	67.89
MAS-ZERO	2025.05	65.18
CoT-SC	2025.05	63.58
CoT	2025.05	59.76