Agentic tasks on BrowseComp

Best accuracy: 9.45 (MAS-ZERO)

Updated 4mo ago

Evaluation Results

Method	Date	Accuracy
MAS-ZERO	2025.05	9.45
CoT-SC	2025.05	8.66
Self-Refine	2025.05	5.51
CoT	2025.05	3.97
Debate	2025.05	3.94