
Multi-agent policy synthesis on Gathering

Metric U: 4.59

Gemini 3.1 Pro

Updated 4mo ago

Evaluation Results

Method	Links	Metric U	(unlabeled)	(unlabeled)
Gemini 3.1 Pro 2026.03		4.59	97	502.7
Gemini 3.1 Pro 2026.03		4.58	97	502.5
Gemini 3.1 Pro 2026.03		3.71	79	443.2
Claude Sonnet 4.6 2026.03		3.53	84	452.7
Claude Sonnet 4.6 2026.03		3.47	72	402.9
GEPA (Gemini 3.1 Pro) 2026.03		3.45	91	496.2
Claude Sonnet 4.6 2026.03		1.85	52	298.6
BFS Collector 2026.03		1.29	54	489.5
Q-learner 2026.03		0.77	83	508.2