
Agentic Reasoning on HLE

Overall Score: 41.6

ChatGPT-Agent

Updated 4mo ago

Evaluation Results

Method	Links	Overall Score	Other Score
ChatGPT-Agent 2026.02		41.6	-
InternAgent-1.5 2026.02		40	40.87
InternAgent-1.5 2026.02		34.52	36.1
Kimi-Researcher 2026.02		26.9	-
Gemini DR 2026.02		26.9	-
OpenAI DR 2026.02		26.6	-
InternAgent-1.5 2026.02		14.84	15.04
MiroThinker 2026.02		-	31
MiroThinker 2026.02		-	39.2
Tongyi-DR 2026.02		-	32.9