Share your thoughts, 1 month free Claude Pro on usSee more

Expert-level Reasoning on Humanity's Last Exam 2,158 text-only

54.2Avg@3 Score

Seed-2.0-Pro

Updated 4mo ago

Evaluation Results

Method	Links
Seed-2.0-Pro 2026.03		54.2
Claude-4.6-Opus 2026.03		53.1
OpenAI-GPT-5.4 2026.03		52.1
Gemini-3.1-Pro 2026.03		51.4
GLM-5.0 2026.03		50.4
Kimi-K2.5 2026.03		50.2
Qwen3.5-397B 2026.03		48.3
MiroThinker-H1 2026.03		47.7
Gemini-3.0-Pro 2026.03		46.9
Claude-4.5-Opus 2026.03		43.2
MiroThinker-1.7 2026.03		42.9
DeepSeek-V3.2 2026.03		40.8
MiroThinker-1.7-mini 2026.03		36.4
OpenAI-GPT-5 2026.03		35.2
Tongyi-DeepResearch-30B 2026.03		32.9