Share your thoughts, 1 month free Claude Pro on usSee more

Multi-domain Knowledge and Reasoning on HLE (official)

42Exact Match

GPT-5 pro

Updated 5mo ago

Evaluation Results

Method	Links
GPT-5 pro 2026.02		42
ChatGPT Agent 2026.02		41.6
Tendem’s AI agent 2026.02		39
GPT-5 high 2026.02		35.2
ChatGPT Deep Research 2026.02		26.6
o3 high 2026.02		24.3
Perplexity Deep Research 2026.02		21.1