Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning over Large Structured Context on Hard

5ReasoningJudge Score

GPT-5 + HYVE

Updated 3mo ago

Evaluation Results

Method	Links
GPT-5 + HYVE 2026.04		5	20.3	16.96
GPT-4.1 + HYVE 2026.04		5	15.1	5.43
GPT-5 2026.04		4.33	80.5	19.45
GPT-4.1 2026.04		4.04	75.1	8.07