Share your thoughts, 1 month free Claude Pro on usSee more

Reasoning over Large Structured Context on Canvas

4.96ReasoningJudge Score

GPT-5

Updated 3mo ago

Evaluation Results

Method	Links
GPT-5 2026.04		4.96	122.8	10.45
GPT-5 + HYVE 2026.04		4.96	38.2	8.99
GPT-4.1 + HYVE 2026.04		4.95	35.1	3
GPT-4.1 2026.04		4.94	123.4	3.22