Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reasoning over Large Structured Context on Hard

5ReasoningJudge Score

GPT-5 + HYVE

4.00164.26084.524.7792Apr 7, 2026
Updated 11d ago

Evaluation Results

MethodLinks
2026.04
520.316.96
2026.04
515.15.43
2026.04
4.3380.519.45
2026.04
4.0475.18.07