Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Complex Reasoning on Seal-0

53.4Accuracy (Seal-0)

Claude-4.5-Sonnet

24.07231.68639.346.914Apr 20, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.04
53.4
2026.04
51.4
2026.04
41.8
2026.04
40.5
2026.04
40.4
2026.04
38.5
2026.04
36
2026.04
25.2