Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

General Reasoning on General Reasoning Suite Average

78.3Pass@1

GPT-5.4-xhigh

8.339226.502144.66562.8279Apr 27, 2026May 1, 2026May 5, 2026May 9, 2026May 13, 2026May 17, 2026May 21, 2026
Updated 12d ago

Evaluation Results

MethodLinks
2026.05
78.3--
2026.05
74.2--
2026.05
73.2--
2026.05
71.3--
2026.05
70.9--
2026.05
69.81--
2026.05
68.62--
2026.05
68.4--
2026.05
66.4--
2026.05
64.81--
2026.05
64.28--
2026.05
64.28--
2026.05
62.8--
2026.05
62.1--
2026.05
60.7--
2026.05
60.6--
2026.05
60.3--
2026.05
59.48--
2026.05
59.2--
2026.05
58.12--
2026.05
58.12--
2026.05
57.2--
2026.05
57--
2026.05
57--
2026.05
56--
2026.05
55.3--
2026.05
54.7--
2026.05
53.1--
2026.05
52.6--
2026.05
51.8--
2026.05
51.4--
2026.05
51.15--
2026.05
49.33--
2026.05
48.91--
2026.05
46.29--
2026.05
46.27--
2026.05
44.68--
2026.05
43.8--
2026.05
39.54--
2026.05
39.54--
2026.05
38.6--
2026.04
38.32,7290.45
2026.05
37.8--
2026.04
37.773,3310.27
2026.05
37.73--
2026.04
37.623,1270.31
2026.04
37.444,4160
2026.04
37.222,2420.46
2026.05
36.6--
2026.05
36.6--
2026.04
36.552,4960.32
2026.05
34.54--
2026.05
32.8--
2026.05
32.55--
2026.05
31.13--
2026.05
30.9--
2026.05
28.9--
2026.05
27.5--
2026.05
26.01--
2026.05
24.57--
2026.05
24.5--
2026.05
18.17--
2026.05
11.03--