Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Reasoning on CRUX

87.37Accuracy

RMoA

18.865236.650154.43572.2199May 26, 2025Jul 23, 2025Sep 19, 2025Nov 17, 2025Jan 14, 2026Mar 13, 2026May 11, 2026
Updated 21d ago

Evaluation Results

MethodLinks
2025.05
87.37
2025.05
86.93
2025.05
86.66
2025.05
75.8
2025.05
61
2025.05
59.93
2025.05
57.31
2025.05
56.81
2025.05
51.5
2025.05
51.25
2025.05
50.5
2025.05
47.5
2025.05
46.12
2025.05
44.81
2025.05
42.65
2025.05
40.62
2026.05
39.38
2026.05
35.62
2025.05
29.3
2025.05
28.1
2025.05
26.6
2025.05
26.5
2026.05
24.38
2025.05
23.8
2025.05
21.8
2025.05
21.5