Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Code Reasoning on CRUX

87.37Accuracy

RMoA

18.865236.650154.43572.2199May 26, 2025May 27, 2025May 28, 2025May 29, 2025May 30, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.05
87.37
2025.05
86.93
2025.05
86.66
2025.05
75.8
2025.05
61
2025.05
59.93
2025.05
57.31
2025.05
56.81
2025.05
51.5
2025.05
51.25
2025.05
50.5
2025.05
47.5
2025.05
46.12
2025.05
44.81
2025.05
42.65
2025.05
40.62
2025.05
29.3
2025.05
28.1
2025.05
26.6
2025.05
26.5
2025.05
23.8
2025.05
21.8
2025.05
21.5