
Structural Reasoning on C2S

Accuracy: 77.9

GPT-4o

[Chart: accuracy over time; date label May 2, 2025]
Updated 1mo ago

Evaluation Results

Date      Accuracy
2025.05   77.9
2025.05   75.6
2025.05   75.2
2025.05   73.5
2025.05   68.5
2025.05   68.4
2025.05   65.5
2025.05   50.6
2025.05   49.4