Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Language Understanding and Reasoning on LUE Suite (Zyda2 calibration test)

58.28ARC-c

Moonlight

39.019244.019649.0254.0204Jul 1, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.07
58.2882.4980.49267.345.681.1265.771.1171.56
2025.07
55.8980.679.579055.2346.880.8561.0171.9869.1
2025.07
55.880.6478.699046.7346.481.0158.8472.367.82
2025.07
48.5576.0175.939055.8442.277.9764.2668.1966.55
2025.07
47.6173.1578.728946.1143.680.3656.3271.4365.14
2025.07
39.7652.6938.97942.5732.268.561.0162.0452.96