Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

List operations evaluation on ListOps (3, 14) (test)

79.1Mean Accuracy

RIR-GRC

23.35637.82852.366.772May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
79.166.4
2026.05
79.167.7
2026.05
69.765.2
2026.05
63.346.4
2026.05
62.751.4
2026.05
37.631.2
2026.05
25.58.4