Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

List operations evaluation on ListOps (3, 9) (test)

89.9Mean Accuracy

RIR-GRC

33.7448.3262.977.48May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
89.979.2
2026.05
84.172.5
2026.05
74.767.8
2026.05
68.255.1
2026.05
65.950
2026.05
37.332.1
2026.05
35.99.8