Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

List operations evaluation on ListOps (5, 14) (test)

53.1Mean Accuracy

BBT-GRC

21.69229.8463846.154May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
53.141.4
2026.05
51.638.8
2026.05
4941.5
2026.05
48.138.9
2026.05
41.725.8
2026.05
32.524.2
2026.05
22.98.7