
Stack Manipulation on Formal-language benchmark lengths 41-500 (test)

Accuracy: 100

SRNN

[Chart: Accuracy (axis ticks 54.24, 66.12, 78, 89.88); May 16, 2026]
Updated 16d ago

Evaluation Results

Date      Accuracy
2026.05   100
2026.05   100
2026.05   59.1
2026.05   57.5
2026.05   56.3
2026.05   56