List operations evaluation on ListOps (5, 9) (test)
[Chart: Mean Accuracy and Longest Bin Accuracy. Best Mean Accuracy: 49.6 (BBT-GRC). Date shown: May 25, 2026. Updated 8d ago.]
Evaluation Results
Method      Configuration               Links    Mean Accuracy  Longest Bin Accuracy
BBT-GRC     -                           2026.05  49.6           39.2
RIR-GRC     -                           2026.05  48.1           36.4
MLP-LDRU    -                           2026.05  45.9           38.8
LSTM        -                           2026.05  44.9           36
TF (ALiBi)  Positional Encoding=ALiBi   2026.05  39             22.8
TF (NoPE)   Positional Encoding=None    2026.05  29.9           22.7
TF (Sin.)   Positional Encoding=Si...   2026.05  21.5           8.7