Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Logical operations parsing on ListOps mid L1024
Loading...
85.4
Accuracy
PCT
7.4
27.65
47.9
68.15
May 11, 2026
Accuracy
Updated 22d ago
Evaluation Results
Method
Method
Links
Accuracy
PCT
Cell type=PCT
2026.05
85.4
complex_screen
Cell type=c_screen
2026.05
83.3
real_screen
Cell type=r_screen
2026.05
69.8
real_sigmoid
Cell type=r_sigmoid
2026.05
17.7
real_softmax
Cell type=r_softmax
2026.05
14.6
complex_softmax
Cell type=c_softmax
2026.05
10.4
Feedback
Search any
task
Search any
task