Logical Expression Evaluation on ListOps-O Length Generalization (Lengths 900-1000)
[Chart: Accuracy over time. Top result: EBT-GRC, 99.5 accuracy, Nov 8, 2023.]
Updated 4d ago
Evaluation Results
| Method | Notes | Links (date) | Accuracy |
| --- | --- | --- | --- |
| EBT-GRC | | 2023.11 | 99.5 |
| RIR-EBT-GRC (-S4D) | Backbone=S4D | 2023.11 | 98.6 |
| CRVNN | | 2023.11 | 98 |
| BT-GRC OS | | 2023.11 | 97.2 |
| RIR-EBT-GRC | | 2023.11 | 97.1 |
| OM | | 2023.11 | 76.9 |
| RIR-EBT-GRC (-Beam Align) | Alignment=Beam Align | 2023.11 | 68.8 |
| RIR-GRC | | 2023.11 | 32.3 |
| BBT GRC | | 2023.11 | 31.5 |
| MEGA | | 2023.11 | 24.73 |
| S4D | | 2023.11 | 14.7 |