Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Logical Expression Evaluation on ListOps-O Length Generalization (Lengths 200-300)
Loading...
99.9
Accuracy
EBT-GRC
28.244
46.847
65.45
84.053
Nov 8, 2023
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
EBT-GRC
2023.11
99.9
OM
2023.11
99.6
CRVNN
2023.11
99.5
BT-GRC OS
2023.11
99.5
RIR-EBT-GRC
2023.11
99.15
RIR-EBT-GRC (-S4D)
Backbone=S4D
2023.11
99.15
RIR-EBT-GRC (-Beam Align)
Alignment=Beam Align
2023.11
91.75
MEGA
2023.11
45.21
BBT GRC
2023.11
43.6
RIR-GRC
2023.11
41.75
S4D
2023.11
31
Feedback
Search any
task
Search any
task