Logical Expression Evaluation on ListOps-O Length Generalization (Lengths 500-600)
Top result: EBT-GRC, 99.4 accuracy (Nov 8, 2023)
Evaluation Results
| Method | Details | Date | Accuracy |
|---|---|---|---|
| EBT-GRC | | 2023.11 | 99.4 |
| BT-GRC OS | | 2023.11 | 99 |
| RIR-EBT-GRC (-S4D) | Backbone=S4D | 2023.11 | 98.87 |
| CRVNN | | 2023.11 | 98.5 |
| RIR-EBT-GRC | | 2023.11 | 98.25 |
| OM | | 2023.11 | 92.7 |
| RIR-EBT-GRC (-Beam Align) | Alignment=Beam Align | 2023.11 | 79.05 |
| BBT GRC | | 2023.11 | 40.4 |
| RIR-GRC | | 2023.11 | 35.55 |
| MEGA | | 2023.11 | 31.71 |
| S4D | | 2023.11 | 20.85 |