Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
LTL Instruction Following on ZoneEnv Finite-horizon
Loading...
97
Success Rate (SR)
DEEPLTL
11.72
33.86
56
78.14
Feb 6, 2026
Success Rate (SR)
Average States Traversed (µstates)
Updated 3d ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Average States Traversed (µstates)
DEEPLTL
Task Formula=((green ∨...
2026.02
97
-
SEMLTL
Task Formula=((green ∨...
2026.02
96
4
DEEPLTL
Task Formula=F(pink ∨...
2026.02
96
-
SEMLTL
Task Formula=F(blue ∧...
2026.02
95
3.1
DEEPLTL
Task Formula=F(blue ∧...
2026.02
94
-
DEEPLTL
Task Formula=(F red) ∧...
2026.02
93
-
SEMLTL
Task Formula=(F red) ∧...
2026.02
93
3.19
DEEPLTL
Task Formula=¬gray U (...
2026.02
91
-
SEMLTL
Task Formula=¬gray U (...
2026.02
91
4
SEMLTL
Task Formula=F(pink ∨...
2026.02
90
2.11
DEEPLTL
Task Formula=¬(purple...
2026.02
85
-
SEMLTL
Task Formula=¬(purple...
2026.02
85
4.01
LTL2ACTION
Task Formula=((green ∨...
2026.02
85
-
LTL2ACTION
Task Formula=F(pink ∨...
2026.02
61
-
LTL2ACTION
Task Formula=¬gray U (...
2026.02
56
-
LTL2ACTION
Task Formula=F(blue ∧...
2026.02
53
-
LTL2ACTION
Task Formula=¬(purple...
2026.02
38
-
LTL2ACTION
Task Formula=(F red) ∧...
2026.02
15
-
Feedback
Search any
task
Search any
task