Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LTL Instruction Following on ZoneEnv Finite-horizon
Loading...
97
Success Rate (SR)
DEEPLTL
11.72
33.86
56
78.14
Feb 6, 2026
Success Rate (SR)
Average States Traversed (µstates)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Success Rate (SR)
Average States Traversed (µstates)
DEEPLTL
Task Formula=((green ∨...
2026.02
97
-
SEMLTL
Task Formula=((green ∨...
2026.02
96
4
DEEPLTL
Task Formula=F(pink ∨...
2026.02
96
-
SEMLTL
Task Formula=F(blue ∧...
2026.02
95
3.1
DEEPLTL
Task Formula=F(blue ∧...
2026.02
94
-
DEEPLTL
Task Formula=(F red) ∧...
2026.02
93
-
SEMLTL
Task Formula=(F red) ∧...
2026.02
93
3.19
DEEPLTL
Task Formula=¬gray U (...
2026.02
91
-
SEMLTL
Task Formula=¬gray U (...
2026.02
91
4
SEMLTL
Task Formula=F(pink ∨...
2026.02
90
2.11
DEEPLTL
Task Formula=¬(purple...
2026.02
85
-
SEMLTL
Task Formula=¬(purple...
2026.02
85
4.01
LTL2ACTION
Task Formula=((green ∨...
2026.02
85
-
LTL2ACTION
Task Formula=F(pink ∨...
2026.02
61
-
LTL2ACTION
Task Formula=¬gray U (...
2026.02
56
-
LTL2ACTION
Task Formula=F(blue ∧...
2026.02
53
-
LTL2ACTION
Task Formula=¬(purple...
2026.02
38
-
LTL2ACTION
Task Formula=(F red) ∧...
2026.02
15
-
Feedback
Search any
task
Search any
task