LTL Instruction Following on Letter Infinite-horizon (full)
Top result: SEMLTL, 7.13 µAcc
[Chart: µAcc and µStates; date shown: Feb 6, 2026]
Evaluation Results
| Method | Date | µAcc | µStates |
| SEMLTL (Task Specification=SMA...) | 2026.02 | 7.13 | 4.76 |
| DEEPLTL (Task Specification=SMA...) | 2026.02 | 6.15 | - |
| SEMLTL (Task Specification=ALW...) | 2026.02 | 5.6 | 4.81 |
| DEEPLTL (Task Specification=ALW...) | 2026.02 | 5.28 | - |
| SEMLTL (Task Specification=COM...) | 2026.02 | 4.9 | 4.38 |
| DEEPLTL (Task Specification=COM...) | 2026.02 | 4.46 | - |
| SEMLTL (Task Specification=ALW...) | 2026.02 | 3.06 | 7.06 |
| SEMLTL (Task Specification=ALW...) | 2026.02 | 2.46 | 8.09 |
| SEMLTL (Task Specification=COM...) | 2026.02 | 1.83 | 6.44 |
| DEEPLTL (Task Specification=COM...) | 2026.02 | 1.78 | - |