Asynchronous Planning on NL (test)
Best accuracy: 78.2, Graph (40 steps) + NL (40 steps)
[Chart: Accuracy by date; date shown: Feb 3, 2026]
Updated 4d ago
Evaluation Results
Method                            Configuration              Links    Accuracy
Graph (40 steps) + NL (40 steps)  Model=Qwen 1.5B, Stage...  2026.02  78.2
GPT-4o (zero-shot)                Mode=zero-shot             2026.02  78.2
NL only (80 steps)                Model=Qwen 1.5B, Total...  2026.02  69.8
7B (NL 40 steps)                  Model=Qwen 7B, Trainin...  2026.02  69.8
3B (NL 40 steps)                  Model=Qwen 3B, Trainin...  2026.02  47.1
GPT-4o-mini (zero-shot)           Mode=zero-shot             2026.02  44
NL (40 steps) + Graph (40 steps)  Model=Qwen 1.5B, Stage...  2026.02  43.1