Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Navigation Execution on Map2Seq (TestSetA)
Loading...
56.8
NE (Count)
GROKE
48.712
103.306
157.9
212.494
Jan 12, 2026
NE (Count)
SR (%)
OSR (%)
SDTW (Score)
Updated 4d ago
Evaluation Results
Method
Method
Links
NE (Count)
SR (%)
OSR (%)
SDTW (Score)
GROKE
2026.01
56.8
66.4
78.4
0.634
Heuristic Agent
2026.01
180.6
18
18.9
0.155
Action Sampling
2026.01
250.1
5.1
6
0.037
Random Walker
2026.01
259
4.4
5.7
0.026
Feedback
Search any
task
Search any
task