Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Navigation Reasoning on BBH-Navigate (test)
Loading...
98
Accuracy
UPA
89.368
91.609
93.85
96.091
Jan 30, 2026
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
UPA
Venue=-, Execution Eng...
2026.01
98
PromptBreeder
Venue=ICML 24, Executi...
2026.01
96.3
SPO
Venue=EMNLP 26, Execut...
2026.01
96.3
OPRO
Venue=ICLR 24, Executi...
2026.01
95.8
PromptAgent
Venue=ICLR 24, Executi...
2026.01
95.7
RaR
Venue=arXiv 23, Execut...
2026.01
93.5
Step-Back
Venue=ICLR 24, Executi...
2026.01
93.5
APE
Venue=ICLR 23, Executi...
2026.01
92.5
IO
Venue=-, Execution Eng...
2026.01
91.3
TextGrad
Venue=Nature 25, Execu...
2026.01
91.3
CoT
Venue=NeurIPS 22, Exec...
2026.01
89.7
Feedback
Search any
task
Search any
task