Navigation on BBH Navigation (test)
Best result: DLN-2, 83.1 accuracy (Jun 21, 2023)
Evaluation Results
Method    LLM     Date      Accuracy
DLN-2     GPT-3   2023.06   83.1
CoT       GPT-3   2023.06   69.3
DLN-1     GPT-3   2023.06   68.5
APE       GPT-3   2023.06   67.3
0-shot    GPT-3   2023.06   64.1
APE-400   GPT-3   2023.06   56.9