Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Natural Language Understanding on BIG-Bench Hard (BBH)
Loading...
42.1
Accuracy
Arcana
34.404
36.402
38.4
40.398
Oct 17, 2024
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Arcana
zero-shot=true
2024.10
42.1
Vicuna-v1.5
zero-shot=true
2024.10
41.2
LLaMA-2
zero-shot=true
2024.10
38.2
LLaMA-2-Chat
zero-shot=true
2024.10
35.6
WizardLM
zero-shot=true
2024.10
34.7
Feedback
Search any
task
Search any
task