General Performance on Aggregated LLM Evaluation Suite
Top result: BTX, Average Score 47.9 (Mar 12, 2024)
Evaluation Results
Method            Configuration              Date     Average Score
BTX               Active Experts (Top-k)=2   2024.03  47.9
BTX               Active Experts (Top-k)...  2024.03  47.3
Sparse upcycling  Training Context=Data-...  2024.03  46.3
LLAMA-2           Model Size=13B             2024.03  45.4
Dense             Training Context=Data-...  2024.03  44.5
BTM               Active Experts (Top-k)=2   2024.03  43.4
BTM               Active Experts (Top-k)=1   2024.03  43.1
LLAMA-2           Model Size=7B              2024.03  40.7
CODELLAMA         Model Size=7B              2024.03  37.9
LLEMMA            Model Size=7B              2024.03  32.1