General Performance on Aggregated LLM Evaluation Suite
[Chart: Average Score by date. Top result: BTX, 47.9 (Mar 12, 2024).]
Evaluation Results
Method            Setting                      Date     Average Score
BTX               Active Experts (Top-k)=2     2024.03  47.9
BTX               Active Experts (Top-k)...    2024.03  47.3
Sparse upcycling  Training Context=Data-...    2024.03  46.3
LLAMA-2           Model Size=13B               2024.03  45.4
Dense             Training Context=Data-...    2024.03  44.5
BTM               Active Experts (Top-k)=2     2024.03  43.4
BTM               Active Experts (Top-k)=1     2024.03  43.1
LLAMA-2           Model Size=7B                2024.03  40.7
CODELLAMA         Model Size=7B                2024.03  37.9
LLEMMA            Model Size=7B                2024.03  32.1