General Performance on Aggregated LLM Evaluation Suite

47.9 Average Score

BTX

Updated 4mo ago

Evaluation Results

Method	Links	Average Score
BTX 2024.03		47.9
BTX 2024.03		47.3
Sparse upcycling 2024.03		46.3
LLAMA-2 2024.03		45.4
Dense 2024.03		44.5
BTM 2024.03		43.4
BTM 2024.03		43.1
LLAMA-2 2024.03		40.7
CODELLAMA 2024.03		37.9
LLEMMA 2024.03		32.1