Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Language Understanding on MMLU v1 (test)
Loading...
72.4
Accuracy
GLM-4-9B
54.304
59.002
63.7
68.398
Sep 18, 2023
Nov 7, 2023
Dec 27, 2023
Feb 15, 2024
Apr 5, 2024
May 25, 2024
Jul 15, 2024
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
GLM-4-9B
Instruction-tuned=true
2024.07
72.4
Qwen2-7B
Instruction-tuned=true
2024.07
70.5
Yi-1.5-9B
Instruction-tuned=true
2024.07
69.5
Llama-3-8B
Instruction-tuned=true
2024.07
68.4
LLaVA-70B
Multimodal-Language Da...
2023.09
65.1
LLaMA-2-70B-Chat
2023.09
63.1
LLaVA-65B
Multimodal-Language Da...
2023.09
62.6
Vicuna-65B
2023.09
62.5
LLaVA-65B
Multimodal-Language Da...
2023.09
62.2
Qwen1.5-7B
Instruction-tuned=true
2024.07
59.5
Vicuna-33B
2023.09
59
LLaVA-33B
Multimodal-Language Da...
2023.09
58.6
LLaVA-33B
Multimodal-Language Da...
2023.09
56.1
Vicuna-13B
2023.09
55.8
LLaVA-13B
Multimodal-Language Da...
2023.09
55
Feedback
Search any
task
Search any
task