Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Tasks on MMLU-redux
Loading...
86.8
Accuracy
Qwen2.5-72B
81.392
82.796
84.2
85.604
Feb 19, 2025
Accuracy
Updated 1mo ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen2.5-72B
Model Type=Instruct, P...
2025.02
86.8
Llama-3.1-405B
Model Type=Instruct, P...
2025.02
86.2
Qwen2.5-VL-72B
Model Type=Instruct, P...
2025.02
85.9
Llama-3.1-70B
Model Type=Instruct, P...
2025.02
83
Qwen2-72B
Model Type=Instruct, P...
2025.02
81.6
Feedback
Search any
task
Search any
task