Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Utility on MMLU
Loading...
75.78
Accuracy
Qwen (Base)
29.604
41.592
53.58
65.568
May 27, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Qwen (Base)
LLM=Qwen, SFT Status=None
2025.05
75.78
Qwen (SFT_DB)
LLM=Qwen, SFT Status=S...
2025.05
60.52
Qwen (SFT_OG)
LLM=Qwen, SFT Status=S...
2025.05
55.73
Mixtral (Base)
LLM=Mixtral, SFT Statu...
2025.05
35.42
Mixtral (SFT_DB)
LLM=Mixtral, SFT Statu...
2025.05
34.51
SFT_DB
Training=Supervised Fi...
2025.05
34.51
DPO_Whisperer
Alignment=DPO, Data Re...
2025.05
33.07
Mixtral (SFT_OG)
LLM=Mixtral, SFT Statu...
2025.05
31.38
Feedback
Search any
task
Search any
task