Instruction Tuning on Alpaca 52K (test)
[Chart: BBH score. Highlighted point: DQ (2%), 32.9, Mar 17, 2025. Selectable metrics: BBH, DROP, MMLU, HumanEval, Average Score.]
Updated 4d ago
Evaluation Results
| Method | Pruned % | Date | BBH | DROP | MMLU | HumanEval | Average Score |
|---|---|---|---|---|---|---|---|
| DQ (2%) | - | 2025.03 | 32.9 | 27.6 | 36.6 | 8.5 | 26.3 |
| SeTa | 25.9 | 2025.03 | 32.4 | 27.4 | 35.8 | 13.4 | 27.3 |
| SeTa | 40.3 | 2025.03 | 32.3 | 27 | 35.4 | 11.6 | 26.6 |
| SeTa | 52.5 | 2025.03 | 32.3 | 27.3 | 34.8 | 9.8 | 26.1 |
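The page does not say how Average Score is derived. The sketch below is a quick check under one assumption: that it is the unweighted mean of the four benchmark scores (BBH, DROP, MMLU, HumanEval). The names `rows` and `mean_gap` are illustrative only and do not come from the page; the numbers are copied from the results above.

```python
# Assumption: Average Score = unweighted mean of BBH, DROP, MMLU, HumanEval.
# Each entry maps a method label to ([BBH, DROP, MMLU, HumanEval], listed average).
rows = {
    "DQ (2%)": ([32.9, 27.6, 36.6, 8.5], 26.3),
    "SeTa (pruned 25.9%)": ([32.4, 27.4, 35.8, 13.4], 27.3),
    "SeTa (pruned 40.3%)": ([32.3, 27.0, 35.4, 11.6], 26.6),
    "SeTa (pruned 52.5%)": ([32.3, 27.3, 34.8, 9.8], 26.1),
}


def mean_gap(scores, reported):
    """Return (unweighted mean, mean minus the listed average)."""
    mean = sum(scores) / len(scores)
    return mean, mean - reported


# Recompute the mean for every row and compare it with the listed value.
gaps = {name: mean_gap(scores, reported) for name, (scores, reported) in rows.items()}

for name, (mean, gap) in gaps.items():
    print(f"{name}: mean={mean:.3f} gap={gap:+.3f}")
```

Under this assumption, every recomputed mean lies within about 0.1 of the listed Average Score. The largest gap is on the DQ (2%) row, and it could come from rounding of the per-benchmark scores before display.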