General Language Proficiency on Aggregated (GSM8K, TruthfulQA, TriviaQA, CNN/DM, MMLU)
Best Average Score: 48.6 (LoRA Tuning)
[Chart: Average Score by date; date shown: Jun 17, 2024]
Evaluation Results
Method            Details                    Date     Average Score
LoRA Tuning       Model Size=13B, Transf...  2024.06  48.6
Full Fine-tuning  Model Size=13B, Transf...  2024.06  47.79
Ours              Model Size=13B, Transf...  2024.06  46.09
Proxy Tuning      Model Size=13B, Transf...  2024.06  44.43
Full Fine-tuning  Model Size=13B, Transf...  2024.06  44.13
Ours              Model Size=13B, Transf...  2024.06  34.86
Proxy Tuning      Model Size=13B, Transf...  2024.06  31.74
Base Model        Model Size=13B, Transf...  2024.06  29.93
Full Fine-tuning  Model Size=13B, Transf...  2024.06  25.33