Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Instruction Following on Vicuna
Loading...
58.2
Score
Ours 12
11.608
23.704
35.8
47.896
Oct 5, 2023
Jan 1, 2024
Mar 30, 2024
Jun 27, 2024
Sep 24, 2024
Dec 22, 2024
Mar 21, 2025
Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Score
Ours 12
Distillation Method=Ra...
2025.03
58.2
FullKD
Distillation Method=Fu...
2025.03
56.9
Top-K 50
Distillation Method=To...
2025.03
53.1
CE
Distillation Method=No...
2025.03
49.1
Top-K 12
Distillation Method=To...
2025.03
48.9
LumiNet
Model Architecture=GPT...
2023.10
17.5
LumiNet
Model Architecture=GPT...
2023.10
17.1
SeqKD
Model Architecture=GPT...
2023.10
16.9
KD
Model Architecture=GPT...
2023.10
16.9
Teacher
Model Architecture=Tea...
2023.10
16.3
SFT w/o KD
Model Architecture=GPT...
2023.10
16.1
SFT w/o KD
Model Architecture=GPT...
2023.10
16
SeqKD
Model Architecture=GPT...
2023.10
15.9
KD
Model Architecture=GPT...
2023.10
15.4
LumiNet
Model Architecture=GPT...
2023.10
14.9
SFT w/o KD
Model Architecture=GPT...
2023.10
14.7
SeqKD
Model Architecture=GPT...
2023.10
14.3
KD
Model Architecture=GPT...
2023.10
13.4
Feedback
Search any
task
Search any
task