Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Multi-turn Dialogue Evaluation on MT-Bench Turn-2
Loading...
1.9
Writing Score
LoRA
1.354
1.49575
1.6375
1.77925
Feb 23, 2024
Writing Score
Roleplay Score
Reasoning Score
Math Score
Coding Score
Extraction Score
Stem Score
Humanities Score
Average Score
Updated 4d ago
Evaluation Results
Method
Method
Links
Writing Score
Roleplay Score
Reasoning Score
Math Score
Coding Score
Extraction Score
Stem Score
Humanities Score
Average Score
LoRA
Backbone=LLaMA-2, # Pa...
2024.02
1.9
5.8
2.1
1.6
2.55
1.222
3.1
5.5
2.994
FT
Backbone=LLaMA-2, # Pa...
2024.02
1.667
5.938
2.222
1.7
2
2.111
3.2
5.3
3.021
RED
Backbone=LLaMA-2, # Pa...
2024.02
1.375
5.5
2.444
1.444
2.125
1.75
3
5.875
2.946
Feedback
Search any
task
Search any
task