Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Semantic Similarity on Simulated Music Recommendation Conversations
Loading...
0.9676
BertScore F1
MuseChat
0.944408
0.950429
0.95645
0.962471
Oct 10, 2023
BertScore F1
AB Divergence
L2 Distance
Fisher-Rao Distance
Updated 4d ago
Evaluation Results
Method
Method
Links
BertScore F1
AB Divergence
L2 Distance
Fisher-Rao Distance
MuseChat
Input Modality=Music T...
2023.10
0.9676
1.51
0.208
1.47
Vicuna w/ Music
Input Modality=Music E...
2023.10
0.9526
2.68
0.279
2.02
Vicuna-7B
Input Modality=Music T...
2023.10
0.9453
3.93
0.382
2.11
Feedback
Search any
task
Search any
task