Sarcasm Understanding on MuSTARD
[Chart: best Precision is 86.8 (MuVaC, Jan 28, 2026). Metrics: Precision, Recall, F1 Score.]
Evaluation Results
| Method | Date | Precision | Recall | F1 Score |
|---|---|---|---|---|
| MuVaC | 2026.01 | 86.8 | 89.2 | 88 |
| Qwen 2.5 VL (Fine-tuned=true) | 2026.01 | 67.5 | 70.3 | 68.9 |
| Qwen 2.5 VL (Fine-tuned=false) | 2026.01 | 61.9 | 41.4 | 49.6 |
| Kimi VL (Fine-tuned=false) | 2026.01 | 61.5 | 16.2 | 25.7 |
| MiniCPM-V 2.6 (Fine-tuned=false) | 2026.01 | 57.2 | 62.6 | 59.8 |
| Qwen 2.5 (Fine-tuned=false) | 2026.01 | 55.9 | 79.1 | 65.5 |
| Llama 3.1 (Fine-tuned=false) | 2026.01 | 53.8 | 83.1 | 65.3 |
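The F1 Score column can be cross-checked against Precision and Recall. The sketch below assumes F1 is the standard harmonic mean of the two reported values; the page does not state its averaging scheme, so small differences from rounding are expected. The helper name `f1_score` and the `rows` list are illustrative, with the numbers copied from the results listed above.

```python
def f1_score(precision: float, recall: float) -> float:
    """Harmonic mean of precision and recall (both on the same scale)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# (method, precision, recall, reported F1), copied from the leaderboard.
rows = [
    ("MuVaC", 86.8, 89.2, 88.0),
    ("Qwen 2.5 VL (Fine-tuned=true)", 67.5, 70.3, 68.9),
    ("Qwen 2.5 VL (Fine-tuned=false)", 61.9, 41.4, 49.6),
    ("Kimi VL (Fine-tuned=false)", 61.5, 16.2, 25.7),
    ("MiniCPM-V 2.6 (Fine-tuned=false)", 57.2, 62.6, 59.8),
    ("Qwen 2.5 (Fine-tuned=false)", 55.9, 79.1, 65.5),
    ("Llama 3.1 (Fine-tuned=false)", 53.8, 83.1, 65.3),
]

# Print the recomputed F1 next to the reported value for each method.
for method, p, r, reported in rows:
    computed = f1_score(p, r)
    print(f"{method:34s} computed={computed:.2f} reported={reported}")
```

Under this assumption every recomputed value lands within 0.1 of the reported F1 Score, which is consistent with the inputs being rounded to one decimal place.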