Sarcasm Understanding on WITS
[Chart: ROUGE-1 by date. Top result: MuVaC, 44.7 (Jan 28, 2026). Selectable metrics: ROUGE-1, BLEU-1, METEOR, BERTScore.]
Evaluation Results
Method         Fine-tuned  Date     ROUGE-1  BLEU-1  METEOR  BERTScore
MuVaC          -           2026.01  44.7     41.1    39.4    79.3
Qwen 2.5 VL    true        2026.01  24.7     26      23.2    77.9
Qwen 2.5       false       2026.01  23.9     23.3    19.2    75.9
Qwen 2.5 VL    false       2026.01  21.8     20.7    18.5    74
Llama 3.1      false       2026.01  20.5     18.8    18.3    72.9
MiniCPM-V 2.6  false       2026.01  20.4     17.9    18.8    71.8
Kimi VL        false       2026.01  20.1     16.8    18      70.5