Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Sarcasm Explanation on WITS
Loading...
3.18
Human Evaluation Score
Qwen2.5VL
2.4832
2.6641
2.845
3.0259
Jan 28, 2026
Human Evaluation Score
LLM Judge Score
Updated 1mo ago
Evaluation Results
Method
Method
Links
Human Evaluation Score
LLM Judge Score
Qwen2.5VL
2026.01
3.18
2.43
MuVaC
2026.01
2.82
3.36
EDGE
2026.01
2.76
3.36
KimiVL
2026.01
2.57
1.97
MAF
2026.01
2.54
3.28
MOSES
2026.01
2.51
3.24
Feedback
Search any
task
Search any
task