Visual Abductive Reasoning on YouCookII (test)
[Chart: best BLEU@4 over time. Top result: AbductiveMLLM, 6.16 BLEU@4 (Jan 6, 2026). Selectable metrics: BLEU@4, METEOR, ROUGE, CIDEr, BERT-S.]
Updated 4d ago
Evaluation Results
| Method | Details | Date | BLEU@4 | METEOR | ROUGE | CIDEr | BERT-S |
|---|---|---|---|---|---|---|---|
| AbductiveMLLM | Category=MLLM, Mode=Fi... | 2026.01 | 6.16 | 13.46 | 30.06 | 77.7 | 30.77 |
| Qwen2VL-7B^FT | Category=MLLM, Mode=Fi... | 2026.01 | 5.66 | 12.62 | 28.64 | 68.44 | 29.09 |
| REASONER | Category=Traditional M... | 2026.01 | 3.54 | 9.47 | 24.62 | 32.99 | 23.19 |
| Qwen2VL-7B | Category=MLLM, Mode=Ze... | 2026.01 | 2.46 | 8.41 | 22.1 | 35.83 | 21.83 |
| VideoChat2-7B | Category=MLLM, Mode=Ze... | 2026.01 | 0.49 | 4.59 | 14.31 | 17.82 | 7.96 |
| GPT-4o-mini | Category=MLLM, Mode=Ze... | 2026.01 | 0.45 | 4.22 | 12.78 | 14.15 | 9.75 |