Plan-Grounded Answer Generation on InstructionVidDial (test)
[Chart: ROUGE-L over time. Top result: VIGiA, 75.3 ROUGE-L (Feb 22, 2026). Metrics available: ROUGE-L, METEOR, BERTScore.]
Evaluation Results
Method         Date      ROUGE-L   METEOR   BERTScore
VIGiA          2026.02   75.3      76.67    88.72
MM-PlanLLM     2026.02   58.85     59.34    80.03
Qwen 3 VL      2026.02   44.71     46.31    71.94
LLaVA-1.5      2026.02   42.47     40.84    70.55
InternVL 3.5   2026.02   41.62     46.2     69.22
LLaVA-OV       2026.02   41.18     40.85    70.7
Idefics2       2026.02   37.16     45.09    67.55
Qwen 2.5 VL    2026.02   31.61     41.51    63.91