Verb Prediction on VidSitu (val)
[Chart: Top-1 Verb Accuracy over time. Best result: 56.15 (HostSG). Date shown on axis: Apr 25, 2026. Metrics available: Top-1 Verb Accuracy, Top-5 Verb Accuracy.]
Updated 1mo ago
Evaluation Results
| Method                 | Backbone | Date    | Top-1 Verb Accuracy | Top-5 Verb Accuracy |
|------------------------|----------|---------|---------------------|---------------------|
| HostSG                 | M-TDE    | 2026.04 | 56.15               | 86.33               |
| OME+OIE                | I3D      | 2026.04 | 53.36               | 83.94               |
| CineMEC (old backbone) | FR + SF  | 2026.04 | 49.86               | 78.36               |
| CineMEC                | YS + SF  | 2026.04 | 49.32               | 79.83               |
| TypesDev-ucofia        | BLIP2    | 2026.04 | 47.23               | -                   |
| VideoWhisperer         | YS + SF  | 2026.04 | 46.02               | 76.49               |
| VideoWhisperer         | FR + SF  | 2026.04 | 45.06               | 75.59               |
| VidSitu-SlowFast       | SF       | 2026.04 | 32.64               | 69.2                |