Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VidSitu

Benchmarks

Task NameDataset NameSOTA ResultTrend
Semantic Role PredictionVidSitu (test)
CIDEr84.85
17
Event relation predictionVidSitu
Mean Accuracy35.32
12
Video Situation RecognitionVidSitu
CIDEr76.24
9
Semantic Role LabelingVidSitu (val)
CIDEr90.12
9
Verb PredictionVidSitu (val)
Top-1 Verb Accuracy56.15
8
Verb predictionVidSitu (test)
Accuracy@144.67
8
Semantic Role LabelingVidSitu (test)
CIDEr83.68
5
LocalizationVidSitu (val)
IoU @ 0.370.33
5
Video Semantic Role LabelingVidSitu
CIDEr73.71
5
Semantic Role Labeling CaptioningVidSitu
CIDEr76.34
5
LocalizationVidSitu (test)
IoU@0.359.64
3
Multimodal Event ExtractionVidSitu Aud
ET24.2
3
Video TrackingVidSitu
V-Trck23.2
3
Event RelationVidSitu
ER14.5
3
Event TypingVidSitu
ET22.3
3
Video TrackingVidSitu Txt
V-Trck34.4
3
Event RelationVidSitu Txt
Event Relation (ER)23.1
3
Event TypingVidSitu-Txt
ET Score32.8
3
Grounded Video Situation RecognitionVidSitu v1 (val)
Verb Accuracy@146.79
3
Grounded Video Situation RecognitionVidSitu (test)
Verb Acc@146.79
3
Showing 20 of 20 rows