Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VIDEO

Benchmarks

Task NameDataset NameSOTA ResultTrend
Membership InferenceVideo modality
AUC-ROC100
16
Video AnalyticsVideo
Cost per 1K Requests0.49
15
Neural Video RepresentationVideo per-frame
GFLOPs64.92
12
Video reasoningVideo-R1
VSI44.3
12
Video Super-Resolution30-frame 2K Video (test)
Inference Time (min)0.77
8
Sequential RecommendationVideo (test)
NDCG@106.436
8
Video Super-Resolutionvideo 1920x1080 (21-frame sequence)
Step Count50
8
Sequential RecommendationVideo
NDCG@52.17
8
Future item recommendationVideo
Recall11.3
7
Video Semantic Segmentation1024 x 512 resolution (video)
Speed (FPS)18.15
6
Point Tracking24-frame video
Throughput23,405.71
5
Session-based recommendationVIDEO
Recall@2066.24
5
User Cold-start RecommendationVideo
Recall@209.22
4
Visual DubbingVideo 3-second 25fps 512x512 resolution
Inference Time (s)1
4
Object Detectionvideo (train)
Accuracy93
4
RecommendationVideo
Processing Time (sec)10
3
Misalignment reductionVideo #5
ITF (dB)21.66
3
Misalignment reductionVideo #4
ITF (dB)19.26
3
Misalignment reductionVideo #3
ITF (dB)22.26
3
Misalignment reductionVideo 1
ITF (dB)17.54
3
Multi-object Backdoor AttackVideo 9
ASR1
3
Multi-object Backdoor AttackVideo 8
ASR100
3
Multi-object Backdoor AttackVideo 7
ASR99.97
3
Segment Anything14 new Video
mIoU (1-click)69.6
3
Automatic Speech RecognitionVideo Average of 7 subsets
WER0.027
3
Showing 25 of 26 rows