Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

CLEVRER

Benchmarks

Task NameDataset NameSOTA ResultTrend
Rule-level anomaly detectionCLEVRER
AUROC0.844
15
Temporal Jigsaw Puzzle SolvingCLEVRER
Normalized Kendall Distance0
13
Temporal and causal video reasoningCLEVRER-Humans (test)
Accuracy (Per Option)74.1
12
Counterfactual PredictionCLEVRER Hypothesis
CF-Acc81
9
Visual Question AnsweringCLEVRER 1.0 (test)
Descriptive Accuracy0.94
8
Video Question AnsweringCLEVRER (test)
Descriptive Accuracy96.46
7
SegmentationCLEVRER (Blender engine) zero-shot
Segmentation Map IoU (First Frame)67
6
Optical FlowCLEVRER Full Sequence Blender (test)
Optical Flow EPE5.43
6
Optical FlowCLEVRER First Frame Blender (test)
Optical Flow EPE2.79
6
Object SegmentationCLEVRER Full Sequence Blender (test)
Segmentation Map IoU30
6
Object SegmentationCLEVRER First Frame Blender (test)
Segmentation Map IoU67
6
Video GenerationCLEVRER 256x256 (test)
FVD87.4
6
Predictive Video ReasoningCLEVRER (val)
Accuracy87.5
5
Counterfactual Video ReasoningCLEVRER (val)
Accuracy86.69
5
Explanatory Video ReasoningCLEVRER (val)
Accuracy99.94
5
Physical ReasoningCLEVRER-LLMPhy
mIoU97.2
5
Collision CountingCLEVRER T3 (val)
Accuracy77.84
4
Collision Event DetectionCLEVRER T2 (val)
Accuracy74.95
4
Collision ClassificationCLEVRER T1 (val)
Accuracy93.84
4
Controllable Video GenerationCLEVRER (test)
SSIM92.52
4
Video ReasoningCLEVRER
Accuracy78.5
4
Descriptive Video ReasoningCLEVRER (val)
Accuracy97.99
3
Visual Question AnsweringCLEVRER (test val)
Accuracy (per option)98.5
2
Showing 23 of 23 rows