Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

VBVR-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Video ReasoningVBVR-Bench Out-of-Domain
Average Score61
39
Video ReasoningVBVR-Bench In-Domain
Average Score96
35
Video ReasoningVBVR-Bench
Overall Accuracy97.4
18
Video ReasoningVBVR-Bench Overall
Overall Score78.1
17
Showing 4 of 4 rows