
V*

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| High-resolution perception | V* | Overall Score 89.53 | 55 |
| Visual Search | V* benchmark | Overall Success Rate 91.1 | 54 |
| Visual Reasoning | V* | Accuracy 92.7 | 52 |
| Visual Question Answering | V* | Accuracy 74.35 | 45 |
| Visual Perception | V* | Score 89 | 42 |
| Visual Perception and Reasoning | V* | Overall Accuracy 90.1 | 36 |
| Visual Grounding | V* | Accuracy 83.77 | 29 |
| Visual Search | V* | Accuracy 90.1 | 28 |
| Visual Reasoning | V* | Overall Score 95.7 | 22 |
| Visual Perception | V* v1.0 (test) | Score 84.35 | 20 |
| Fine-grained VQA | V* | Accuracy 93.2 | 18 |
| Vision-Intensive Perception | V* Benchmark | Attr Score 84.4 | 18 |
| Reasoning | V* | Pass@4 97.9 | 16 |
| Perception | V* | Pass@1 90.2 | 16 |
| Multimodal Reasoning | V* | Accuracy 87 | 16 |
| Pixel-centric Understanding | V* | Score 72.7 | 15 |
| Semantic Segmentation | V20 | mIoU 83.8 | 15 |
| Visual Reasoning | V* cross-domain (test) | Accuracy 79.06 | 15 |
| Fine-grained visual search | V* | Overall Score 91.1 | 14 |
| Perception | V* | Overall Score 95.7 | 13 |
| High-resolution Visual Search | V* | Top-1 Accuracy 86.91 | 13 |
| Fine-grained visual reasoning | V* | Avg@8 Overall 89.5 | 13 |
| Visual Grounding | V* Relative Position 52 | Accuracy 89.47 | 13 |
| Visual Grounding | V* Direct Attributes 52 | Accuracy 90.43 | 13 |
| High-resolution Multi-modal Understanding | V* | Accuracy 80.23 | 13 |
Showing 25 of 53 rows