
V*

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Visual Question Answering | V* | Accuracy: 74.35 | 45 |
| High-resolution perception | V* | Overall Score: 89.53 | 26 |
| Visual Reasoning | V* | Overall Score: 95.7 | 22 |
| Visual Perception and Reasoning | V* | Overall Accuracy: 90.1 | 18 |
| Visual Reasoning | V* | Accuracy: 90.2 | 18 |
| Vision-Intensive Perception | V* Benchmark | Attr Score: 84.4 | 18 |
| Semantic Segmentation | V20 | mIoU: 83.8 | 15 |
| Visual Reasoning | V* cross-domain (test) | Accuracy: 79.06 | 15 |
| High-resolution Visual Search | V* | Top-1 Accuracy: 86.91 | 13 |
| Fine-grained visual reasoning | V* | Avg@8 Overall: 89.5 | 13 |
| Visual Grounding | V* Relative Position 52 | Accuracy: 89.47 | 13 |
| Visual Grounding | V* Direct Attributes 52 | Accuracy: 90.43 | 13 |
| High-resolution Multi-modal Understanding | V* | Accuracy: 80.23 | 13 |
| Fine-grained Perception | V* | Accuracy: 78.8 | 13 |
| Visual Perception | V* | Score: 89 | 12 |
| Visual Search | V* | Average Success: 90.6 | 11 |
| Visual Reasoning | V* (test) | Overall Score: 92.2 | 11 |
| Perception | V* (test) | Accuracy: 86.9 | 11 |
| Visual Search | V* bench (test) | Attribute Rate: 87 | 10 |
| Causal Discovery | V | Structural F1: 80 | 9 |
| Fine-grained Visual Reasoning | V* | Accuracy: 89 | 8 |
| Multimodal Multi-choice | V* | Accuracy: 84.3 | 8 |
| Visual Search and Comprehension | V* | Accuracy: 89.8 | 8 |
| Delay Identification | V | Precision of Delay (POD): 100 | 7 |
| Multimodal reasoning | V* | Pass@1: 89.5 | 7 |
Showing 25 of 31 rows