Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

V*

Benchmarks

Task NameDataset NameSOTA ResultTrend
High-resolution perceptionV*
Overall Score89.53
20
Vision-Intensive PerceptionV* Benchmark
Attr Score84.4
18
Semantic SegmentationV20
mIoU83.8
15
Visual ReasoningV* cross-domain (test)
Accuracy79.06
15
Visual ReasoningV*
Accuracy81.15
14
Fine-grained PerceptionV*
Accuracy78.8
13
Visual PerceptionV*
Score89
12
Visual SearchV*
Average Success90.6
11
Visual ReasoningV* (test)
Overall Score92.2
11
PerceptionV* (test)
Accuracy86.9
11
Visual ReasoningV*
Overall Score95.7
10
Visual Question AnsweringV*
Accuracy49.73
10
Visual SearchV* bench (test)
Attribute Rate87
10
Fine-grained Visual ReasoningV*
Accuracy89
8
Multimodal Multi-choiceV*
Accuracy84.3
8
Visual Search and ComprehensionV*
Accuracy89.8
8
Multimodal reasoningV*
Pass@189.5
7
Visual SearchV* benchmark
Attribute Success Rate75.65
5
Showing 18 of 18 rows