Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

VisualProbe

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual ReasoningVisualProbe Hard
Accuracy0.434
18
Visual ReasoningVisualProbe Medium
Accuracy40.6
18
Visual ReasoningVisualProbe Easy
Accuracy65.4
18
Visual ReasoningVisualProbe (VP) cross-domain (test)
Accuracy0.4357
15
Visual ReasoningVisualProbe (test)
Accuracy (Hard)47.2
7
Showing 5 of 5 rows