Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

PuzzleVQA

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual Puzzle ReasoningPuzzleVQA
Accuracy46.8
12
Visual Question AnsweringPuzzleVQA
Accuracy61
4
Showing 2 of 2 rows