Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multimodal Visual Logic Reasoning on VisualPuzzles
Loading...
58.2
Mean@5
Naive-KD
51.44
53.195
54.95
56.705
Mar 25, 2026
Mean@5
Updated 19d ago
Evaluation Results
Method
Method
Links
Mean@5
Naive-KD
Train Set=LogicVista,...
2026.03
58.2
TED
Train Set=LogicVista,...
2026.03
57.9
Reflexion
Train Set=LogicVista,...
2026.03
57.4
Direct
Train Set=-, Student=Q...
2026.03
57.2
Naive-KD
Train Set=LogicVista,...
2026.03
56.6
TED
Train Set=LogicVista,...
2026.03
56.1
Reflexion
Train Set=LogicVista,...
2026.03
52.4
Direct
Train Set=-, Student=Q...
2026.03
51.7
Feedback
Search any
task
Search any
task