Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Question Answering on AlgoPuzzleVQA
Loading...
48.7
Accuracy
OctoTools
41.004
43.002
45
46.998
Feb 16, 2025
Accuracy
Delta (Zero-Shot)
Delta (CoT)
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Delta (Zero-Shot)
Delta (CoT)
OctoTools
Backbone=gpt-4o-2024-0...
2025.02
48.7
7.4
6
OctoToolsbase
Backbone=gpt-4o-2024-0...
2025.02
44
-
-
CoT
Backbone=gpt-4o-2024-0...
2025.02
42.7
-
-
0-shot
Backbone=gpt-4o-2024-0...
2025.02
41.3
-
-
Feedback
Search any
task
Search any
task