Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Visual Mathematical Reasoning on CLEVR-Math
Loading...
79
Accuracy
OctoTools
63.92
67.835
71.75
75.665
Feb 16, 2025
Accuracy
Delta (Zero-Shot)
Delta (CoT)
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
Delta (Zero-Shot)
Delta (CoT)
OctoTools
Backbone=gpt-4o-2024-0...
2025.02
79
14.5
3.8
CoT
Backbone=gpt-4o-2024-0...
2025.02
75.2
-
-
OctoToolsbase
Backbone=gpt-4o-2024-0...
2025.02
68.8
-
-
0-shot
Backbone=gpt-4o-2024-0...
2025.02
64.5
-
-
Feedback
Search any
task
Search any
task