Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mathematical Reasoning on Omni-MATH (Accuracy and CoT Deltas)
Loading...
32.2
Accuracy (Omni-MATH)
OctoTools
26.792
28.196
29.6
31.004
Feb 16, 2025
Accuracy (Omni-MATH)
Zero-Shot Performance Gain
CoT Performance Gain
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy (Omni-MATH)
Zero-Shot Performance Gain
CoT Performance Gain
OctoTools
Backbone=gpt-4o-2024-0...
2025.02
32.2
5.2
2.9
OctoToolsbase
Backbone=gpt-4o-2024-0...
2025.02
30.2
-
-
CoT
Backbone=gpt-4o-2024-0...
2025.02
29.3
-
-
0-shot
Backbone=gpt-4o-2024-0...
2025.02
27
-
-
Feedback
Search any
task
Search any
task