Scientific Question Answering on MMLU-Pro
[Chart: Accuracy over time. Best result: OctoTools, 73.7 accuracy (Feb 16, 2025). Series shown: Accuracy, Delta (Zero-Shot), Delta (CoT).]
Evaluation Results
Method          Configuration              Date     Accuracy  Delta (Zero-Shot)  Delta (CoT)
OctoTools       Backbone=gpt-4o-2024-0...  2025.02  73.7      2.0                3.4
0-shot          Backbone=gpt-4o-2024-0...  2025.02  71.7      -                  -
OctoTools-base  Backbone=gpt-4o-2024-0...  2025.02  71.5      -                  -
CoT             Backbone=gpt-4o-2024-0...  2025.02  70.3      -                  -
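
The page does not say how the two Delta columns are derived. The numbers are consistent with each delta being the OctoTools accuracy minus the accuracy of the corresponding baseline row (0-shot or CoT). The sketch below checks that reading. The `accuracy` dictionary and the `delta` helper are illustrative names, not part of the leaderboard.

```python
# Accuracy values (percent) copied from the Evaluation Results table.
accuracy = {
    "OctoTools": 73.7,
    "0-shot": 71.7,
    "OctoTools-base": 71.5,
    "CoT": 70.3,
}


def delta(method: str, baseline: str) -> float:
    """Accuracy gain of `method` over `baseline`, in percentage points.

    Assumption: the leaderboard's Delta columns are plain differences
    against the baseline rows; the page itself does not confirm this.
    """
    # Round to one decimal place to match the precision shown in the table.
    return round(accuracy[method] - accuracy[baseline], 1)


delta_zero_shot = delta("OctoTools", "0-shot")
delta_cot = delta("OctoTools", "CoT")
print(delta_zero_shot, delta_cot)
```

Under this assumption the computed values match the reported 2 and 3.4, which suggests the deltas are absolute percentage-point gains, not relative improvements.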