Scientific Problem Solving on SciBench (Diff, Stat, Calc metrics)
Best Diff Accuracy: 65.47 (Meta-reasoner)
[Chart: Diff Accuracy, Stat Accuracy, and Calc Accuracy over time; data point at Feb 27, 2025]
Updated 26d ago
Evaluation Results
Method         Model            Date     Diff Accuracy  Stat Accuracy  Calc Accuracy
Meta-reasoner  gemini-exp-1206  2025.02  65.47          79.42          82.77
Meta-reasoner  gpt-4o-mini      2025.02  60.32          73.64          80.23
HiAR-ICL       gemini-exp-1206  2025.02  57.76          75.92          80.23
HiAR-ICL       gpt-4o-mini      2025.02  57.42          70.12          77.93
Evo-Prompt     gemini-exp-1206  2025.02  57.32          70.32          78.42
Evo-Prompt     gpt-4o-mini      2025.02  55.28          67.32          76.53