Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Geometric Reasoning on Geo BBEH
Loading...
45
Accuracy
CoT
1.32
12.66
24
35.34
Mar 14, 2026
Accuracy
Cost
Updated 26d ago
Evaluation Results
Method
Method
Links
Accuracy
Cost
CoT
Model=Qwen3
2026.03
45
3,468
CoT
Model=Gemma3
2026.03
32.5
4,115
CoT
Model=Llama3.1
2026.03
25.5
3,556
ToT
Model=Llama3.1
2026.03
10
8,648
DST
Model=Llama3.1
2026.03
8.5
4,601
DPTS
Model=Llama3.1
2026.03
7.5
6,050
ToT
Model=Qwen3
2026.03
6
8,984
ToT
Model=Gemma3
2026.03
6
10,762
DST
Model=Gemma3
2026.03
5.5
5,809
DST
Model=Qwen3
2026.03
4
4,994
DPTS
Model=Gemma3
2026.03
3.5
7,437
DPTS
Model=Qwen3
2026.03
3
6,299
Feedback
Search any
task
Search any
task