Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Geometric Problem Solving on GeoTrust Tier2 (test)
Loading...
24
Count
Gemini-2.5-pro
0.08
6.29
12.5
18.71
Apr 22, 2025
Count
Accuracy
Updated 22d ago
Evaluation Results
Method
Method
Links
Count
Accuracy
Gemini-2.5-pro
#Params=-
2025.04
24
40
OpenAI-o3
#Params=-
2025.04
24
40
Intern-S1
#Params=235B+6B
2025.04
21
35
DeepSeek-R1
#Params=671B, input_fo...
2025.04
20
33.33
Claude-3.7-sonnet
#Params=-
2025.04
16
26.67
Qwen2.5-VL-72B
#Params=72B
2025.04
15
25
GPT-4o
#Params=-
2025.04
10
16.67
Qwen2-VL-7B
#Params=7B
2025.04
2
3.33
LLaVA-1.5-7B
#Params=7B
2025.04
1
0.42
Feedback
Search any
task
Search any
task