Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Geometric Problem Solving on GeoTrust Tier3 (test)
Loading...
26
Count
Gemini-2.5-pro
-1.04
5.98
13
20.02
Apr 22, 2025
Count
Accuracy
Updated 23d ago
Evaluation Results
Method
Method
Links
Count
Accuracy
Gemini-2.5-pro
#Params=-
2025.04
26
43.33
OpenAI-o3
#Params=-
2025.04
26
43.33
Intern-S1
#Params=235B+6B
2025.04
25
41.67
DeepSeek-R1
#Params=671B, input_fo...
2025.04
22
36.67
Qwen2.5-VL-72B
#Params=72B
2025.04
15
25
GPT-4o
#Params=-
2025.04
11
18.33
Claude-3.7-sonnet
#Params=-
2025.04
10
16.67
Qwen2-VL-7B
#Params=7B
2025.04
2
3.33
LLaVA-1.5-7B
#Params=7B
2025.04
0
0
Feedback
Search any
task
Search any
task