Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Geometric Reasoning on OlympiadBench-Geo olympiad level
Loading...
77.68
Accuracy
OpenAI-o3
10.8184
28.1767
45.535
62.8933
Apr 22, 2025
Accuracy
Updated 23d ago
Evaluation Results
Method
Method
Links
Accuracy
OpenAI-o3
Release Date=Apr, 2025
2025.04
77.68
Gemini-2.5-pro
Release Date=Jun, 2025
2025.04
75
Intern-S1
Release Date=Jul, 2025
2025.04
49.11
Qwen2.5-VL-72B
Release Date=Jan, 2025
2025.04
29.46
Claude-3.7-sonnet
Release Date=Feb, 2025
2025.04
17.86
GPT-4o
Release Date=May, 2024
2025.04
13.39
Feedback
Search any
task
Search any
task