Plane Geometry Problem Solving on Formalgeo (test)
[Chart: Accuracy. Top result: 0.857 (MLLM Interpreter), Jan 29, 2026]
Evaluation Results
Method              Training Data Size   Date      Accuracy
MLLM Interpreter    5.5k                 2026.01   0.857
Gemini2.5-Pro       -                    2026.01   0.818
Gemini2.5-Flash     -                    2026.01   0.805
DFE-GPS             238k                 2026.01   0.753
GLM4.1-V            -                    2026.01   0.734
Claude-Sonnet-4     -                    2026.01   0.691
Claude-Opus-4.1     -                    2026.01   0.691
GeoUni              235k                 2026.01   0.598
GPT-4o              -                    2026.01   0.58
Qwen2.5-VL 32B      -                    2026.01   0.573