Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Multi-modal Reasoning on Geometry3K
Loading...
37.9
Pass@1
APMPO
25.212
28.506
31.8
35.094
Apr 11, 2026
Pass@1
Updated 27d ago
Evaluation Results
Method
Method
Links
Pass@1
APMPO
2026.04
37.9
GMPO
2026.04
37.2
DAPO
2026.04
36.5
FREIA
2026.04
36.1
GRPO
2026.04
35.6
GRPO
2026.04
35.6
TTRL
2026.04
35.3
Intuitor
2026.04
34.8
Entropy
2026.04
34.4
Base
2026.04
25.7
Base
2026.04
25.7
Feedback
Search any
task
Search any
task