Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Geometric Interleaved Reasoning on GGBench

39.11VLM-I Final Score

GPT-image-1

1.20211.043520.88530.7265Mar 1, 2026
Updated 1mo ago

Evaluation Results

MethodLinks
2026.03
39.11--50.6716.9564.3739.11
2026.03
37.3662.0176.7949.6554.859.4957.08
2026.03
37.1460.1389.1133.3120.3675.5763.13
2026.03
30.2961.1977.9252.2251.7450.5254.11
2026.03
26.4160.2473.1357.2148.3350.1249.77
2026.03
23.9456.449.5539.452.3358.7136.74
2026.03
22.8158.5444.8351.8564.5359.5133.82
2026.03
22.75--56.3958.2348.0622.75
2026.03
20.4861.1662.4266.0637.9437.5941.45
2026.03
20.355.6650.3967.3535.2638.3133.04
2026.03
19.7633.8521.6957.7457.7660.9720.73
2026.03
15.838.537.4168.3937.1739.7326.61
2026.03
12.9758.6539.378.8123.9224.8126.13
2026.03
5.0253.3225.6352.9112.1912.9415.33
2026.03
2.6659.7326.1995.435.455.6914.43