Reasoning-aware Image Generation on RiseBench 1.0 (test)

Instruction Reasoning: 0.77

Gemini-3-pro-image-preview

Jan 6, 2026

Evaluation Results

Method   Links
0.77    0.855   0.944   0.412   0.611   0.48    0.376   0.472
0.628   0.802   0.949   0.341   0.322   0.37    0.106   0.289
2026.01
0.619   0.762   0.905   0.329   0.3     0.41    0.094   0.289
0.612   0.86    0.913   0.259   0.478   0.37    0.188   0.328
0.589   0.674   0.912   0.129   0.122   0.11    0.071   0.108
2026.01
0.587   0.757   0.809   0.152   0.177   0.2     0.082   0.155
2026.01
0.586   0.759   0.901   0.247   0.222   0.38    0.094   0.242
0.541   0.715   0.937   0.247   0.289   0.33    0.094   0.244
2026.01
0.533   0.736   0.781   0.141   0.177   0.18    0.035   0.136
0.499   0.684   0.849   0.106   0.133   0.11    0.023   0.094
0.489   0.682   0.827   0.082   0.155   0.23    0.047   0.133
2026.01
0.459   0.738   0.801   0.059   0.178   0.21    0.012   0.119
0.372   0.664   0.869   0.047   0.1     0.17    0.024   0.089
2026.01
0.365   0.535   0.73    0.024   0.056   0.14    0.012   0.061
2026.01
0.339   0.527   0.729   0.012   0.033   0.04    0.024   0.028
0.303   0.126   0.749   0       0.022   0.02    0.035   0.019
0.26    0.716   0.852   0.023   0.055   0.13    0.012   0.058
2026.01
0.251   0.415   0.735   0.012   0.01    0       0.012   0.008
2026.01
0.226   0.382   0.783   0.012   0.011   0       0       0.005