Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reasoning Image Editing on RISEBench 1.0 (test)

34.1Temporal Score

GPT-4o

-1.3647.84317.0526.257Dec 5, 2025Dec 9, 2025Dec 13, 2025Dec 17, 2025Dec 21, 2025Dec 25, 2025Dec 29, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.12
34.132.23710.628.9
2025.12
34.132.23710.628.9
2025.12
25.947.83718.832.8
2025.12
25.932.2409.427.5
2025.12
17.68.982.49.2
2025.12
16.524.4335.920.6
2025.12
16.417.7161.113
2025.12
12.912.2117.110.8
2025.12
11.817.8207.114.4
2025.12
10.823.3278.217.8
2025.12
8.215.5234.713.3
2025.12
5.917.8211.211.9
2025.12
5.917.8211.211.9
2025.12
4.710172.48.9
2025.12
4.77.853.53.4
2025.12
4.710172.48.9
2025.12
3.54.495.95.8
2025.12
3.52.271.13.6
2025.12
2.45.6141.26.1
2025.12
2.41.171.23.1
2025.12
2.35.5131.25.8
2025.12
2.35.5131.25.8
2025.12
1.2101.20.8
2025.12
1.21.1000.5
2025.12
1.23.342.42.8
2025.12
02.223.51.9
2025.12
00000
2025.12
00000
2025.12
02.272.33
2025.12
02.223.51.9