Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Text-to-Image Generation on Human Evaluation Total
Loading...
85
Win Ratio
SEER
61.08
67.29
73.5
79.71
Jan 28, 2026
Win Ratio
Win Ratio (Simp)
Win Ratio (Hard)
Updated 3d ago
Evaluation Results
Method
Method
Links
Win Ratio
Win Ratio (Simp)
Win Ratio (Hard)
SEER
Opponent Model=Base (CoT)
2026.01
85
-
-
SEER
Opponent Model=Base (CoT)
2026.01
85
87
84
SEER
Opponent Model=Blip3-o
2026.01
83
93
76
SEER
Opponent Model=Blip3-o
2026.01
81
-
-
SEER
Opponent Model=Show-o2
2026.01
81
-
-
SEER
Opponent Model=Show-o2
2026.01
77
73
80
SEER
Opponent Model=Bagel
2026.01
75
-
-
SEER
Opponent Model=Bagel
2026.01
73
63
79
SEER
Opponent Model=Bagel-T...
2026.01
69
-
-
SEER
Opponent Model=Bagel-T...
2026.01
62
55
66
Feedback
Search any
task
Search any
task