Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GenEval

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Image GenerationGenEval
Overall Score98
704
Text-to-image generationGenEval
Overall Score96
517
Text-to-Image GenerationGenEval
GenEval Score97.58
442
Text-to-Image GenerationGenEval
Overall Score80.48
277
Text-to-Image GenerationGenEval (test)
Two Obj. Acc99
250
Text-to-Image GenerationGenEval
Overall Score94
218
Text-to-Image GenerationGenEval
Overall Score (GenEval)0.98
153
Text-to-Image GenerationGenEval 1.0 (test)
Overall Score95
130
Text-to-Image GenerationGenEval
GenEval Score0.96
108
Text-to-Image GenerationGenEval
Overall Score88.3
96
Compositional Image GenerationGenEval
Overall Score60.34
84
Text-to-Image GenerationGenEval++
Color Accuracy90
75
Image GenerationGenEval
Overall Score89
69
Image GenerationGenEval
Overall GenEval Score95
65
Image GenerationGenEval (test)
GenEval Score91
48
Visual GenerationGenEval
Two Obj. Acc94
43
Text-to-Image GenerationGenEval (val)
GenEval Score90
33
Text-to-image reward alignmentGenEval (test)
Reward 1 Score (r1)0.26
30
Image GenerationGenEval overall
GenEval Overall Score90
30
Text-to-Image GenerationGenEval 2
GenEval2 Overall Score75.1
27
Text-to-Image GenerationGenEval
Two Objects Score96.97
27
Text-to-Image GenerationGenEval 1024x1024
Overall Score (GenEval)0.8
23
Multimodal GenerationGenEval
Score90
21
Composition Image GenerationGenEval
GenEval Score97
20
Text-to-ImageGenEval 11 (test)
Accuracy (Single Obj)100
19
Showing 25 of 83 rows