Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GenEval

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-image generationGenEval
Overall Score96
506
Text-to-Image GenerationGenEval
Overall Score95
391
Text-to-Image GenerationGenEval
GenEval Score95
360
Text-to-Image GenerationGenEval (test)
Two Obj. Acc99
221
Text-to-Image GenerationGenEval
Overall Score94
218
Text-to-Image GenerationGenEval
Overall Score88.3
96
Text-to-Image GenerationGenEval
GenEval Score0.9
88
Text-to-Image GenerationGenEval 1.0 (test)
Overall Score84
85
Image GenerationGenEval
Overall Score89
57
Text-to-Image GenerationGenEval++
Color Accuracy90
45
Compositional Image GenerationGenEval
Overall Score0.99
44
Image GenerationGenEval (test)
GenEval Score91
35
Text-to-Image GenerationGenEval (val)
GenEval Score90
33
Visual GenerationGenEval
Single Obj. Acc99
31
Text-to-image reward alignmentGenEval (test)
Reward 1 Score (r1)0.26
30
Image GenerationGenEval overall
GenEval Overall Score90
30
Text-to-Image GenerationGenEval
Two Objects Score96.97
27
Text-to-Image GenerationGenEval 1024x1024
Overall Score (GenEval)0.8
23
Multimodal GenerationGenEval
Score90
21
Composition Image GenerationGenEval
GenEval Score97
20
Text-to-ImageGenEval 11 (test)
Accuracy (Single Obj)100
19
Text-to-Image GenerationGenEval
GE Score61.9
18
Text-to-Image GenerationGenEval
DINO0.786
18
Text-to-Image GenerationGenEval 2
Soft TIFA AM80
17
Text-to-Image GenerationGenEval
GenEval Score77
17
Showing 25 of 58 rows