Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GenAI-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Image GenerationGenAI-Bench
Basic Score0.939
41
Compositional ReasoningGenAI-Bench (test)
Spatial Score83.79
18
Text-to-ImageGenAI-Bench 19 (test)
VQAScore78.2
17
element-level text-to-image alignment evaluationGenAI-Bench
SRCC0.749
17
Video GenerationGenAI-Bench
Accuracy82.5
14
Image GenerationGenAI-Bench
Accuracy75.9
14
Instruction-guided image editing preference predictionGenAI-Bench
Accuracy65.72
12
Visual GenerationGenAI-Bench
Overall Score75
11
Video Preference AlignmentGenAI-Bench
Alignment Accuracy (w/ties)64.26
11
Pairwise PreferenceGenAI Bench (test)
Accuracy72.38
11
Video Generation AssessmentGenAI-Bench Video (test)
Accuracy82.5
8
Image Generation AssessmentGenAI-Bench Image (test)
Accuracy73.4
8
Video Preference ModelingGenAI-Bench (evaluation)
Tau68.7
7
Visual ReasoningGenAI-Bench
SRCC74.4
5
Text-to-Image GenerationGenAI-Bench (test)
Text Alignment83
5
Image-Text AlignmentGenAI-Bench Advanced
Alignment Score0.276
3
Image-Text AlignmentGenAI-Bench Basic
Alignment Score29.6
3
Text-to-Video GenerationGenAI-Bench
Score80.33
2
Showing 18 of 18 rows