Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

GenAI-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Text-to-Image GenerationGenAI-Bench
Basic Score0.939
47
Prompt-to-prompt semantic compositionGenAI-Bench
CLIP-T (S*T)46.8
30
Instruction-guided image editing preference predictionGenAI-Bench
Accuracy67.5
24
Compositional ReasoningGenAI-Bench (test)
Spatial Score83.79
18
Text-to-ImageGenAI-Bench 19 (test)
VQAScore78.2
17
element-level text-to-image alignment evaluationGenAI-Bench
SRCC0.749
17
Human Consistency EvaluationGenAI-Bench
Kendall's Tau-c38.4
16
Video GenerationGenAI-Bench
Accuracy82.5
14
Image GenerationGenAI-Bench
Accuracy75.9
14
Text-to-Image GenerationGenAI-Bench advanced prompts
Counting Score82
12
Generative AI evaluation consistencyGenAI-Bench
Pearson Correlation Score (r)70.3
11
Text-to-Image Generation EvaluationGenAI-Bench
Pearson-r70.3
11
Visual GenerationGenAI-Bench
Overall Score75
11
Video Preference AlignmentGenAI-Bench
Alignment Accuracy (w/ties)64.26
11
Pairwise PreferenceGenAI Bench (test)
Accuracy72.38
11
Image Editing Quality AssessmentGenAI-Bench
Accuracy83.9
10
Video Generation AssessmentGenAI-Bench Video (test)
Accuracy82.5
8
Image Generation AssessmentGenAI-Bench Image (test)
Accuracy73.4
8
Video Preference ModelingGenAI-Bench (evaluation)
Tau68.7
7
Text-to-Image Generation EvaluationGenAi-Bench
Kendall Tau B (Basic)0.446
5
Visual ReasoningGenAI-Bench
SRCC74.4
5
Text-to-Image GenerationGenAI-Bench (test)
Text Alignment83
5
Video generation assessmentGenAI-Bench
Pairwise Accuracy70.16
3
Image-Text AlignmentGenAI-Bench Advanced
Alignment Score0.276
3
Image-Text AlignmentGenAI-Bench Basic
Alignment Score29.6
3
Showing 25 of 28 rows