
EvalGen

Benchmarks

Task Name                       Dataset Name                 SOTA Result              Trend
AI-generated image detection    EvalGEN                      Balanced Accuracy: 99.9  12
AIGI Detection                  EvalGEN                      BBox Accuracy: 99        12
Human-Metric Correlation        EvalGen Out-of-Distribution  Kendall's Tau: 0.382     9
Compositional Image Generation  EvalGen                      EvalGen Score: 0.96      5