Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Gen

Benchmarks

Task NameDataset NameSOTA ResultTrend
Object DetectionGen1 (test)
mAP50.4
36
Object DetectionGen1
mAP50.4
23
Object DetectionGen1
mAP47.9
21
Generative Language TasksGEN benchmark
IFEval93.4
9
GeneralizationGen
Gen Score35.7
8
Video Nudity ErasureGen
Nudity Rate17.29
6
AI Video DetectionGen-3 Alpha
Total Count56
1
Showing 7 of 7 rows