
GenAI

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
| --- | --- | --- | --- |
| Global Multimodal Question Answering | GenAI | Comprehensiveness: 95.2 | 12 |
| Video Reward Modeling | GenAI | Accuracy (w/ Tie): 54.8 | 7 |
| Aesthetic Preference Prediction | GenAI | Accuracy (w/ Ties): 46.29 | 6 |
| Attack-Mitigation Effectiveness | GenAI | CIS: 1 | 1 |