Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Veg

Benchmarks

Task NameDataset NameSOTA ResultTrend
Imitation LearningVeg (unseen)
Success Rate60
10
Role-Playing Evaluation (Visual-Element-Groundedness)VEG
Win Rate65
9
Showing 2 of 2 rows