Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

WhatsUp

Benchmarks

Task NameDataset NameSOTA ResultTrend
Visual World ModellingWhatsUp
GPT-4o Score8.2
18
Forward-dynamics PredictionWhatsUp AURORA-BENCH
GPT-4o Score3.3
9
Action-centric EditingWhatsUp (test)
Human Evaluation Score0.25
4
Showing 3 of 3 rows