Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MemGUI-Bench

Benchmarks

Task NameDataset NameSOTA ResultTrend
Long-horizon GUI InteractionMemGUI-Bench
Precision@1 (1 App)50
14
Task SuccessMemGUI-Bench 1.0 (test)
Pass@1 L166.7
11
Showing 2 of 2 rows