Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Random Game Benchmarks

Benchmarks

Task NameDataset NameSOTA ResultTrend
Decision Making under Imperfect RecallRandom (Rand) Game Benchmarks 1.0 (full set)
Value0.69
8
Showing 1 of 1 rows