Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Decision Making under Imperfect Recall: Algorithms and Benchmarks

About

In game theory, imperfect-recall decision problems model situations in which an agent forgets information it held before. They encompass games such as the ``absentminded driver'' and team games with limited communication. In this paper, we introduce the first benchmark suite for imperfect-recall decision problems. Our benchmarks capture a variety of problem types, including ones concerning privacy in AI systems that elicit sensitive information, and AI safety via testing of agents in simulation. Across 61 problem instances generated using this suite, we evaluate the performance of different algorithms for finding first-order optimal strategies in such problems. In particular, we introduce the family of regret matching (RM) algorithms for nonlinear constrained optimization. This class of parameter-free algorithms has enjoyed tremendous success in solving large two-player zero-sum games, but, surprisingly, they were hitherto relatively unexplored beyond that setting. Our key finding is that RM algorithms consistently outperform commonly employed first-order optimizers such as projected gradient descent, often by orders of magnitude. This establishes, for the first time, the RM family as a formidable approach to large-scale constrained optimization problems.

Emanuel Tewolde, Brian Hu Zhang, Ioannis Anagnostides, Tuomas Sandholm, Vincent Conitzer• 2026

Related benchmarks

TaskDatasetResultRank
Decision Making under Imperfect RecallDetection Game Benchmarks 1.0 (full set)
Value Score18
11
Decision Making under Imperfect Recall61 aggregate benchmark instances (Sim, Det, and Rand)
Utility (% of Best)73.8
8
Decision Making under Imperfect RecallRandom (Rand) Game Benchmarks 1.0 (full set)
Value0.65
8
Simulation ProblemSim 540k
Value Score8.54
7
Random ProblemRand-24k
Value66
7
Subgroup DetectionDet 1k
Detection Value13
7
Decision Making under Imperfect RecallSimulation (Sim) Game Benchmarks 1.0 (full set)
Value14.01
6
Subgroup DetectionDet-2.2m
Performance Score16.36
6
Subgroup DetectionDet 10m
Performance Value24.76
6
Showing 9 of 9 rows

Other info

Follow for update