Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Adversarial Reinforcement Learning on Connect Four 30% optimal adversary (test-time)

0.55Average Return

ARDT

0.16520.26510.3650.4649Jul 25, 2024
Updated 4d ago

Evaluation Results

MethodLinks
2024.07
0.55
2024.07
0.44
2024.07
0.18