Adversarial Reinforcement Learning on Connect Four 50% optimal adversary (test-time)

Average Return: 0.11

ARDT

[Chart: Average Return over time; data point dated Jul 25, 2024]

Evaluation Results

Date       Average Return
2024.07    0.11
2024.07    0
2024.07    -0.28