Adversarial Reinforcement Learning on Connect Four 70% optimal adversary (test)

0.02Average Return

ARDT

Updated 5mo ago

Evaluation Results