Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adversarial Reinforcement Learning on Connect Four 50% optimal adversary (test-time)

0.11Average Return

ARDT

-0.2956-0.1903-0.0850.0203Jul 25, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.07
0.11
2024.07
0
2024.07
-0.28