Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Adversarial Reinforcement Learning on Connect Four 70% optimal adversary (test)

0.02Average Return

ARDT

-0.5936-0.4343-0.275-0.1157Jul 25, 2024
Updated 1mo ago

Evaluation Results

MethodLinks
2024.07
0.02
2024.07
-0.42
2024.07
-0.57