Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Adversarial Reinforcement Learning on Connect Four 50% optimal adversary (test-time)
Loading...
0.11
Average Return
ARDT
-0.2956
-0.1903
-0.085
0.0203
Jul 25, 2024
Average Return
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Return
ARDT
test-time adversary op...
2024.07
0.11
ESPER
test-time adversary op...
2024.07
0
DT
test-time adversary op...
2024.07
-0.28
Feedback
Search any
task
Search any
task