Adversarial Reinforcement Learning on Connect Four 100% optimal adversary (test-time)
ESPER: -0.98 Avg Return
[Chart: Avg Return by date; ESPER, Jul 25, 2024]
Evaluation Results
Method                                        Avg Return
ESPER (test-time adversary op..., 2024.07)    -0.98
DT (test-time adversary op..., 2024.07)       -1
ARDT (test-time adversary op..., 2024.07)     -1