Single-agent Reinforcement Learning (SARL) on Pong Clean
Chart: Average Episode Return over time. Best result: BehaviorGuard, 18.5 (May 7, 2026). Updated 26d ago.
Evaluation Results
Method             Date     Average Episode Return
BehaviorGuard      2026.05  18.5
SHINE              2026.05  18.2
Original           2026.05  16.9
STRIP              2026.05  15.2
FeatureRE          2026.05  7.3
Direct retraining  2026.05  7.1
NC                 2026.05  3.4