Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Single-agent Reinforcement Learning (SARL) on Breakout Clean
Loading...
505.5
Average Episode Return
SHINE
200.468
279.659
358.85
438.041
May 7, 2026
Average Episode Return
Updated 26d ago
Evaluation Results
Method
Method
Links
Average Episode Return
SHINE
2026.05
505.5
BIRD
2026.05
478.2
STRIP
2026.05
476.8
BehaviorGuard
2026.05
465.2
FeatureRE
2026.05
450.1
Original
2026.05
445.3
Direct retraining
2026.05
435.1
NC
2026.05
252.5
PD
2026.05
212.2
Feedback
Search any
task
Search any
task