Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Backdoor Attack on Reinforcement Learning on Breakout Discrete (evaluation)
Loading...
495.2
Baseline Reward
TrojDRL
454.536
465.093
475.65
486.207
Feb 4, 2026
Baseline Reward
Attack Success Rate (ASR)
ASR Standard Deviation
Baseline Reward Standard Deviation
Updated 1mo ago
Evaluation Results
Method
Method
Links
Baseline Reward
Attack Success Rate (ASR)
ASR Standard Deviation
Baseline Reward Standard Deviation
TrojDRL
Target Action=No-Op, A...
2026.02
495.2
98.5
2
58.6
SleeperNets
Target Action=No-Op, A...
2026.02
489.6
100
0
25.9
No Poisoning
Agent Architecture=PPO...
2026.02
476.5
-
-
9.4
Daze
Target Action=No-Op, A...
2026.02
465.6
97.6
0.1
31.1
Q-Incept
Target Action=No-Op, A...
2026.02
456.1
100
0
19.7
Feedback
Search any
task
Search any
task