Trojan Detection on Humanoid
[Chart: TDSR by method. Best result: Plan2Cleanse, 99.7 TDSR, May 10, 2026]
Updated 22d ago
Evaluation Results
Method          Setting         Date     TDSR
Plan2Cleanse    Iteration=1000  2026.05  99.7
PolicyCleanse   Iteration=1000  2026.05  60.9
Uniform Random  Iteration=1000  2026.05  3
Normal Agent    Iteration=1000  2026.05  0