Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Mapping on Mapping H1
Loading...
4,773
Mean Episode Reward
Shared Policy
4,678.36
4,702.93
4,727.5
4,752.07
Apr 5, 2026
Mean Episode Reward
Updated 12d ago
Evaluation Results
Method
Method
Links
Mean Episode Reward
Shared Policy
Budget (B)=200,000, Se...
2026.04
4,773
Local Fine-Tuning
Budget (B)=200,000, Se...
2026.04
4,765.4
DC-Ada
Budget (B)=200,000, Se...
2026.04
4,760.2
Obs. Normalization
Budget (B)=200,000, Se...
2026.04
4,710
Random Perturbation
Budget (B)=200,000, Se...
2026.04
4,682
Feedback
Search any
task
Search any
task