Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Reinforcement Learning on Cart-Pole Domain Generalization - Pole Length OpenAI Gym (3 held out domains)
Loading...
175.25
Average Reward
RL-MLDG-GN
94.2756
115.2978
136.32
157.3422
Oct 10, 2017
Average Reward
Updated 4d ago
Evaluation Results
Method
Method
Links
Average Reward
RL-MLDG-GN
Variant=Gradient norm...
2017.10
175.25
RL-MLDG
Variant=Vanilla
2017.10
165.34
RL-Random-Source
Source Training=Single...
2017.10
133.74
RL-MLDG-GC
Variant=Cosine alignme...
2017.10
129.56
RL-Undobias
Architecture=Non-linea...
2017.10
113.52
RL-All
Source Training=Aggreg...
2017.10
97.39
Feedback
Search any
task
Search any
task