POPGym

Benchmarks

Task Name	Dataset Name	SOTA Result
Reinforcement Learning	POPGym Noisy Stateless Pendulum Hard	MMER0.657	9
Reinforcement Learning	POPGym Stateless Pendulum (Hard)	MMER82.8	9
Reinforcement Learning	POPGym Noisy Stateless CartPole (Hard)	MMER20.7	9
Reinforcement Learning	POPGym Stateless CartPole Hard	MMER0.127	9
Reinforcement Learning	POPGym Aggregated (48 tasks)	Aggregated Return (All)10.4	6
Offline Reinforcement Learning	POPGym	Average Normalized Score (All)9.5	5
Memory	POPGym Copy k=10	Temporal Range16.715	4
Memory	POPGym Copy k=5	Temporal Range17.255	4
Memory	POPGym Copy k=3	Temporal Range17.312	4
Memory	POPGym Copy k=1	Temporal Range12.294	4
Memory	POPGym RepeatFirst	Temporal Range21.177	4
Control	POPGym Noisy Stateless CartPole	Temporal Range15.274	4
Control	POPGym Stateless CartPole	Temporal Range13.704	4
Control	POPGym CartPole	Temporal Range12.362	4

Showing 14 of 14 rows