Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on CartPole (Average Reward)

1,000Average Reward

SALSA-RL

165.92382.46599815.54May 29, 2024Sep 24, 2024Jan 20, 2025May 18, 2025Sep 13, 2025Jan 9, 2026May 8, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2025.02
1,000
2025.02
1,000
2025.02
1,000
2025.02
1,000
2025.02
1,000
2025.02
1,000
2025.02
1,000
2025.02
999.6
2025.02
998
2025.02
993.9
2026.01
989.5
2026.01
977.9
2026.01
973.4
2025.02
971.8
2026.01
962.3
2026.05
500
2026.05
500
2026.05
500
2026.05
499.95
2026.05
496
2026.05
474.67
2026.01
331.4
2026.05
258.09
2026.05
253.06
220
2026.05
216.92
2024.05
210
2024.05
200
2024.05
198