Reinforcement Learning on InvertedPendulum v2

[Plot: Mean Reward over time, Apr 30, 2022 to Apr 4, 2026; labeled method: TTOpt]
Updated 11d ago

Evaluation Results

Date      Mean Reward   (unlabeled)
2022.04   1,000         0
2022.04   1,000         0
2026.04   964.3         -
2026.04   949.8         -
2026.04   945.5         -
2026.04   937.3         -
2026.04   935.7         -
2026.04   932.2         -
2026.04   925.5         -
2026.04   919.7         -
2022.04   893           283.1
2026.04   764.5         -
2022.04   721           335.37
2026.04   714.2         -
2022.04   651.86        436.37
2026.04   637.3         -
2022.04   621           472.81
2026.04   597.3         -
2026.04   568.1         -
2026.04   340.7         -
2022.04   224.71        217.51
2022.04   222.86        342.79
2026.04   67.7          -
2026.04   32.1          -
2026.04   24.3          -
2026.04   20.7          -
2026.04   19.4          -