Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Reinforcement Learning on Walker2D v5

6,335.5Average Return

TD3+DBC(*)

-45.1081,611.3963,267.94,924.404Jun 8, 2025Jul 19, 2025Aug 29, 2025Oct 9, 2025Nov 19, 2025Dec 30, 2025Feb 10, 2026
Updated 15d ago

Evaluation Results

MethodLinks
2026.02
6,335.5-
2026.02
6,138.2-
2026.02
5,802.6-
2026.02
5,448.1-
2026.02
5,093.7-
2026.02
4,986.3-
2026.02
4,854.4-
2026.02
4,766.5-
2026.02
4,417-
2026.02
4,385-
2026.02
4,295-
2026.02
4,050-
2026.02
4,050-
2026.02
4,045-
2026.02
3,981-
2026.02
3,950-
2026.02
3,925-
2026.02
3,925-
2026.02
3,899-
2026.02
3,893-
2026.02
3,884-
2026.02
3,814-
2026.02
3,766-
2026.02
3,708-
2026.02
3,640-
2026.02
3,632-
2026.02
3,603-
2026.02
3,600-
2026.02
3,580-
2026.02
3,424-
2026.02
3,297.4-
2026.02
3,243-
2026.02
3,132-
2026.02
2,830-
2026.02
2,663-
2026.02
2,626.4-
2026.02
2,533.9-
2026.02
2,210-
2026.02
2,181-
2026.02
1,351-
2026.02
1,124-
2026.02
1,105-
2026.02
942-
2025.06
319.9245
2025.06
200.3165.7