Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Reinforcement Learning on Walker2D v5

6,335.5Average Return

TD3+DBC(*)

726.262,182.5053,638.755,094.995Feb 5, 2026Feb 6, 2026Feb 7, 2026Feb 8, 2026Feb 9, 2026Feb 10, 2026
Updated 4d ago

Evaluation Results

MethodLinks
2026.02
6,335.5
2026.02
6,138.2
2026.02
5,802.6
2026.02
5,448.1
2026.02
5,093.7
2026.02
4,986.3
2026.02
4,854.4
2026.02
4,766.5
2026.02
4,417
2026.02
4,385
2026.02
4,295
2026.02
4,050
2026.02
4,050
2026.02
4,045
2026.02
3,981
2026.02
3,950
2026.02
3,925
2026.02
3,925
2026.02
3,899
2026.02
3,893
2026.02
3,884
2026.02
3,814
2026.02
3,766
2026.02
3,708
2026.02
3,640
2026.02
3,632
2026.02
3,603
2026.02
3,600
2026.02
3,580
2026.02
3,424
2026.02
3,297.4
2026.02
3,243
2026.02
3,132
2026.02
2,830
2026.02
2,663
2026.02
2,626.4
2026.02
2,533.9
2026.02
2,210
2026.02
2,181
2026.02
1,351
2026.02
1,124
2026.02
1,105
2026.02
942