Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on puzzle-4x4-play OGBench 5 tasks v0

78Average Success Rate

MAC

-3.1217.943960.06Oct 9, 2025Oct 19, 2025Oct 29, 2025Nov 8, 2025Nov 18, 2025Nov 28, 2025Dec 8, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
78-----
2025.10
344,7001,7003,8003,4003,200
2025.12
29-----
2025.10
283,30005,8002,2002,900
2025.10
27-----
2025.10
27-----
2025.10
25-----
2025.10
20-----
2025.12
17-----
2025.10
17-----
2025.12
14-----
2025.10
14-----
2025.10
13-----
2025.12
7-----
2025.10
7-----
2025.10
5600400600600600
2025.10
11001001001000
2025.12
0-----
2025.12
0-----
2025.12
0-----
2025.12
0-----
2025.10
0100001000
2025.10
010001001000
2025.10
000000
2025.10
010001000100
2025.10
00100000
2025.10
0-----
2025.10
0-----