Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Offline Reinforcement Learning on cube-double-play OGBench 5 tasks v0

69Average Success Rate

Value Flows

-2.7615.8734.553.13Oct 9, 2025Oct 19, 2025Oct 29, 2025Nov 8, 2025Nov 18, 2025Nov 28, 2025Dec 8, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.10
69
2025.10
61
2025.12
53
2025.10
42
2025.12
29
2025.10
29
2025.12
15
2025.10
15
2025.10
14
2025.12
12
2025.10
12
2025.12
7
2025.10
6
2025.12
3
2025.10
2
2025.10
2
2025.12
1
2025.12
1
2025.12
0