Reinforcement Learning on the Room (test)

Average Total Reward per Episode: 128

Episodic-only

[Chart: Average Total Reward per Episode; date label: Dec 5, 2022]
Updated 15d ago

Evaluation Results

Date      Average Total Reward per Episode
2022.12   128
2022.12   123.8
2022.12   117.9
2022.12   116.5
2022.12   109.1
2022.12   92.2
2022.12   91.6
2022.12   90.6
2022.12   86.8
2022.12   85.3
2022.12   84.2
2022.12   82.6
2022.12   76
2022.12   61.6
2022.12   60.4
2022.12   59.2
2022.12   56.8
2022.12   54.6
2022.12   39.4
2022.12   38.2
2022.12   37.6
2022.12   36.8
2022.12   35
2022.12   25.5
2022.12   25
2022.12   23.9
2022.12   23.6
2022.12   22.4
2022.12   17.4
2022.12   15.4