
Multi-agent Reinforcement Learning on TrafficJunction Large (TJ-L)

Reward: -24.6

ST

May 13, 2026
Updated 20d ago

Evaluation Results

Date      Reward
2026.05   -24.6
2026.05   -24.7
2026.05   -24.8
2026.05   -24.8
2026.05   -24.9
2026.05   -24.9
2026.05   -25
2026.05   -25.1
2026.05   -25.3
2026.05   -25.6
2026.05   -26
2026.05   -26
2026.05   -26.1
2026.05   -26.3
2026.05   -26.4
2026.05   -26.5
2026.05   -26.7
2026.05   -27
2026.05   -27.1
2026.05   -27.6
2026.05   -28.3
2026.05   -28.6
2026.05   -30.1
2026.05   -32.8