Cooperative Multi-Agent Reinforcement Learning on Heterogeneous Cloud Scheduling 40 scenarios (test)

Best Checkpoint Reward: -40.5

DG-PG

[Chart: Best Checkpoint Reward, y-axis from -85.948 to -50.551, Feb 23, 2026]
Updated 4d ago

Evaluation Results

Date       Best Checkpoint Reward
2026.02    -40.5
2026.02    -40.7
2026.02    -41
2026.02    -41.1
2026.02    -41.5
2026.02    -41.7
2026.02    -41.9
2026.02    -43.5
2026.02    -43.7
2026.02    -44.9
2026.02    -46.3
2026.02    -46.4
2026.02    -60
2026.02    -60.5
2026.02    -66.8
2026.02    -78.7
2026.02    -80
2026.02    -84.2