Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Cooperative Multi-Agent Reinforcement Learning on Heterogeneous Cloud Scheduling 40 scenarios (test)
Loading...
-40.5
Best Checkpoint Reward
DG-PG
-85.948
-74.149
-62.35
-50.551
Feb 23, 2026
Best Checkpoint Reward
Updated 4d ago
Evaluation Results
Method
Method
Links
Best Checkpoint Reward
DG-PG
Number of Agents (N)=20
2026.02
-40.5
Best-Fit
Number of Agents (N)=100
2026.02
-40.7
Best-Fit
Number of Agents (N)=200
2026.02
-41
Best-Fit
Number of Agents (N)=50
2026.02
-41.1
DG-PG
Number of Agents (N)=100
2026.02
-41.5
DG-PG
Number of Agents (N)=50
2026.02
-41.7
Best-Fit
Number of Agents (N)=20
2026.02
-41.9
DG-PG
Number of Agents (N)=10
2026.02
-43.5
Best-Fit
Number of Agents (N)=10
2026.02
-43.7
DG-PG
Number of Agents (N)=5
2026.02
-44.9
Best-Fit
Number of Agents (N)=5
2026.02
-46.3
DG-PG
Number of Agents (N)=200
2026.02
-46.4
Random
Number of Agents (N)=200
2026.02
-60
Random
Number of Agents (N)=100
2026.02
-60.5
Random
Number of Agents (N)=50
2026.02
-66.8
Random
Number of Agents (N)=10
2026.02
-78.7
Random
Number of Agents (N)=20
2026.02
-80
Random
Number of Agents (N)=5
2026.02
-84.2
Feedback
Search any
task
Search any
task