Multi-Agent Reinforcement Learning on SMAC v2 (test)

Win Rate (Protoss 5 Units): 84

HPN-QMIX

Updated 15d ago

Evaluation Results

Method | Links
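2026.02 | 84 | 82 | 78 | -
2026.02 | 84 | 70 | 79 | -
2026.02 | 81 | 83 | 81 | -
2025.09 | 74 | - | 48 | -
        | 73 | - | 44 | -
        | 70 | - | 48 | -
2026.02 | 69 | 75 | 73 | -
2025.09 | 67 | - | 44 | -
        | 65 | - | 45 | -
2025.09 | 64 | - | 44 | -
2026.02 | 63 | 64 | 64 | -
2025.09 | 63 | - | 43 | -
2025.09 | 61 | - | 44 | -
2025.09 | 61 | - | 43 | -
2025.09 | 60 | - | 39 | -
2026.02 | 58 | 59 | 34 | -
2026.05 | 57.96 | 67.87 | 42.18 | -
2026.05 | 56.4 | 64.77 | 40.06 | -
2026.02 | 54 | 54 | 32 | -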
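2026.05 | 51.93 | 63.8 | 38.59 | -
2026.05 | 48.44 | 61.69 | 34.79 | -
2026.05 | 48.16 | 61.67 | 35.09 | -
2026.05 | 48.03 | 59.2 | 38.75 | -
2026.02 | 46 | 67 | 62 | -
2026.02 | 38 | 45 | 29 | -
        | 32.8 | 32.3 | 25 | 30
2026.05 | 32.43 | 54.72 | 34.22 | -
2026.03 | 28.5 | 25.1 | 18.8 | 24.1
2026.05 | 27 | 34 | 13.53 | -
2026.05 | 22.56 | 29.68 | 15.47 | -
2026.05 | 16.93 | 18.44 | 6.83 | -
2026.03 | 11.6 | 13.2 | 8 | 10.9
2026.02 | 10 | 20 | 9 | -
2026.03 | 8.1 | 7 | 5 | 6.7
2026.05 | 0.01 | 10.29 | 0.19 | -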