Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

REF-q

Benchmarks

Task NameDataset NameSOTA ResultTrend
Multi-Agent Reinforcement LearningREF-q rac-dist (test)
Mean Episodic Reward (q=2)125
12
Showing 1 of 1 rows