
MPE PP

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Multi-agent Offline Reinforcement Learning | MPE PP (Expert) | Score: 125.6 | 16
Multi-agent Offline Reinforcement Learning | MPE PP Medium | Score: 86.3 | 16
Multi-agent Offline Reinforcement Learning | MPE PP (Medium-replay) | Score: 87.3 | 16
Multi-agent Offline Reinforcement Learning | MPE PP (Random) | Score: 92.8 | 16