CN

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Multi-Agent Reinforcement Learning | CN rac-dist | Mean Episodic Reward: 888 | 21 |
| Multi-Agent Reinforcement Learning | CN rdist | Mean Episodic Reward: -161 | 21 |
| Multi-Agent Reinforcement Learning | CN rdete | Mean Episodic Reward: -154 | 21 |
| Soft Query Answering | CN15k | 1P Score: 16.6 | 6 |
| Backdoor Attack | CN (test) | Runtime (s): 28.3 | 4 |
| Intent Prediction | CN | Accuracy: 55.2 | 4 |
| Function Invocation | CN Ver. Dual | Token Usage: 1,377.9 | 3 |
| Function Invocation | CN (Single) | Invocation Accuracy: 0.89 | 3 |