Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Decision Inference on SMAC
Loading...
76.4
Accuracy
Ours
44.264
52.607
60.95
69.293
Feb 18, 2025
Accuracy
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Ours
2025.02
76.4
ReFT
Framework=Reasoning
2025.02
72.2
KTO
Framework=RLHF
2025.02
72.1
DPO
Framework=RLHF
2025.02
71.3
Skywork
Framework=RLAIF
2025.02
69.2
DeepSeek
2025.02
65.8
PPO
Framework=RLHF
2025.02
65.3
CoT+SFT
Framework=Reasoning
2025.02
64.2
Proxy LLM
2025.02
64
SFT
2025.02
58.2
o3-mini
2025.02
45.5
Feedback
Search any
task
Search any
task