Multi-turn Dialogue on ACEBench En
[Figure: MT Accuracy chart. Top entry: GPT-4o-2024-11-20 at 68 (Aug 18, 2025).]
Evaluation Results
Method                                    Date      MT Accuracy
GPT-4o-2024-11-20                         2025.08   68
Llama3.1-70B-Inst                         2025.08   61
ToolACE-MT                                2025.08   51
Multi-Agent Simulation                    2025.08   48
ToolACE-MT (Ablation=Without Offli...)    2025.08   44
ToolACE-MT (Ablation=Without Itera...)    2025.08   34
Llama3.1-8B-Inst                          2025.08   24