Share your thoughts, 1 month free Claude Pro on usSee more

Multi-turn Dialogue on ACEBench En

68MT Accuracy

GPT-4o-2024-11-20

Updated 4mo ago

Evaluation Results

Method	Links
GPT-4o-2024-11-20 2025.08		68
Llama3.1-70B-Inst 2025.08		61
ToolACE-MT 2025.08		51
Multi-Agent Simulation 2025.08		48
ToolACE-MT 2025.08		44
ToolACE-MT 2025.08		34
Llama3.1-8B-Inst 2025.08		24