Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Intent Alignment on ToolACE
Loading...
85.71
Aintent (GPT-5.0)
IntentMiner
74.3428
77.2939
80.245
83.1961
Dec 16, 2025
Aintent (GPT-5.0)
Aintent (Claude-4.0)
Aintent (DeepSeek-R1)
Average Aintent
Updated 1mo ago
Evaluation Results
Method
Method
Links
Aintent (GPT-5.0)
Aintent (Claude-4.0)
Aintent (DeepSeek-R1)
Average Aintent
IntentMiner
Reasoner LLM=Gemini-2.5
2025.12
85.71
85.33
85.52
85.52
IntentMiner
Reasoner LLM=Llama-3.1
2025.12
83.99
81.78
84.95
83.57
IntentMiner
Reasoner LLM=GPT-4.1
2025.12
83.13
85.81
83.99
84.31
IntentMiner
Reasoner LLM=DeepSeek-V3
2025.12
82.55
84.66
82.36
83.19
IntentMiner
Reasoner LLM=Claude-3.5
2025.12
76.22
78.33
77.28
77.28
IntentMiner
Reasoner LLM=Qwen3
2025.12
74.78
70.95
73.83
73.19
Feedback
Search any
task
Search any
task