Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Agent Toolchain Scheduling on CCAD
Loading...
55
Accuracy
LLMBOOST
40.44
44.22
48
51.78
Dec 26, 2025
Accuracy
Updated 3d ago
Evaluation Results
Method
Method
Links
Accuracy
LLMBOOST
Base Model=Llama-3.1-8...
2025.12
55
VOTE
Base Model=Llama-3.1-8...
2025.12
53
UNITE
Base Model=Llama-3.1-8...
2025.12
52.5
Single
Base Model=Llama-3.1-8...
2025.12
52
LLMBOOST
Base Model=Qwen-2.5-7B...
2025.12
45.5
VOTE
Base Model=Qwen-2.5-7B...
2025.12
43
UNITE
Base Model=Qwen-2.5-7B...
2025.12
41.5
Single
Base Model=Qwen-2.5-7B...
2025.12
41
Feedback
Search any
task
Search any
task