Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
Maximum Clique on MCP Small-scale
Loading...
96.2
Accuracy
QwQ-32B
-2.184
23.358
48.9
74.442
Aug 28, 2025
Accuracy
Feasibility
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Feasibility
QwQ-32B
Category=Reasoning
2025.08
96.2
97
NPG-Muse-7B
Category=NPG-Muse
2025.08
96
97.8
LLaMA-8B-GT
Category=Graph-oriented
2025.08
95.2
-
NPG-Muse-8B
Category=NPG-Muse
2025.08
93.6
97.8
GPT-4o
Category=Closed-source
2025.08
62.4
79.6
Claude-3.5-sonnet
Category=Closed-source
2025.08
62.2
88
S1.1-7B
Category=Reasoning
2025.08
46.2
80.4
LLaMA3-70B-Ins
Category=Non-reasoning
2025.08
42.8
44.2
Qwen2.5-7B-Ins-1M
Category=Non-reasoning
2025.08
34
60.2
G1-7B
Category=Graph-oriented
2025.08
30
-
Qwen3-8B-Base
Category=Non-reasoning
2025.08
22.4
52.8
GraphWiz-7B-DPO
Category=Graph-oriented
2025.08
1.6
39.6
Feedback
Search any
task
Search any
task