Our new X account is live! Follow @wizwand_team for updates
Home
/
Benchmarks
NP-hard Graph Reasoning on Small-scale NP-hard graph problems Average
Loading...
86.7
Accuracy
NPG-Muse-7B
-1.908
21.096
44.1
67.104
Aug 28, 2025
Accuracy
Feasibility
Updated 4d ago
Evaluation Results
Method
Method
Links
Accuracy
Feasibility
NPG-Muse-7B
Category=NPG-Muse
2025.08
86.7
98.1
QwQ-32B
Category=Reasoning
2025.08
85.3
98.7
NPG-Muse-8B
Category=NPG-Muse
2025.08
79.6
98.7
LLaMA-8B-GT
Category=Graph-oriented
2025.08
60.1
-
Claude-3.5-sonnet
Category=Closed-source
2025.08
48.3
94.1
GPT-4o
Category=Closed-source
2025.08
46.4
92.7
G1-7B
Category=Graph-oriented
2025.08
40
-
S1.1-7B
Category=Reasoning
2025.08
34.3
72.6
LLaMA3-70B-Ins
Category=Non-reasoning
2025.08
32.5
80.3
Qwen2.5-7B-Ins-1M
Category=Non-reasoning
2025.08
25.7
65.2
Qwen3-8B-Base
Category=Non-reasoning
2025.08
23.5
61.1
GraphWiz-7B-DPO
Category=Graph-oriented
2025.08
1.5
19.2
Feedback
Search any
task
Search any
task