Share your thoughts, 1 month free Claude Pro on usSee more

NP-hard Graph Reasoning on Small-scale NP-hard graph problems Average

86.7Accuracy

NPG-Muse-7B

Updated 1mo ago

Evaluation Results

Method	Links
NPG-Muse-7B 2025.08		86.7	98.1
QwQ-32B 2025.08		85.3	98.7
NPG-Muse-8B 2025.08		79.6	98.7
LLaMA-8B-GT 2025.08		60.1	-
Claude-3.5-sonnet 2025.08		48.3	94.1
GPT-4o 2025.08		46.4	92.7
G1-7B 2025.08		40	-
S1.1-7B 2025.08		34.3	72.6
LLaMA3-70B-Ins 2025.08		32.5	80.3
Qwen2.5-7B-Ins-1M 2025.08		25.7	65.2
Qwen3-8B-Base 2025.08		23.5	61.1
GraphWiz-7B-DPO 2025.08		1.5	19.2