Share your thoughts, 1 month free Claude Pro on usSee more

Logical Reasoning on Rule-chaining

84Accuracy

OVM

Updated 2mo ago

Evaluation Results

Method	Links
OVM 2026.05		84
Self-Consistency 2026.05		78
PT-SFT 2026.05		77
HyperGuide 2026.05		77
Self-Consistency 2026.05		74.5
HyperGuide 2026.05		74
SoftCoT 2026.05		73.5
PT-SFT 2026.05		72
PT-SFT 2026.05		72
OVM 2026.05		71.5
OVM 2026.05		70
HyperGuide 2026.05		70
SoftCoT 2026.05		67.5
SoftCoT 2026.05		63.5
Few-shot 2026.05		54
Few-shot 2026.05		53
Tree of Thoughts 2026.05		52
Self-Consistency 2026.05		52
Tree of Thoughts 2026.05		50
Tree of Thoughts 2026.05		50
Few-shot 2026.05		16.1