Share your thoughts, 1 month free Claude Pro on usSee more

Theorem-based Question Answering on TheoremQA

86.32Accuracy

Meta-reasoner

Updated 2mo ago

Evaluation Results

Method	Links
Meta-reasoner 2025.02		86.32
HiAR-ICL 2025.02		84.41
Meta-reasoner 2025.02		84.13
HiAR-ICL 2025.02		83.48
Evo-Prompt 2025.02		81.28
Evo-Prompt 2025.02		80.32
Ministral-14B-Instruct-2512 2026.01		56.13
Qwen3-14B 2026.01		55.88
LLaDA-2.0-Mini 2026.01		51.5
Ling-Mini 2026.01		50.38
Qwen3-30B-A3B-Instruct-2507 2026.01		50.12
LLaDA-2.0-Mini 2026.01		47
LLaDA-2.0-Mini 2026.01		45.88