Share your thoughts, 1 month free Claude Pro on usSee more

Theorem Question Answering on TheoremQA standard (test)

56Accuracy

Ours (theory-guided context selection strategy)

Updated 4mo ago

Evaluation Results

Method	Links
Ours (theory-guided context selection strategy) 2026.02		56
ExpRAG 2026.02		55.6
ReMem 2026.02		55.6
DC 2026.02		55.4
BM25 2026.02		54.8
Zero 2026.02		54.6
Ours (theory-guided context selection strategy) 2026.02		28
ReMem 2026.02		27.7
DC 2026.02		27.6
ExpRAG 2026.02		27.6
BM25 2026.02		27
Zero 2026.02		26.8