
Language Understanding on MMLU Redux (test)

Accuracy: 66.9

Ours (theory-guided context selection strategy)

Updated 4mo ago

Evaluation Results

Method	Date	Links	Accuracy
Ours (theory-guided context selection strategy)	2026.02		66.9
ReMem	2026.02		66.8
ExpRAG	2026.02		66.6
DC	2026.02		66.5
BM25	2026.02		66
Zero	2026.02		65.8
Ours (theory-guided context selection strategy)	2026.02		65
ReMem	2026.02		64.9
ExpRAG	2026.02		64.7
DC	2026.02		64.6
BM25	2026.02		64.1
Zero	2026.02		63.8