Share your thoughts, 1 month free Claude Pro on usSee more

Knowledge-intensive reasoning on MMLU-CF first 1,000 samples (test)

74.2Exact Match Accuracy

MGRS

Updated 4mo ago

Evaluation Results

Method	Links
MGRS 2025.11		74.2
CoT-SC 2025.11		71.1
AoT 2025.11		70.9
FoT 2025.11		70.6
Self-refine 2025.11		69.7
CoT 2025.11		69.6
AFlow 2025.11		69.5