Share your thoughts, 1 month free Claude Pro on usSee more

Knowledge Preservation and Reasoning on MMLU

61.46MMLU Score

Base Model (Llama3.2-3B)

Updated 4mo ago

Evaluation Results

Method	Links
Base Model (Llama3.2-3B) 2026.01		61.46
DUET 2026.01		61.45
GA (DQA_f) + KL (Dr) 2026.01		60.62
NPO (DQA_f) + KL (Dr) 2026.01		60.55
NPO (DQA_f) 2026.01		60.48
Refusal-Training 2026.01		60.48
SimNPO 2026.01		60.4
GA + KL (Dr) 2026.01		60.18
NPO + KL (Dr) 2026.01		59.47
FLAT 2026.01		58.92
NPO 2026.01		54.79
GA (DQA_f) 2026.01		36.45
GA 2026.01		24.87