Share your thoughts, 1 month free Claude Pro on usSee more

Multi-task Language Understanding on MMLU (0-shot)

69.11Exact Match (EM)

VAR

Updated 4mo ago

Evaluation Results

Method	Links
VAR 2025.02		69.11
DPO 2025.02		68.64
ALoL 2025.02		68.62
Base 2025.02		67.13
VAR 2025.02		38.57
Base 2025.02		37.46
ALoL 2025.02		35.78
DPO 2025.02		32.45