Multi-subject Knowledge on MMLU

79.7 Accuracy

Baseline

Updated 2mo ago

Evaluation Results

Method	Links	Accuracy
Baseline 2026.05		79.7
ScaleSearch 2026.05		79.4
NVFP4 2026.05		77.7
TCA-Attention 2025.12		74.26
FlexPrefill 2025.12		74.23
Qwen2.5-7B-Instruct 2025.12		74.22
XAttention 2025.12		74.2
MInference 2025.12		74.14
LLaMA3.1-8B-Instruct 2025.12		69.38
XAttention 2025.12		69.21
TCA-Attention 2025.12		69.21
FlexPrefill 2025.12		69.16
MInference 2025.12		69.14
Baseline 2026.05		48
ScaleSearch 2026.05		45.4
NVFP4 2026.05		45.2