Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
General Language Understanding on MMLU (Accuracy, Utility Preservation)
Loading...
76.89
Accuracy
Baseline
37.4844
47.7147
57.945
68.1753
Mar 16, 2026
Mar 19, 2026
Mar 23, 2026
Mar 26, 2026
Mar 30, 2026
Apr 2, 2026
Apr 6, 2026
Accuracy
Utility Preservation
Updated 11d ago
Evaluation Results
Method
Method
Links
Accuracy
Utility Preservation
Baseline
2026.03
76.89
-
SFCoT
2026.03
69.84
90.8
LLADA
TIME (S)=320.5, MEMORY...
2026.04
48
-
DUALDIFFUSION
TIME (S)=82.0, MEMORY...
2026.04
47
-
FASTDLLM
TIME (S)=21.3, MEMORY...
2026.04
39
-
Feedback
Search any
task
Search any
task