Common Sense Reasoning on HellaSwag (acc_n)
[Chart: Accuracy (acc_n) over time. Top result: Mamba-2, 37.7, Apr 8, 2026.]
Evaluation Results
| Method | Details | Date | Accuracy (acc_n) |
|---|---|---|---|
| Mamba-2 | Model Scale=440M, Eval... | 2026.04 | 37.7 |
| Mamba-2 + PoST | Model Scale=440M, Eval... | 2026.04 | 37.5 |
| RWKV-7 | Model Scale=180M, Eval... | 2026.04 | 32.1 |
| RWKV-7 + PoST | Model Scale=180M, Eval... | 2026.04 | 32.1 |
| Gated DeltaNet | Model Scale=180M, Eval... | 2026.04 | 31.9 |
| Gated DeltaNet + PoST | Model Scale=180M, Eval... | 2026.04 | 31.5 |
| Mamba-2 + PoST | Model Scale=180M, Eval... | 2026.04 | 31.3 |
| Mamba-2 | Model Scale=180M, Eval... | 2026.04 | 31.1 |