Share your thoughts, 1 month free Claude Pro on usSee more

Common Sense Reasoning on HellaSwag (dev)

95.4Accuracy

DeBERTa-ASA

Updated 3mo ago

Evaluation Results

Method	Links
DeBERTa-ASA 2022.06		95.4
DeBERTa (large) 2022.06		94.3
Megatron-NLG 2021.12		82.4
GPT-3 2021.12		79.3
Gopher 2021.12		79.2
GPT-3 2021.12		78.9
GPT-3 2021.12		78.1
GLaM 2021.12		77.2
GLaM 2021.12		76.8
GLaM 2021.12		76.6
Mamba2 2026.03		58.58
FusionGated-FIRNet 2026.03		58.47
Gated DeltaNet 2026.03		57.55
AdaMulti-PathGateNet 2026.03		57.17
Content-SharpRouter 2026.03		57
PathGate-FusionNet 2026.03		56.99
Hier-GateNet 2026.03		56.85
DeltaNet 2026.03		56.29
BERT-ASA 2022.06		40.8
BERT 2022.06		39.7