Language Understanding on HellaSwag (test)
[Chart: Accuracy by date (Aug 5, 2025). Highest result: Full Prec, 77.63.]
Updated 1mo ago
Evaluation Results
Method      Date      Accuracy
Full Prec   2025.08   77.63
VLMQ        2025.08   53.19
GPTQ        2025.08   51.29
GPTAQ       2025.08   49.82