Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Sentence Completion on HellaSwag DE
Loading...
59.6
Normalized Log Accuracy
HATified
53.256
54.903
56.55
58.197
Mar 16, 2026
Normalized Log Accuracy
Bytes per Sequence Position
Updated 1mo ago
Evaluation Results
Method
Method
Links
Normalized Log Accuracy
Bytes per Sequence Position
HATified
training=SFT, shots=10...
2026.03
59.6
6.5
Tülu
training=SFT, shots=10...
2026.03
53.5
3.67
Feedback
Search any
task
Search any
task