Downstream Performance Prediction on HellaSwag
All token loss: 0.0032 MSE
[Chart: MSE by date (Jun 16, 2025)]
Updated 1mo ago
Evaluation Results

Method            Loss type        Date     MSE
All token loss    All token        2025.06  0.0032
CSV               CSV (Capabil...  2025.06  0.0045
Label token loss  Label token      2025.06  0.0333