Wiki2

Benchmarks

| Task Name | Dataset Name | SOTA Result | Trend |
|---|---|---|---|
| Language Modeling | Wiki2 | PPL: 4.01 | 326 |
| Quantization Distribution Evaluation | Wiki2 (calibration set) | KL Divergence: 0 | 11 |
| Hierarchical Forecasting | Wiki2 | Rollout MSE: 1.61 | 5 |
| Language Modeling | Wiki2 | Delta PPL (Llama2-7B): 0.46 | 4 |