Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

MoE models

Benchmarks

Task NameDataset NameSOTA ResultTrend
Expert Pruning EfficiencyMoE Models
Calibration Time (h)0.22
6
Loss curve fitting across model sizesMoE models (various sizes)
ASMT MAPE0.341
3
Showing 2 of 2 rows