Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Constrained LLM Decoding on Qwen2-14B INT8

16.52Inference Time (ms)

Pre³

5.76478.367150.97223.573Jun 4, 2025
Updated 4d ago

Evaluation Results

MethodLinks
2025.06
16.521.52
2025.06
16.77-
2025.06
43.0910.12
2025.06
47.94-
2025.06
49.9912.37
2025.06
57.05-
2025.06
65.512.14
2025.06
74.54-
2025.06
80.3418.55
2025.06
98.64-
2025.06
143.8311.47
2025.06
162.47-
2025.06
232.1818.65
2025.06
285.42-