LLM Generation Efficiency on Qwen 2.5 14B (2048 input + 256 generation tokens)

End-to-end Latency (s): 8.7

GRIFFIN

Updated 4d ago

Evaluation Results

Method	Links	End-to-end Latency (s)	(unlabeled)
GRIFFIN 2025.05		8.7	34
Caprese 2025.05		8.7	34
LoRA 2025.05		9.5	37
Full 2025.05		10.5	41
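
The latency column can be turned into relative speedups with a few lines of arithmetic. The sketch below is illustrative only: it copies the four end-to-end latency values from the table and assumes that "Full" is the reference to compare against, which the page itself does not state. The names `latency_s` and `speedups` are hypothetical helpers, not part of any benchmark tooling.

```python
# End-to-end latency in seconds, copied from the table above.
latency_s = {"GRIFFIN": 8.7, "Caprese": 8.7, "LoRA": 9.5, "Full": 10.5}


def speedups(latencies, baseline="Full"):
    """Return baseline latency divided by each method's latency.

    Treating "Full" as the baseline is an assumption made for illustration.
    """
    base = latencies[baseline]
    return {method: base / value for method, value in latencies.items()}


if __name__ == "__main__":
    for method, factor in speedups(latency_s).items():
        print(f"{method}: {factor:.2f}x")
```

Under that assumption, GRIFFIN and Caprese each come out roughly 1.2x faster than Full, and LoRA roughly 1.1x.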