Inference Efficiency on 30k Context Length (Llama-3.1-8B)
[Chart: Inference Throughput (QPS) vs. Setup Time (sec) per method; best throughput 15.8 QPS. Mar 11, 2025.]
Evaluation Results
| Method | Configuration | Date | Inference Throughput (QPS) | Setup Time (sec) |
|---|---|---|---|---|
| Finetuning | Model=Llama-3.1-8B, GP... | 2025.03 | 15.8 | - |
| DBSA | Model=Llama-3.1-8B, GP... | 2025.03 | 12.8 | - |
| Fixed ICL | Caching Strategy=cache... | 2025.03 | 11.6 | - |
| RetICL | Caching Strategy=no ca... | 2025.03 | 1.3 | - |
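Throughput in the table is reported as queries per second (QPS). As a minimal sketch of how such a figure can be measured (the `measure_qps` helper and its signature are illustrative assumptions, not the benchmark's actual harness):

```python
import time

def measure_qps(run_query, queries):
    """Time a batch of queries and return throughput in queries per second."""
    start = time.perf_counter()
    for q in queries:
        run_query(q)  # run_query is any callable that processes one query
    elapsed = time.perf_counter() - start
    return len(queries) / elapsed

# Example with a stand-in query function:
qps = measure_qps(lambda q: sum(range(1000)), list(range(50)))
```

In practice a benchmark harness would also discard warm-up iterations and average over several runs to reduce timing noise.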