Inference Efficiency on 90k Context Length Llama-3.1-8B
[Chart: benchmark metrics over time for Throughput (queries/s), Setup Time (sec), Setup Time (Relative), and Inference Latency; highlighted point: Finetuning at 8.9 queries/s, Mar 11, 2025]
Evaluation Results
| Method | Configuration | Links | Throughput (queries/s) | Setup Time (sec) | Setup Time (Relative) | Inference Latency |
| --- | --- | --- | --- | --- | --- | --- |
| Finetuning | Model=Llama-3.1-8B, GP... | 2025.03 | 8.9 | - | - | - |
| DBSA | Model=Llama-3.1-8B, GP... | 2025.03 | 7.7 | - | - | - |
| Fixed ICL | Caching Strategy=cache... | 2025.03 | 6.8 | - | - | - |
| RetICL | Caching Strategy=no ca... | 2025.03 | 0.4 | - | - | - |
| RetICL | Model=Llama-3.1-8B, Ha... | 2025.03 | - | - | 1 | 1 |
| Fixed ICL | Model=Llama-3.1-8B, Ha... | 2025.03 | - | - | 6.5 | 0.06 |
| Finetuning | Model=Llama-3.1-8B, Ha... | 2025.03 | - | - | - | 0.046 |
| DBSA | Model=Llama-3.1-8B, Ha... | 2025.03 | - | - | 4 | 0.053 |
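For a quick comparison of the throughput rows above, the entries can be normalized against the slowest method. This is a minimal sketch using only the Throughput (queries/s) values copied from the table; the choice of RetICL (no caching) as the baseline is ours, not the leaderboard's.

```python
# Throughput (queries/s) values copied from the evaluation table above.
throughput = {
    "Finetuning": 8.9,
    "DBSA": 7.7,
    "Fixed ICL": 6.8,
    "RetICL": 0.4,
}

# Express each method's throughput relative to the slowest entry
# (RetICL with no caching), rounded for readability.
baseline = throughput["RetICL"]
relative = {method: round(qps / baseline, 2) for method, qps in throughput.items()}
print(relative)  # e.g. Finetuning runs at 22.25x the RetICL throughput
```

By this normalization, Finetuning, DBSA, and Fixed ICL are all more than an order of magnitude faster than retrieval without caching on this 90k-context workload.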