Inference Efficiency on 30k Context Length Llama-2-7B
[Chart: Inference Throughput (QPS) by method, Mar 11, 2025. Highest bar: Finetuning at 6.6 QPS. Toggles for Setup Time (sec), Setup Time (Relative), and Inference Latency (Relative) views.]
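As a minimal sketch of how a throughput number like the chart's QPS (queries per second) values can be obtained: run a fixed number of queries back to back and divide the count by the elapsed wall-clock time. Here `run_inference` is a hypothetical stand-in for the benchmarked model call, not part of this leaderboard's harness.

```python
import time

def measure_qps(run_inference, num_queries=100):
    """Run `num_queries` sequential queries and report queries per second."""
    start = time.perf_counter()
    for _ in range(num_queries):
        run_inference()
    elapsed = time.perf_counter() - start
    return num_queries / elapsed

# Example with a dummy workload standing in for a model call:
qps = measure_qps(lambda: sum(range(1000)), num_queries=50)
```

Real harnesses typically also warm up the model first and report a steady-state average over several runs.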
Evaluation Results
| Method | Links | Inference Throughput (QPS) | Setup Time (sec) | Setup Time (Relative) | Inference Latency (Relative) |
|---|---|---|---|---|---|
| Finetuning (Model=Llama-2-7B, GPU=...) | 2025.03 | 6.6 | - | - | - |
| DBSA (Model=Llama-2-7B, GPU=...) | 2025.03 | 3.4 | - | - | - |
| Fixed ICL (Caching Strategy=cache...) | 2025.03 | 1.5 | - | - | - |
| RetICL (Caching Strategy=no ca...) | 2025.03 | 0.8 | - | - | - |
| RetICL (Model=Llama-2-7B, Hard...) | 2025.03 | - | - | 1 | 1 |
| Fixed ICL (Model=Llama-2-7B, Hard...) | 2025.03 | - | - | 4.5 | 0.51 |
| Finetuning (Model=Llama-2-7B, Hard...) | 2025.03 | - | - | - | 0.12 |
| DBSA (Model=Llama-2-7B, Hard...) | 2025.03 | - | - | 3 | 0.22 |
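The "Relative" columns can be read as measurements normalized against a baseline run. A sketch of that normalization, under the assumption (not stated on the page) that RetICL, whose relative setup time and latency are both 1, is the baseline; the absolute setup times below are hypothetical numbers chosen only to illustrate the arithmetic:

```python
def to_relative(absolute, baseline):
    """Normalize an absolute measurement against the baseline run's value."""
    return absolute / baseline

# Hypothetical absolute setup times (seconds), not from the leaderboard:
baseline_setup = 10.0    # assumed RetICL setup time
fixed_icl_setup = 45.0   # assumed Fixed ICL setup time

relative = to_relative(fixed_icl_setup, baseline_setup)
print(relative)  # 4.5, the ratio reported for Fixed ICL in the table
```

Under this reading, Fixed ICL takes 4.5x the baseline's setup time but runs inference at 0.51x its latency.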