Our new X account is live! Follow @wizwand_team for updates
WorkDL logo mark

Inference Efficiency on 90k Context Length Llama-3.1-8B

8.9Throughput (queries/s)

Finetuning

0.062.3554.656.945Mar 11, 2025
Updated 3d ago

Evaluation Results

MethodLinks
2025.03
8.9---
2025.03
7.7---
2025.03
6.8---
2025.03
0.4---
2025.03
--11
2025.03
--6.50.06
2025.03
---0.046
2025.03
--40.053