LLM Inference Efficiency on Synthetic LLM Workload Input 8192 Output 4096
[Chart: Latency (s) by date. ShotKV: 162.78 s, Feb 4, 2025. Available metrics: Latency (s), Throughput (T/S).]
Updated 23d ago
Evaluation Results
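| Method | Parameters | Date | Latency (s) | Throughput (T/S) |
|--------|------------|------|-------------|------------------|
| ShotKV | Input Sequence Length=... | 2025.02 | 162.78 | 63.24 |
| FullKV | Input Sequence Length=... | 2025.02 | 183.42 | 55.93 |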
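To put the two reported rows side by side, the relative difference between ShotKV and FullKV can be computed directly from the listed numbers. The snippet below is a minimal sketch: the figures are copied from the results above, while the dictionary layout, the `relative_change` helper, and the choice of FullKV as the baseline are illustrative assumptions, not part of the benchmark page.

```python
# Latency and throughput values copied from the results listed above.
# The names and structure here are hypothetical, chosen for illustration only.
RESULTS = {
    "ShotKV": {"latency_s": 162.78, "throughput": 63.24},
    "FullKV": {"latency_s": 183.42, "throughput": 55.93},
}


def relative_change(new: float, baseline: float) -> float:
    """Return the fractional change of `new` relative to `baseline`."""
    return (new - baseline) / baseline


# Assumption: FullKV (no cache compression) is treated as the baseline.
shot, full = RESULTS["ShotKV"], RESULTS["FullKV"]
latency_change = relative_change(shot["latency_s"], full["latency_s"])
throughput_change = relative_change(shot["throughput"], full["throughput"])

print(f"Latency change vs FullKV:    {latency_change:+.2%}")
print(f"Throughput change vs FullKV: {throughput_change:+.2%}")
```

With these inputs, ShotKV comes out at roughly 11% lower latency and roughly 13% higher throughput than FullKV on this workload.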