Inference Latency on WikiText-103, SQuAD v2, and OpenBookQA
[Chart: Inference Latency (ms) over time. Best reported result: 57.9 ms (LLMCache), as of Dec 18, 2025.]
Evaluation Results
| Method   | Model       | Date    | Inference Latency (ms) |
|----------|-------------|---------|------------------------|
| LLMCache | DistilBERT  | 2025.12 | 57.9  |
| LLMCache | BERT-base   | 2025.12 | 91.3  |
| LLMCache | GPT-2 small | 2025.12 | 112.5 |
| NoCache  | DistilBERT  | 2025.12 | 123.4 |
| KV-Cache | GPT-2 small | 2025.12 | 177.3 |
| NoCache  | BERT-base   | 2025.12 | 218.6 |
| NoCache  | GPT-2 small | 2025.12 | 304.8 |
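
The leaderboard does not publish its measurement harness, so the numbers above should be read as wall-clock forward-pass latencies under some fixed protocol. Below is a minimal sketch of how such numbers are typically collected, assuming the Hugging Face transformers library and the `distilbert-base-uncased` checkpoint (matching one table row); the warmup count, run count, and median aggregation are illustrative assumptions, not this benchmark's actual protocol.

```python
import statistics
import time

import torch
from transformers import AutoModel, AutoTokenizer

# Assumption: checkpoint name and harness details are illustrative,
# not the leaderboard's published setup.
MODEL_NAME = "distilbert-base-uncased"

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModel.from_pretrained(MODEL_NAME).eval()


def measure_latency_ms(text: str, warmup: int = 3, runs: int = 20) -> float:
    """Median wall-clock latency of a single forward pass, in milliseconds."""
    inputs = tokenizer(text, return_tensors="pt")
    with torch.no_grad():
        for _ in range(warmup):  # warm up kernels and allocator
            model(**inputs)
        timings = []
        for _ in range(runs):
            start = time.perf_counter()
            model(**inputs)
            timings.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(timings)


if __name__ == "__main__":
    latency = measure_latency_ms("The quick brown fox jumps over the lazy dog.")
    print(f"{latency:.1f} ms")
```

Median (rather than mean) aggregation is a common choice here because one-off scheduler or allocator hiccups would otherwise skew small timing samples.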
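
The NoCache and KV-Cache rows for GPT-2 small correspond to autoregressive decoding without, versus with, reuse of attention key/value states. A minimal sketch of greedy decoding with transformers' KV cache, assuming the `gpt2` checkpoint; this shows the general technique, not this benchmark's implementation.

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("gpt2")
model = AutoModelForCausalLM.from_pretrained("gpt2").eval()

prompt_ids = tokenizer("The capital of France is", return_tensors="pt").input_ids

with torch.no_grad():
    # Prefill: run the full prompt once and keep the attention KV cache.
    out = model(prompt_ids, use_cache=True)
    past = out.past_key_values
    next_id = out.logits[:, -1].argmax(dim=-1, keepdim=True)

    # Decode: each step feeds only the newest token plus the cached
    # keys/values, avoiding recomputation over the whole prefix.
    for _ in range(5):
        out = model(next_id, past_key_values=past, use_cache=True)
        past = out.past_key_values
        next_id = out.logits[:, -1].argmax(dim=-1, keepdim=True)
        print(tokenizer.decode(next_id[0]), end="")
    print()
```

Without the cache (the NoCache rows), each decode step would re-run attention over the entire prefix, which is consistent with the larger latencies reported above.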