Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Inference Latency on WikiText-103, SQuAD v2, and OpenBookQA

57.9Inference Latency (ms)

LLMCache

48.024114.687181.35248.013Dec 18, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
2025.12
57.9
2025.12
91.3
2025.12
112.5
2025.12
123.4
2025.12
177.3
2025.12
218.6
2025.12
304.8