Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

LLM Inference Efficiency on Long sequence prompts

1.05TTFT (s)

FedRAG

-57.4816337.6067732.6951,127.7833May 25, 2026
Updated 8d ago

Evaluation Results

MethodLinks
2026.05
1.0536.792,462.82267
2026.05
1.1544.492,207.37267
2026.05
1.2931.592,719.49267
2026.05
1.3521.88611.33140
2026.05
7.7213.543,037.731,647
2026.05
1211.854,238.544,219
2026.05
12.519.175,042.985,267
2026.05
13.1510.024,639.354,743
2026.05
36.4218.0614,331.91,060
2026.05
39.136.0117,274.699,530
2026.05
52.8226.2221,620.41,204
2026.05
55.414.8324,644.6714,246
2026.05
58.0631.4423,159.131,156
2026.05
60.155.526,202.2512,674
2026.05
76.6511.2342,475.82,603
2026.05
77.5112.7441,963.672,603
2026.05
77.5311.7542,196.112,603
2026.05
77.9110.2440,677.892,603
2026.05
79.6524.7532,134.341,252
2026.05
80.994.2535,278.6715,818
2026.05
966.511.16483,843.4231,203
2026.05
1,243.090.96587,393.7441,571
2026.05
1,341.590.86640,416.5946,755
2026.05
1,464.343.39673,616.1712,839