Inference Efficiency on Alpaca

Latency (ms/tok): 102.44

Baseline

[Chart: Latency (ms/tok) by date; x-axis label: Aug 14, 2025]
Updated 1mo ago

Evaluation Results

Date      Method   Latency (ms/tok)   (unlabeled)   (unlabeled)   Links
2025.08   -        102.44             9.76          16.1          -
2025.08   -        110.59             9.04          16.12         -
2025.08   -        115.72             8.64          16.26         -
2025.08   -        120.35             8.38          16.29         -