Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Time-to-first-token (TTFT) on Llama 8B Instruct 32K input length 3.1

3.4TTFT (s)

TriangleMix + FlexPrefill

3.3163.8834.455.017Jul 29, 2025
Updated 1mo ago

Evaluation Results

MethodLinks
3.4
3.5
3.6
3.6
3.6
2025.07
4.1
2025.07
4.1
4.2
2025.07
5.5