Agentic Coding on TerminalBench

Accuracy: 0.3375

LongCat-Flash-Lite

Updated 3mo ago

Evaluation Results

Method	Links	Accuracy
LongCat-Flash-Lite 2026.01		0.3375
Kimi-Linear-48B-A3B 2026.01		0.2
Gemini 2.5 Flash-Lite 2026.01		0.2
Kimi-Linear-48B-A3B 2026.03		0.2
LongCat-Next 2026.03		0.1875
Qwen3-Next-80B-A3B-Instruct 2026.01		0.1519
Qwen3-Next-80B-A3B-Instruct 2026.03		0.1519