Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Agentic Coding on TerminalBench
Loading...
0.3375
Accuracy
LongCat-Flash-Lite
0.144476
0.194588
0.2447
0.294812
Jan 29, 2026
Feb 7, 2026
Feb 17, 2026
Feb 27, 2026
Mar 9, 2026
Mar 19, 2026
Mar 29, 2026
Accuracy
Updated 19d ago
Evaluation Results
Method
Method
Links
Accuracy
LongCat-Flash-Lite
Architecture=MoE + NE,...
2026.01
0.3375
Kimi-Linear-48B-A3B
Architecture=MoE, # To...
2026.01
0.2
Gemini 2.5 Flash-Lite
2026.01
0.2
Kimi-Linear-48B-A3B
2026.03
0.2
LongCat-Next
2026.03
0.1875
Qwen3-Next-80B-A3B-Instruct
Architecture=MoE, # To...
2026.01
0.1519
Qwen3-Next-80B-A3B-Instruct
2026.03
0.1519
Feedback
Search any
task
Search any
task