Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Terminal Task Execution on Terminal-Bench 1.0 (test)
Loading...
34.9
Avg Pass Rate
SkillFlow-specific
18.156
22.503
26.85
31.197
Apr 8, 2025
May 28, 2025
Jul 18, 2025
Sep 7, 2025
Oct 27, 2025
Dec 17, 2025
Feb 6, 2026
Avg Pass Rate
Avg Steps per Task
Avg Cost per Task ($)
Updated 2mo ago
Evaluation Results
Method
Method
Links
Avg Pass Rate
Avg Steps per Task
Avg Cost per Task ($)
SkillFlow-specific
Skillset=SkillFlow-spe...
2025.04
34.9
24.2
0.035
No Skills
Skillset=No Skills
2025.04
34.8
21.1
0.03
Vercel
Skillset=Vercel
2025.04
32.6
21.2
0.031
SkillFlow
Skillset=SkillFlow
2025.04
31.8
23
0.034
Reptile
Base Model=Devstral-25...
2026.02
18.9
-
-
Terminus 2
Base Model=LiteCoder-3...
2026.02
18.8
-
-
Feedback
Search any
task
Search any
task