Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
End-to-end inference tuning on LLaMA
Loading...
29.5
Tuning Time (s)
STOF
-2.8
215.225
433.25
651.275
Jun 6, 2025
Tuning Time (s)
Updated 14d ago
Evaluation Results
Method
Method
Links
Tuning Time (s)
STOF
Input Size (Batch Size...
2025.06
29.5
STOF
Input Size (Batch Size...
2025.06
43.6
MCFuser
Input Size (Batch Size...
2025.06
48.8
Bolt
Input Size (Batch Size...
2025.06
52.1
MCFuser
Input Size (Batch Size...
2025.06
110.8
Bolt
Input Size (Batch Size...
2025.06
124.6
STOF
Input Size (Batch Size...
2025.06
264.6
MCFuser
Input Size (Batch Size...
2025.06
820.6
Bolt
Input Size (Batch Size...
2025.06
837
Feedback
Search any
task
Search any
task