Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
LLM Inference on OPT-125M
Loading...
46.71
Latency (ms)
Baseline
40.4124
82.9212
125.43
167.9388
Mar 17, 2026
Latency (ms)
Extra Storage
Latency Overhead (%)
Updated 1mo ago
Evaluation Results
Method
Method
Links
Latency (ms)
Extra Storage
Latency Overhead (%)
Baseline
2026.03
46.71
-
-
RoR
2026.03
52.28
0.23
11.9
RADAR
2026.03
76.2
50
63.1
FaR
2026.03
204.15
2.1
336.9
Feedback
Search any
task
Search any
task