Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Large Language Model Inference on Llama 3.1
Loading...
48.1
Latency (ms)
RoME
47.992
48.721
49.45
50.179
Apr 10, 2026
Latency (ms)
Speedup
Updated 5d ago
Evaluation Results
Method
Method
Links
Latency (ms)
Speedup
RoME
Model=Llama3.1, Parame...
2026.04
48.1
4.7
RoPE
Model=Llama3.1, Parame...
2026.04
50.8
-
Feedback
Search any
task
Search any
task