Share your thoughts, 1 month free Claude Pro on us
See more
Home
/
Benchmarks
Generative Recommendation on KuaiRand 1K
Loading...
14.4
Latency (ms)
MTServe
9.232
44.116
79
113.884
Apr 24, 2026
Latency (ms)
Speedup (vs. RE)
GPU Hit Ratio
Total Hit Ratio
Updated 1mo ago
Evaluation Results
Method
Method
Links
Latency (ms)
Speedup (vs. RE)
GPU Hit Ratio
Total Hit Ratio
MTServe
Batch Size=1
2026.04
14.4
1.47
63.52
94.66
GPU-Only
Batch Size=1
2026.04
17.4
1.22
63.52
63.52
Recomp.
Batch Size=1
2026.04
21.2
1
-
-
MTServe
Batch Size=4
2026.04
33.5
2.17
63.92
98.57
GPU-Only
Batch Size=4
2026.04
42.1
1.73
63.92
63.92
MTServe
Batch Size=8
2026.04
47.3
3.04
64.36
98.59
Recomp.
Batch Size=4
2026.04
72.8
1
-
-
GPU-Only
Batch Size=8
2026.04
72.9
1.97
64.36
64.36
Recomp.
Batch Size=8
2026.04
143.6
1
-
-
Feedback
Search any
task
Search any
task