64K/64K serving scenario

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Max token throughput | 64K/64K serving scenario | 8xH100 node 1.0, Max Throughput (K tok/s): 9.3 |