
BERT, GPT-2, and OPT Inference Workload

Benchmarks

Task Name | Dataset Name | SOTA Result | Trend
Inference Throughput | BERT, GPT-2, and OPT Inference Workload BS=2, SL=256 | Original Throughput (tokens/s): 57,117.4 | 6