Share your thoughts, 1 month free Claude Pro on usSee more
WorkDL logo mark

Inference Efficiency on 1x V100 (16GB) (synthetic)

28,298Throughput (tokens/s)

SRM

-472.566,996.7214,46621,935.28May 9, 2026
Updated 22d ago

Evaluation Results

MethodLinks
2026.05
28,29864,00014.75
2026.05
28,09164,0009.66
2026.05
27,44532,00043.29
2026.05
27,44132,00024.59
2026.05
2,908400-
2026.05
1,918200-
2026.05
1,116100-
2026.05
63450-