Speculative Decoding Inference on PubMedQA

Throughput (tokens/s): 182.24

EvoSpec

[Chart: throughput (tokens/s) over time; y-axis ticks 153.1512, 160.7031, 168.255, 175.8069; x-axis date Apr 17, 2026]
Updated 6d ago

Evaluation Results

Date       Throughput (tokens/s)
2026.04    182.24
2026.04    181.31
2026.04    180.98
2026.04    178.11
2026.04    171.53
2026.04    169.98
2026.04    168.37
2026.04    160.56
2026.04    160.41
2026.04    157.4
2026.04    154.99
2026.04    154.27