
Speculative Decoding on Pre-training dataset

Speculative Accept %: 0.658

Full Knowledge Distillation

[Chart: Speculative Accept % over time; y-axis ticks 0.64552, 0.64876, 0.652, 0.65524; x-axis label Mar 21, 2025]
Updated 1mo ago

Evaluation Results

Date      Speculative Accept %   Method   Links
2025.03   0.658
2025.03   0.657
2025.03   0.646