Long-Context Generation on Qwen3 (Context Length 20K)
[Chart: Throughput Speedup (α). Leading entry: SpecPV, 3.73, Dec 2, 2025. Selectable metrics: Throughput Speedup (α), Draft Acceptance Length (τ).]
Updated 4d ago
Evaluation Results
| Method | Configuration | Date | Throughput Speedup (α) | Draft Acceptance Length (τ) |
|---|---|---|---|---|
| SpecPV | Model Size=14B, Partia... | 2025.12 | 3.73 | 3.08 |
| SpecPV | Model Size=4B, Partial... | 2025.12 | 3.69 | 3.31 |
| SpecPV | Model Size=14B, Partia... | 2025.12 | 3.68 | 3.14 |
| SpecPV | Model Size=8B, Partial... | 2025.12 | 3.58 | 3.05 |
| SpecPV | Model Size=4B, Partial... | 2025.12 | 3.5 | 3.3 |
| SpecPV | Model Size=8B, Partial... | 2025.12 | 3.47 | 3.06 |
| SpecPV | Model Size=4B, Partial... | 2025.12 | 3.4 | 3.11 |
| SpecPV | Model Size=14B, Partia... | 2025.12 | 3.35 | 3.11 |
| SpecPV | Model Size=8B, Partial... | 2025.12 | 3.33 | 3.19 |
| EAGLE3-YARN | Model Size=14B | 2025.12 | 2.74 | 3.3 |
| EAGLE3-YARN | Model Size=8B | 2025.12 | 2.65 | 3.25 |
| EAGLE3-YARN | Model Size=4B | 2025.12 | 2.59 | 3.28 |